发明名称 NORMALIZING ELECTRONIC COMMUNICATIONS USING NEURAL NETWORKS
摘要 Electronic communications can be normalized using neural networks. For example, an electronic representation of a noncanonical communication can be received. A normalized version of the noncanonical communication can be determined using a normalizer including a neural network. The neural network can receive a single vector at an input layer of the neural network and transform an output of a hidden layer of the neural network into multiple values that sum to a total value of one. Each value of the multiple values can be a number between zero and one and represent a probability of a particular character being in a particular position in the normalized version of the noncanonical communication. The neural network can determine the normalized version of the noncanonical communication based on the multiple values. Whether the normalized version should be output can be determined based on a result from a flagger including another neural network.
申请公布号 US2016350650(A1) 申请公布日期 2016.12.01
申请号 US201514937810 申请日期 2015.11.10
申请人 SAS Institute Inc. ;North Carolina State University 发明人 Leeman-Munk Samuel Paul;Min Wookhee;Mott Bradford Wayne;Lester, II James Curtis;Cox James Allen
分类号 G06N3/08;G06N3/04 主分类号 G06N3/08
代理机构 代理人
主权项 1. A non-transitory computer readable medium comprising program code executable by a processor for causing the processor to: receive an electronic representation of a noncanonical communication; feed a vector that is representative of the noncanonical communication as input to a normalizer, wherein the normalizer is operable for causing a first neural network to: receive the vector at an input layer of the first neural network,perform matrix operations on the vector using a plurality of hidden layers to generate hidden values,provide hidden values from a final hidden layer of the plurality of hidden layers to an output layer of the first neural network,perform, at the output layer, a softmax operation on the hidden values to generate a plurality of values representing probabilities of particular characters being in particular positions in a normalized version of the noncanonical communication, anddetermine the normalized version of the noncanonical communication based on the plurality of values, wherein the normalized version of the noncanonical communication is different from the noncanonical communication; feed the vector as input to a flagger for operating a second neural network that is trained separately from the first neural network; determine, based on a result from the flagger, that the normalized version of the noncanonical communication should be outputted; and output the normalized version of the noncanonical communication or a corrected version of the normalized version of the noncanonical communication, wherein the corrected version of the noncanonical communication is different from the noncanonical communication.
地址 Cary NC US