Title: Apparatus and method for building and using inference engines based on representations of data that preserve relationships between objects
Abstract: This disclosure describes, among other things, an apparatus for generating an inference engine about a document. The apparatus includes at least one processor and a memory with instructions that, when executed by the at least one processor, cause the at least one processor to perform a number of processes. The processor accesses a set of documents, each of which has a corresponding inference. The processor also generates a vector representation for each document in the set. First, the processor parses the text of the document into groups of words and generates a vector representation for each group.
Publication No.: US9563847(B2)  Publication date: 2017.02.07
Application No.: US201414295880  Filing date: 2014.06.04
Applicant: MultiModel Research, LLC  Inventor: Gallant Stephen I.
Classification: G06F1/00;G06N5/00;G06N5/04  Main classification: G06F1/00
Agency: Sunstein Kann Murphy & Timbers LLP  Agent: Sunstein Kann Murphy & Timbers LLP
Main claim: 1. An apparatus for generating an inference engine about a document, the apparatus comprising: at least one processor; a memory with instructions, the memory including instructions that, when executed by the at least one processor, cause the at least one processor to: access a set of documents, each document having a corresponding inference; generate a vector representation for each document in the set of documents by (i) parsing text of the document into groups of words; (ii) for each group of words, generating a vector representation of the group by (a) obtaining a binding operator corresponding to a type of the group, the binding operator comprising a matrix of values, (b) obtaining vector representations of the words in the group, (c) summing the vector representations of the words, and (d) multiplying the binding operator and the sum of the vector representations to generate a vector representation of the group; and (iii) summing the vector representations of the groups to generate a vector representation of the document; and train an inference engine on the vector representations of the documents and the corresponding inferences for each of the documents.
Address: Cambridge MA US
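The following is an illustrative sketch of steps (ii) and (iii) of the main claim and of the final training step. It is not taken from the patent. It assumes that documents arrive already parsed into (group type, words) pairs, that word vectors and binding matrices are random Gaussian arrays of a hypothetical dimension `DIM`, and that a least-squares linear model stands in for the inference engine. The names `make_lexicon`, `group_vector`, `document_vector`, and `train_linear_engine` are hypothetical.

```python
import numpy as np

DIM = 64  # hypothetical vector dimension, chosen only for illustration
rng = np.random.default_rng(0)


def make_lexicon(words, dim, rng):
    """Assign each word a random vector (an assumption; any embedding would do)."""
    return {w: rng.standard_normal(dim) for w in words}


def group_vector(words, binding, lexicon):
    """Steps (ii)(b)-(d): sum the word vectors, then multiply by the binding matrix."""
    total = np.sum([lexicon[w] for w in words], axis=0)
    return binding @ total


def document_vector(groups, bindings, lexicon):
    """Step (iii): sum the group vectors; `groups` is a list of (type, words) pairs."""
    return np.sum(
        [group_vector(ws, bindings[t], lexicon) for t, ws in groups], axis=0
    )


def train_linear_engine(doc_vectors, inferences):
    """Stand-in inference engine: least-squares weights mapping vectors to labels."""
    X = np.stack(doc_vectors)
    y = np.asarray(inferences, dtype=float)
    weights, *_ = np.linalg.lstsq(X, y, rcond=None)
    return weights


# One binding matrix per group type (step (ii)(a)); random here by assumption.
bindings = {t: rng.standard_normal((DIM, DIM)) for t in ("subject", "verb", "object")}
lexicon = make_lexicon(["dog", "bit", "man"], DIM, rng)

# Two toy documents with the same words in different roles, and made-up labels.
doc_a = [("subject", ["dog"]), ("verb", ["bit"]), ("object", ["man"])]
doc_b = [("subject", ["man"]), ("verb", ["bit"]), ("object", ["dog"])]
vec_a = document_vector(doc_a, bindings, lexicon)
vec_b = document_vector(doc_b, bindings, lexicon)

weights = train_linear_engine([vec_a, vec_b], [1.0, -1.0])
predictions = np.stack([vec_a, vec_b]) @ weights
```

Because each group type has its own binding matrix, the two toy documents receive different vectors even though they contain the same words. A plain sum of word vectors would not distinguish them, which is the sense in which the representation preserves relationships between objects.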