Title of Invention DOCUMENT SIMILARITY CALCULATION DEVICE
Abstract A document similarity calculation device calculates a similarity indicating the degree to which a plurality of documents resemble one another. The device includes: an associative word group storage portion that stores an associative word group composed of words associated with one another; a word-in-document frequency matrix generation portion that generates a word-in-document frequency matrix, each element of which is the frequency with which a word appears in a document, for each combination of word and document; a word-in-document frequency matrix transformation portion that transforms the generated matrix based on the stored associative word group so as to reduce the number of dimensions of the matrix; and a similarity calculation portion that calculates the similarity based on the transformed matrix.
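The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the matrix is transformed or which similarity measure is used, so the code below makes two assumptions: rows of words belonging to the same associative word group are summed into a single row, and similarity is the cosine of two document columns. All function names, the sample documents, and the sample word group are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def build_frequency_matrix(documents):
    """Return (vocabulary, matrix); matrix[i][j] is the count of vocabulary[i] in documents[j]."""
    counts = [Counter(doc.lower().split()) for doc in documents]
    vocabulary = sorted(set().union(*counts))
    matrix = [[c[word] for c in counts] for word in vocabulary]
    return vocabulary, matrix


def merge_associative_rows(vocabulary, matrix, groups):
    """Reduce the row dimension by summing rows of words in the same group.

    Summation is an assumption; the abstract only says the matrix is
    transformed based on the stored associative word group.
    """
    word_to_label = {}
    for group in groups:
        label = "+".join(sorted(group))
        for word in group:
            word_to_label[word] = label
    merged = {}
    for word, row in zip(vocabulary, matrix):
        label = word_to_label.get(word, word)
        if label in merged:
            merged[label] = [a + b for a, b in zip(merged[label], row)]
        else:
            merged[label] = list(row)
    labels = sorted(merged)
    return labels, [merged[label] for label in labels]


def cosine_similarity(matrix, j, k):
    """Cosine similarity between document columns j and k (an assumed measure)."""
    col_j = [row[j] for row in matrix]
    col_k = [row[k] for row in matrix]
    dot = sum(a * b for a, b in zip(col_j, col_k))
    norm = sqrt(sum(a * a for a in col_j)) * sqrt(sum(b * b for b in col_k))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    docs = ["car engine repair", "automobile engine repair", "cake recipe"]
    groups = [{"car", "automobile"}]  # hypothetical associative word group
    vocab, freq = build_frequency_matrix(docs)
    labels, reduced = merge_associative_rows(vocab, freq, groups)
    print(len(vocab), "rows before,", len(labels), "rows after")
    print("similarity before:", cosine_similarity(freq, 0, 1))
    print("similarity after: ", cosine_similarity(reduced, 0, 1))
```

In this toy example the matrix shrinks from six rows to five, and the similarity between the first two documents rises from about 0.67 to about 1.0 because "car" and "automobile" are counted as the same dimension.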
Publication No. US2012330955(A1) Publication Date 2012.12.27
Application No. US201213472414 Application Date 2012.05.15
Applicant MIURA MITSUGU; NEC CORPORATION Inventor MIURA MITSUGU
Classification No. G06F17/30 Main Classification No. G06F17/30
Agency Agent
Principal Claim
Address