Title of invention: APPARATUS FOR CLUSTERING A PLURALITY OF DOCUMENTS
Abstract: According to an aspect, there are provided an apparatus, a program for causing a computer to function as such an apparatus, and a method. The apparatus includes a selection section for selecting a plurality of sample documents from a plurality of documents, and a first parameter generation section for analyzing the sample documents to generate an initial parameter matrix expressing the probability that each of a plurality of words included in the sample documents is included in each of a plurality of topics. The apparatus also includes a second parameter generation section for analyzing the full plurality of documents, using each value in the initial parameter matrix as an initial value, to generate a parameter matrix expressing the probability that each of a plurality of words included in the plurality of documents is included in each of a plurality of topics.
Publication number: US2013212106(A1); Publication date: 2013.08.15
Application number: US201313766889; Filing date: 2013.02.14
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION; Inventor: INAGAKI TAKESHI
Classification: G06F17/30; Main classification: G06F17/30
Agency: ; Agent:
Main claim:
Address:
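
The two-stage procedure in the abstract (fit on a sample, then reuse the result as the starting point for the full collection) can be illustrated with a short sketch. The abstract does not name a topic model or a training algorithm, so the following assumes PLSA trained by EM, a topics-by-words orientation for the parameter matrix, uniform random sampling of documents, and a small smoothing term so that words absent from the sample can still gain probability. The function names `plsa_em` and `cluster_with_sampled_init`, the iteration counts, and the toy data are all hypothetical and are not taken from the patent.

```python
import numpy as np


def plsa_em(counts, n_topics, n_iter, topic_word=None, seed=0):
    """Fit PLSA by EM on a (documents x words) count matrix.

    Returns (topic_word, doc_topic). Each row of topic_word is a probability
    distribution over words. If topic_word is supplied, it is used as the
    initial value (warm start) instead of a random matrix.
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    if topic_word is None:
        topic_word = rng.random((n_topics, n_words)) + 1e-3
    else:
        # Assumption: light smoothing so words unseen in the sample are not
        # stuck at (near) zero probability during EM.
        topic_word = topic_word + 1e-3 / n_words
    topic_word = topic_word / topic_word.sum(axis=1, keepdims=True)
    doc_topic = rng.random((n_docs, n_topics)) + 1e-3
    doc_topic = doc_topic / doc_topic.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # E-step: responsibility of each topic for each (document, word) pair.
        joint = doc_topic[:, :, None] * topic_word[None, :, :]  # (D, K, W)
        resp = joint / (joint.sum(axis=1, keepdims=True) + 1e-12)
        weighted = counts[:, None, :] * resp  # expected counts, (D, K, W)
        # M-step: re-estimate both distributions from the expected counts.
        topic_word = weighted.sum(axis=0) + 1e-12
        topic_word = topic_word / topic_word.sum(axis=1, keepdims=True)
        doc_topic = weighted.sum(axis=2) + 1e-12
        doc_topic = doc_topic / doc_topic.sum(axis=1, keepdims=True)
    return topic_word, doc_topic


def cluster_with_sampled_init(counts, n_topics, n_samples, seed=0):
    """Sample documents, fit an initial matrix, then refine on all documents."""
    rng = np.random.default_rng(seed)
    # Selection step: pick sample documents (uniformly at random here).
    sample_idx = rng.choice(counts.shape[0], size=n_samples, replace=False)
    # First parameter generation: initial matrix from the sample only.
    init_topic_word, _ = plsa_em(counts[sample_idx], n_topics, n_iter=50, seed=seed)
    # Second parameter generation: all documents, starting from that matrix.
    topic_word, doc_topic = plsa_em(
        counts, n_topics, n_iter=20, topic_word=init_topic_word, seed=seed
    )
    # Assign each document to its most probable topic.
    return topic_word, doc_topic.argmax(axis=1)


# Toy corpus: 10 documents over a 6-word vocabulary (made-up counts).
counts = np.array(
    [[3, 2, 4, 0, 0, 0]] * 5 + [[0, 0, 0, 2, 5, 3]] * 5, dtype=float
)
topic_word, labels = cluster_with_sampled_init(counts, n_topics=2, n_samples=4)
print(topic_word.round(3))
print(labels)
```

The intended benefit of this arrangement is that the more expensive pass over the whole collection starts from an informed estimate rather than a random one, so it may need fewer iterations; how large that saving is depends on how representative the sample is.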