发明名称 DEVICE, METHOD AND PROGRAM FOR CLASSIFYING DOCUMENT
摘要 PROBLEM TO BE SOLVED: To perform association between documents having similar contents while reflecting a change of interest of an author, about a lot of documents. SOLUTION: A document classification device 1000 includes: a storage part 2 storing a prescribed calculation expression and a prescribed calculation end condition used in the plurality of documents and a probability distribution model; an initial setting part 3 randomly setting initial values of document classes to which the respective documents belong and topic classes to which words belong; a document class evaluation part 4 estimating the document class to which the document belongs in each document; a topic class evaluation part 5 estimating the topic class to which the word belongs in each word; a convergence decision part 6 making the document class evaluation part 4 and the topic class evaluation part 5 repeat the estimation until satisfying the prescribed calculation end condition; and an output part 7 outputting a calculation result including contents of the document class. COPYRIGHT: (C)2011,JPO&INPIT
申请公布号 JP2010267017(A) 申请公布日期 2010.11.25
申请号 JP20090116899 申请日期 2009.05.13
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 KAWAMAE NORIAKI;YAMADA TAKESHI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址