Title of invention |
DEVICE, METHOD AND PROGRAM FOR CLASSIFYING DOCUMENT |
Abstract |
PROBLEM TO BE SOLVED: To associate documents having similar contents with one another, across a large number of documents, while reflecting changes in an author's interests. SOLUTION: A document classification device 1000 includes: a storage part 2 that stores the plurality of documents together with a prescribed calculation expression and a prescribed calculation end condition used with a probability distribution model; an initial setting part 3 that randomly sets initial values of the document class to which each document belongs and the topic class to which each word belongs; a document class evaluation part 4 that estimates, for each document, the document class to which the document belongs; a topic class evaluation part 5 that estimates, for each word, the topic class to which the word belongs; a convergence decision part 6 that makes the document class evaluation part 4 and the topic class evaluation part 5 repeat their estimation until the prescribed calculation end condition is satisfied; and an output part 7 that outputs a calculation result including the contents of the document classes. COPYRIGHT: (C)2011,JPO&INPIT
|
Publication number |
JP2010267017(A) |
Publication date |
2010.11.25 |
Application number |
JP20090116899 |
Filing date |
2009.05.13 |
Applicant |
NIPPON TELEGR & TELEPH CORP <NTT> |
Inventors |
KAWAMAE NORIAKI;YAMADA TAKESHI |
Classification |
G06F17/30 |
Main classification |
G06F17/30 |
Agency |

Agent |

Main claim |

Address |
|
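The control flow described in the abstract (random initial setting, alternating document class and topic class estimation, and a convergence decision) can be illustrated with a minimal sketch. The abstract does not disclose the calculation expression or the end condition, so everything concrete here is an assumption made for illustration: the function name `classify_documents`, the smoothed count-based probabilities, the hard (argmax) reassignment in place of whatever estimator the patent uses, and the end condition of "no assignment changed or `max_iters` reached". The sketch also does not model changes in an author's interests over time.

```python
import math
import random


def classify_documents(docs, n_doc_classes, n_topics, vocab_size,
                       max_iters=50, smoothing=0.1, seed=0):
    """Alternate hard re-estimation of document classes and word topic classes.

    docs: list of documents, each a list of integer word ids in [0, vocab_size).
    Returns (doc_class, word_topic, iterations_used).
    All modelling choices are illustrative assumptions, not the patented method.
    """
    rng = random.Random(seed)
    # Initial setting: a random class per document and a random topic per word.
    doc_class = [rng.randrange(n_doc_classes) for _ in docs]
    word_topic = [[rng.randrange(n_topics) for _ in doc] for doc in docs]

    iteration = 0
    for iteration in range(1, max_iters + 1):
        # Count tables implied by the assignments at the start of this pass.
        class_topic = [[0] * n_topics for _ in range(n_doc_classes)]
        topic_word = [[0] * vocab_size for _ in range(n_topics)]
        for d, doc in enumerate(docs):
            for w, z in zip(doc, word_topic[d]):
                class_topic[doc_class[d]][z] += 1
                topic_word[z][w] += 1
        class_total = [sum(row) for row in class_topic]
        topic_total = [sum(row) for row in topic_word]

        def log_p_topic(z, c):
            # Smoothed log-probability of topic z within document class c.
            return math.log((class_topic[c][z] + smoothing) /
                            (class_total[c] + smoothing * n_topics))

        def log_p_word(w, z):
            # Smoothed log-probability of word w within topic z.
            return math.log((topic_word[z][w] + smoothing) /
                            (topic_total[z] + smoothing * vocab_size))

        changed = 0

        # Document class evaluation: best class given the document's topics.
        for d in range(len(docs)):
            best = max(range(n_doc_classes),
                       key=lambda c: sum(log_p_topic(z, c)
                                         for z in word_topic[d]))
            if best != doc_class[d]:
                doc_class[d] = best
                changed += 1

        # Topic class evaluation: best topic given the word and document class.
        for d, doc in enumerate(docs):
            c = doc_class[d]
            for i, w in enumerate(doc):
                best = max(range(n_topics),
                           key=lambda z: log_p_topic(z, c) + log_p_word(w, z))
                if best != word_topic[d][i]:
                    word_topic[d][i] = best
                    changed += 1

        # Convergence decision (assumed end condition): nothing changed.
        if changed == 0:
            break

    return doc_class, word_topic, iteration


# Toy corpus of word ids; the data is made up for illustration.
docs = [[0, 1, 0, 1], [1, 0, 1], [2, 3, 3, 2], [3, 2, 2]]
doc_class, word_topic, iters = classify_documents(docs, 2, 2, 4)
print(doc_class, iters)
```

The hard reassignment keeps the sketch deterministic for a fixed `seed`; a sampling-based estimator could be swapped in at the two evaluation steps without changing the surrounding loop.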