Title of invention: DOCUMENT CLASSIFICATION DEVICE AND METHOD AND PROGRAM
Abstract: PROBLEM TO BE SOLVED: To provide stable, high classification performance that quickly follows changes in the appearance tendency of words. SOLUTION: A document classification device divides an input document into a set of words and, from the class to which the input document belongs, obtains a prior probability indicating the importance of each class as an exponentially weighted moving average. Based on the prior probability of each class, it stores word appearance information about the appearance of each word in the input document. For each word in the input document, it estimates a long-period appearance probability in each class and, using the stored word appearance information, a short-period appearance probability in each class, then determines from these two probabilities whether the word is a trend word. Based on the determination result, it calculates the posterior probability that the input document belongs to each class and classifies the input document into one or more classes. COPYRIGHT: (C)2013, JPO&INPIT
Publication number: JP2013109584(A) Publication date: 2013.06.06
Application number: JP20110254230 Filing date: 2011.11.21
Applicant: NIPPON TELEGR & TELEPH CORP <NTT> Inventors: NISHIDA KYOSUKE;FUJIMURA TAKASHI;HOSHIIDE TAKAHIDE
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address:
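
The processing steps named in the abstract (exponentially weighted class prior, long-period and short-period word probabilities, trend-word determination, posterior calculation) can be illustrated with a minimal Python sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration: the class name `TrendAwareClassifier`, the parameters `decay`, `trend_ratio` and `smoothing`, the ratio test used to flag a trend word, the additive smoothing, and the naive Bayes form of the posterior. In particular, the abstract says word appearance information is stored on the basis of the class prior but does not say how; the sketch substitutes exponentially decayed per-class word counts.

```python
import math
from collections import defaultdict


class TrendAwareClassifier:
    """Illustrative sketch only; formulas and names are assumptions, not the patent's."""

    def __init__(self, classes, vocab_size, decay=0.2, trend_ratio=2.0, smoothing=1.0):
        self.classes = list(classes)
        self.vocab_size = vocab_size    # assumed known vocabulary size for smoothing
        self.decay = decay              # EWMA weight given to the newest document
        self.trend_ratio = trend_ratio  # assumed threshold on short/long probability
        self.smoothing = smoothing      # additive (Laplace-style) smoothing constant
        self.prior = {c: 1.0 / len(self.classes) for c in self.classes}
        self.long_counts = {c: defaultdict(float) for c in self.classes}
        self.long_total = {c: 0.0 for c in self.classes}
        self.short_counts = {c: defaultdict(float) for c in self.classes}
        self.short_total = {c: 0.0 for c in self.classes}

    def update(self, words, label):
        """Learn from one labeled document given as a list of words."""
        # Class prior as an exponentially weighted moving average of class labels.
        for c in self.classes:
            target = 1.0 if c == label else 0.0
            self.prior[c] = (1.0 - self.decay) * self.prior[c] + self.decay * target
        # Short-period counts: decay the old counts of the labeled class (assumption).
        keep = 1.0 - self.decay
        short = self.short_counts[label]
        for w in short:
            short[w] *= keep
        self.short_total[label] *= keep
        # Long-period counts accumulate without decay.
        for w in words:
            self.long_counts[label][w] += 1.0
            self.long_total[label] += 1.0
            short[w] += 1.0
            self.short_total[label] += 1.0

    def _smoothed(self, counts, total, word):
        return (counts.get(word, 0.0) + self.smoothing) / (
            total + self.smoothing * self.vocab_size)

    def long_prob(self, word, c):
        return self._smoothed(self.long_counts[c], self.long_total[c], word)

    def short_prob(self, word, c):
        return self._smoothed(self.short_counts[c], self.short_total[c], word)

    def is_trend(self, word, c):
        """Assumed test: recent probability exceeds long-run probability by a factor."""
        if self.long_counts[c].get(word, 0.0) == 0.0:
            return False  # never seen in this class; avoid a pure smoothing artifact
        return self.short_prob(word, c) >= self.trend_ratio * self.long_prob(word, c)

    def word_prob(self, word, c):
        # Trend words use the short-period estimate, other words the long-period one.
        return self.short_prob(word, c) if self.is_trend(word, c) else self.long_prob(word, c)

    def posterior(self, words):
        """Naive Bayes posterior over classes, normalized in log space."""
        log_scores = {}
        for c in self.classes:
            score = math.log(self.prior[c])
            for w in words:
                score += math.log(self.word_prob(w, c))
            log_scores[c] = score
        top = max(log_scores.values())
        norm = sum(math.exp(s - top) for s in log_scores.values())
        return {c: math.exp(s - top) / norm for c, s in log_scores.items()}

    def classify(self, words):
        post = self.posterior(words)
        return max(post, key=post.get)
```

The sketch returns a single best class from `classify`; the abstract allows one or more classes, which could be approximated by returning every class whose posterior exceeds a chosen threshold. Tokenization is left to the caller, who passes each document as a list of words.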