发明名称 DOCUMENT CLASSIFICATION DEVICE, DOCUMENT CLASSIFICATION METHOD, PROGRAM, AND RECORDING MEDIUM
摘要 <P>PROBLEM TO BE SOLVED: To improve accuracy of a task for classifying a document into any category in a predetermined category set. Ž<P>SOLUTION: When a category set with which a set of documents corresponding to a category for each category is associated is inputted, a document classification method calculates each document vector of the category for each category as a center of balance of word vectors acquired from a word concept base of words in each document, clusters the document vectors acquired by a document vector acquisition means of each document of the category for each category, acquires a sub-category set in which each cluster of the document vectors acquired as the clustering result is set as a sub-category, and acquires a center of balance of the document vectors, of each document in the sub-category, acquired by the document vector acquisition means as a sub-category vector of the sub-category for each sub-category, of the category, acquired by a document clustering means. Ž<P>COPYRIGHT: (C)2010,JPO&INPIT Ž
申请公布号 JP2010026782(A) 申请公布日期 2010.02.04
申请号 JP20080187335 申请日期 2008.07.18
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 BESSHO KATSUTO;UCHIYAMA TOSHIRO;UCHIYAMA MASASHI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址