发明名称 DEVICE FOR MULTIPLE-CLASSIFYING TEXT, METHOD FOR MULTIPLE-CLASSIFYING TEXT, PROGRAM AND STORAGE MEDIUM
摘要 PROBLEM TO BE SOLVED: To provide a multiple classification device capable of multiple-classifying text even if learning data is not prepared in advance. SOLUTION: First text is decomposed into sentence units; each the decomposed sentence is morpheme-analyzed; a noun obtained by the morpheme analysis is extracted as a retrieval word; retrieval is performed on the Web by use of the extracted retrieval word; a retrieved second text is morpheme-analyzed; a noun having frequency of a preset threshold value or above among nouns obtained by the morpheme analysis is acquired as a related term in the sentence units; the retrieval word extracted from one sentence among the plurality of sentences obtained by decomposing the first text, and the related term acquired by use of the retrieval word are combined to produce a keyword set, a word appearing in common between a plurality of the keyword sets is extracted as a common word, and the extracted common word is output as a term showing a field of the first text. COPYRIGHT: (C)2008,JPO&INPIT
申请公布号 JP2008065468(A) 申请公布日期 2008.03.21
申请号 JP20060240640 申请日期 2006.09.05
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 ABE NAOTO;TANABE KATSUYOSHI;OKUDA HIDENORI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址