发明名称 UNNECESSARY WORD DECIDING APPARATUS AND PROGRAM
摘要 <P>PROBLEM TO BE SOLVED: To decide an unnecessary word without using a threshold for deciding whether or not a word is an unnecessary word. Ž<P>SOLUTION: By a parameter learning section 14, appearance probability of each topic of each word in a word group included in learning document data, which maximizes likelihood for the learning document data is learned and searched. By a word classification section 16, each word in the word group is classified by a topic having highest appearance probability. By an unnecessary word decision section 20, a word group of a topic, by which words with appearance probabilities for every topic respectively falling within a predetermined range and distributed uniformly are classified, is decided as an unnecessary word. Ž<P>COPYRIGHT: (C)2010,JPO&INPIT Ž
申请公布号 JP2010055253(A) 申请公布日期 2010.03.11
申请号 JP20080217867 申请日期 2008.08.27
申请人 FUJI XEROX CO LTD 发明人 ISOZAKI TAKASHI;FUKUI MOTOFUMI;KATO SUKEJI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址