发明名称 WORD EXTRACTION DEVICE, WORD EXTRACTION METHOD AND PROGRAM
摘要 <P>PROBLEM TO BE SOLVED: To accurately extract a word that appears frequently in text, without using any dictionary. <P>SOLUTION: A word extraction device 1 includes: a target character string extraction unit for extracting, as target character strings from text, character strings that have a predetermined number of characters or more and appear in the text a predetermined number of times or more; a first deletion unit for deleting, from a group of the target character strings extracted by the target character string extraction unit, a character string that is a partial character string of other target character strings and appears at positions other than positions included in the other target character strings in the text a less number of times than the predetermined number of times; and a word extraction unit for setting, as a word, a character string included in the group of the target character strings after the deletion of the character string by the first deletion unit. <P>COPYRIGHT: (C)2013,JPO&INPIT
申请公布号 JP2012194870(A) 申请公布日期 2012.10.11
申请号 JP20110059263 申请日期 2011.03.17
申请人 NTT COMWARE CORP 发明人 TSUNODA MAKOTO;WATABE SHUHEI
分类号 G06F17/27;G06F17/30 主分类号 G06F17/27
代理机构 代理人
主权项
地址