Title of invention FEATURE WORD EXTRACTION DEVICE, PROGRAM AND METHOD
Abstract PROBLEM TO BE SOLVED: To automatically set appropriate word breaks in document data that is the target of text mining. SOLUTION: This feature word extraction device comprises: a document storage part storing a plurality of pieces of document data; a generation part that divides each clause in a first piece of document data among the plurality of pieces of document data while varying the break positions and the number of breaks, and stores the character strings obtained by the division in a data storage part; a calculation part that calculates a feature degree for each character string stored in the data storage part, using the appearance frequency of the character string in the first document data and the number of pieces of document data, among the plurality stored in the document storage part, in which the character string appears; and a specification part that, for each clause in the first document data, specifies the character string with the highest feature degree among the character strings for that clause and stores it in a feature word storage part. COPYRIGHT: (C)2012,JPO&INPIT
Publication number JP2012150576(A) Publication date 2012.08.09
Application number JP20110007354 Filing date 2011.01.17
Applicant FUJITSU LTD Inventors TAKAHASHI TETSURO;IGATA NOBUYUKI
Classification G06F17/30;G06F17/27 Main classification G06F17/30
Agency Agent
Main claim
Address
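
The procedure outlined in the abstract (generate candidate strings per clause, score them, keep the best one) can be sketched in Python as follows. This is a minimal illustration and not the patented implementation: the abstract does not give the feature degree formula, so a TF-IDF-style score (in-document frequency times the log of inverse document frequency) is assumed here, and the names `split_candidates`, `feature_degree`, and `extract_feature_words`, the tie-break on string length, and the pre-split list of clauses are all hypothetical.

```python
import math
from itertools import combinations


def split_candidates(clause):
    """Collect every string produced by cutting the clause at any set of break positions."""
    n = len(clause)
    candidates = set()
    # Vary the number of breaks k, then the positions of those k breaks.
    # This enumerates 2**(n-1) segmentations, so it is only suitable for short clauses.
    for k in range(n):
        for breaks in combinations(range(1, n), k):
            bounds = (0, *breaks, n)
            for start, end in zip(bounds, bounds[1:]):
                candidates.add(clause[start:end])
    return candidates


def feature_degree(s, target, documents):
    """Assumed TF-IDF-style score; the abstract names only the two inputs, not the formula."""
    tf = target.count(s)                      # non-overlapping occurrences in the first document
    df = sum(1 for d in documents if s in d)  # number of documents containing the string
    if df == 0:
        return 0.0
    return tf * math.log(len(documents) / df)


def extract_feature_words(clauses, target, documents):
    """Return (clause, best candidate) pairs for the target document."""
    result = []
    for clause in clauses:
        if not clause:
            continue  # an empty clause yields no candidates
        # Assumption: ties on score are broken in favor of the longer string.
        best = max(
            sorted(split_candidates(clause)),
            key=lambda s: (feature_degree(s, target, documents), len(s)),
        )
        result.append((clause, best))
    return result


documents = ["xyxy", "xz", "zz"]
pairs = extract_feature_words(["xyxy"], documents[0], documents)
```

Under this assumed scoring, a string that repeats within the first document but is rare across the collection scores highest, which matches the abstract's use of both in-document frequency and document count.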