Title of invention TEXT SEGMENTATION DEVICE, METHOD, PROGRAM, AND COMPUTER-READABLE RECORDING MEDIUM
Abstract PROBLEM TO BE SOLVED: To provide text segmentation that uses web retrieval and therefore requires no learning data. SOLUTION: A text segmentation device divides an input text into sentences and subjects each sentence to morphological analysis. It extracts all words except particles as retrieval words, converting inflected words to their end (terminal) forms. It then subjects text obtained by web retrieval with the retrieval words to morphological analysis, extracts all words except particles as related words, and likewise converts inflected words to their end forms. Using a set of keywords that combines the retrieval words with the related words stored in related word storage means, the device determines semantic paragraphs on the basis of connectivity between sentences and creates division candidates. Finally, it evaluates the division candidates, selects one division result, and outputs it. COPYRIGHT: (C)2013, JPO&INPIT
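The flow described in the abstract can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent: a small English stop list stands in for particle removal, lowercasing stands in for conversion to end forms, the `search` callable is a hypothetical stand-in for web retrieval, Jaccard overlap is an assumed connectivity measure, and the candidate score (mean kept link minus mean cut link) is an assumed evaluation rule.

```python
import re
from typing import Callable, List, Set

# Assumption: a tiny English stop list stands in for Japanese particles.
PARTICLES: Set[str] = {"the", "a", "of", "in", "on", "is", "are", "and", "to"}


def split_sentences(text: str) -> List[str]:
    """Split on sentence-final punctuation (stand-in for sentence division)."""
    return [s.strip() for s in re.split(r"[.!?。]+", text) if s.strip()]


def content_words(sentence: str) -> Set[str]:
    """Stand-in for morphological analysis: lowercased tokens minus particles."""
    return {w for w in re.findall(r"\w+", sentence.lower()) if w not in PARTICLES}


def expand(words: Set[str], search: Callable[[str], str]) -> Set[str]:
    """Keyword set = retrieval words plus related words from retrieved text."""
    keywords = set(words)
    for word in sorted(words):
        keywords |= content_words(search(word))
    return keywords


def connectivity(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two keyword sets (an assumed connectivity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def segment(text: str, search: Callable[[str], str]) -> List[List[str]]:
    """Return the best-scoring division of the text into sentence groups."""
    sentences = split_sentences(text)
    keys = [expand(content_words(s), search) for s in sentences]
    links = [connectivity(keys[i], keys[i + 1]) for i in range(len(keys) - 1)]
    if not links:
        return [sentences] if sentences else []

    best_bounds: List[int] = []
    best_score = float("-inf")
    # One division candidate per distinct threshold: cut every link <= threshold.
    for threshold in sorted(set(links)):
        bounds = [i for i, c in enumerate(links) if c <= threshold]
        cut = [c for c in links if c <= threshold]
        kept = [c for c in links if c > threshold]
        # Assumed evaluation: strong links kept, weak links cut.
        score = (sum(kept) / len(kept) if kept else 0.0) - sum(cut) / len(cut)
        if score > best_score:
            best_bounds, best_score = bounds, score

    segments, start = [], 0
    for b in best_bounds:  # boundary b means "cut after sentence b"
        segments.append(sentences[start:b + 1])
        start = b + 1
    segments.append(sentences[start:])
    return segments
```

Calling `segment(text, search)` with a `search` function that returns retrieved text for a word yields a list of sentence groups. In a real system the tokenizer would be a Japanese morphological analyzer and `search` would query a web search service; both are left abstract here.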
Publication number JP2013101679(A) Publication date 2013.05.23
Application number JP20130015670 Filing date 2013.01.30
Applicant NIPPON TELEGR & TELEPH CORP <NTT> Inventors ABE NAOTO; UCHIYAMA TOSHIRO; UCHIYAMA MASASHI
Classification G06F17/27; G06F17/21; G06F17/30 Main classification G06F17/27
Agency Agent
Principal claim
Address