摘要 |
<P>PROBLEM TO BE SOLVED: To realize text segmentation using web retrieval that enables text segmentation without requiring learning data. <P>SOLUTION: A text segmentation device divides an input text into sentence units, subjects the divided sentences to morphological analysis, extracts all words except particles, subjected to the morphological analysis as retrieval words, converts the words having inflected forms into the words having end forms, subjects a text obtained by web retrieval on the basis of the retrieval words to the morphological analysis, extracts all the words except the particles as related words, converts the words having inflected forms into the words having end forms, determines semantic paragraphs on the basis of connectivity between the sentences by using a set of keywords which are combinations of the retrieval words and the related words stored in related word storage means, and creates division candidates, and evaluates the division candidates to select one division result to output the result. <P>COPYRIGHT: (C)2013,JPO&INPIT |