摘要 |
<p>Included are a candidate extraction unit 61 that extracts, from a document formed by a group of character strings, a longest consecutive partial string common to one character string and the other character string as a candidate for an important word related to the one character string; a candidate integration unit 62 that selects a longest partial string of the candidate for the important word related to the one character string and extracted by the candidate extraction unit 61; and a group integration unit 63 that integrates a group of the longest partial string of each character string selected by the candidate integration unit 62, this group not forming a subset of a group of the other character string, thereby forming a group of the important word.</p> |