摘要 |
<p>In association with a method for extracting a similar sentence, it is possible to automatically create a keyword combination which can accurately collect (classify) texts from a target text obtained by an analyzer through sampling based on a specific information source, i.e., a classification rule. A plurality of sampling sentence groups (211) are compared to an extraction object sentence group (212). That is, under control of a similar sentence determination unit (106), a case quantity similarity calculation unit (103), an extraction unit (104), a removal unit (105), and the similar sentence determination unit (106) repeat a process to narrow the extraction object sentence group (212) so as to contain only each of the morpheme pairs extracted from the sampling sentence group (211) in the descending order of similarity, i.e., in the ascending order of the distance between the numbers of appearing sentences. Thus, it is possible to effectively extract a sentence similar to the sampling sentence group (211) from the extraction object sentence group (212).</p> |