Abstract |
PROBLEM TO BE SOLVED: To enable, in connection with a method for extracting similar sentences, the automatic creation of a keyword combination, i.e., a classification rule, that can accurately collect (classify) texts from a target text, based on texts obtained by an analyzer through sampling from a specific information source. SOLUTION: A similar sentence extraction program compares a plurality of sampling sentence groups 211 with an extraction object sentence group 212 with respect to a plurality of morpheme pairs extracted from the sampling sentence group 211. Under control of a similar sentence determination unit 106, a case quantity similarity calculation unit 103, an extraction unit 104, and a removal unit 105 repeatedly execute a process that narrows the extraction object sentence group 212 to the sentences containing each morpheme pair, taking the pairs in order of how close their numbers of appearing sentences are (closest, i.e., highest similarity, first). Thus, sentences similar to the sampling sentence group 211 can be extracted effectively from the extraction object sentence group 212. COPYRIGHT: (C)2010,JPO&INPIT
|
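The narrowing loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions that the abstract does not state: whitespace tokens stand in for morphemes, the "distance" between numbers of appearing sentences is taken as the absolute difference between a pair's sentence count in the sampling group and in the current extraction object group, and a pair is skipped when narrowing on it would leave fewer than `min_remaining` sentences. The names `morpheme_pairs`, `sentence_count`, `extract_similar`, and `min_remaining` are hypothetical.

```python
from itertools import combinations


def morpheme_pairs(sentence):
    """Unordered pairs of distinct tokens (whitespace tokens stand in for morphemes)."""
    tokens = sorted(set(sentence.split()))
    return set(combinations(tokens, 2))


def sentence_count(pair, pair_sets):
    """Number of sentences whose pair set contains the given pair."""
    return sum(1 for pairs in pair_sets if pair in pairs)


def extract_similar(sampling, targets, min_remaining=1):
    """Narrow `targets` using morpheme pairs taken from `sampling`, closest counts first."""
    sample_sets = [morpheme_pairs(s) for s in sampling]
    candidates = set().union(*sample_sets)
    remaining = [(t, morpheme_pairs(t)) for t in targets]

    while candidates:
        target_sets = [pairs for _, pairs in remaining]

        def distance(pair):
            # Assumed measure: absolute difference of appearing-sentence counts.
            return abs(sentence_count(pair, sample_sets)
                       - sentence_count(pair, target_sets))

        # Pick the closest pair; ties are broken alphabetically for determinism.
        best = min(candidates, key=lambda p: (distance(p), p))
        candidates.discard(best)

        # Keep only the target sentences that contain the chosen pair.
        narrowed = [(t, pairs) for t, pairs in remaining if best in pairs]
        if len(narrowed) >= min_remaining:  # assumed stopping rule: skip pairs that over-narrow
            remaining = narrowed

    return [t for t, _ in remaining]


sampling = ["battery drains fast", "battery drains overnight"]
targets = [
    "the battery drains quickly",
    "screen is bright",
    "battery drains while idle",
    "battery cover is loose",
]
result = extract_similar(sampling, targets)
```

With these inputs, the pair ("battery", "drains") appears in two sampling sentences and two target sentences, so it is chosen first and `result` keeps only the two target sentences that mention both words. The remaining pairs appear in no target sentence and are skipped.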