Abstract |
<P>PROBLEM TO BE SOLVED: To extract important phrases from a document without also extracting unimportant phrases that appear in very few documents. <P>SOLUTION: Sentence groups that play different roles in a document, such as "title", "text", and "comment", are each defined as a "section". Using a set of training documents that each include a plurality of sections, important phrases are extracted from an input document on the basis of the number of sections in the input document that contain each phrase and a residual inverse document frequency calculated from the training set and the input document. <P>COPYRIGHT: (C)2012,JPO&INPIT |
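
The scoring idea in the SOLUTION paragraph can be illustrated with a minimal sketch. The abstract does not give a formula, so several things here are assumptions: the residual inverse document frequency is computed with the common Poisson-based definition (observed IDF minus the IDF a Poisson model would predict), the two quantities are combined by simple multiplication, phrases are matched as plain substrings, and the names `residual_idf` and `score_phrase` are hypothetical.

```python
import math


def residual_idf(doc_freq, coll_freq, num_docs):
    """Residual IDF under the common Poisson-based definition (an assumption).

    doc_freq:  number of documents containing the phrase
    coll_freq: total occurrences of the phrase across all documents
    num_docs:  number of documents
    """
    observed_idf = -math.log2(doc_freq / num_docs)
    # IDF predicted if occurrences were spread over documents by a Poisson process.
    expected_idf = -math.log2(1.0 - math.exp(-coll_freq / num_docs))
    return observed_idf - expected_idf


def score_phrase(phrase, input_doc, training_docs):
    """Score a phrase for an input document.

    Each document is a dict mapping a section name ("title", "text",
    "comment", ...) to that section's text. Statistics are gathered from
    the training documents together with the input document.
    """
    docs = training_docs + [input_doc]
    # Substring matching is a simplification; a real system would tokenize.
    doc_freq = sum(1 for d in docs if any(phrase in s for s in d.values()))
    coll_freq = sum(s.count(phrase) for d in docs for s in d.values())
    if doc_freq == 0:
        return 0.0
    # Number of sections of the input document that contain the phrase.
    section_count = sum(1 for s in input_doc.values() if phrase in s)
    # Multiplying the two quantities is an assumed combination, not the patent's.
    return section_count * residual_idf(doc_freq, coll_freq, len(docs))


training_docs = [
    {"title": "Sorting", "text": "the method sorts records", "comment": "ok"},
    {"title": "Hashing", "text": "the method hashes keys", "comment": "fine"},
    {"title": "Caching", "text": "the method caches pages", "comment": "good"},
]
input_doc = {
    "title": "A neural parser",
    "text": "the method trains a neural parser; the neural parser is fast",
    "comment": "nice",
}

for phrase in ("neural parser", "the method"):
    print(phrase, round(score_phrase(phrase, input_doc, training_docs), 3))
```

In this toy data, "neural parser" occurs in two sections of the input document and is concentrated in one document, so it scores higher than "the method", which occurs once in every document and in only one section of the input.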