Abstract |
<p>PROBLEM TO BE SOLVED: To select, from among a plurality of candidate regular expressions, a regular expression with little leakage (missed matches) and/or noise (spurious matches). SOLUTION: A program causes an information processor 30 to execute processing that: stores, in a storage unit 334, a plurality of regular expressions that match one or more desired text parts in given document data; counts, for each of a plurality of pieces of document data, the number of text parts matched by each of the stored regular expressions; acquires the number of desired text parts in each of the plurality of pieces of document data; and determines, from among the plurality of regular expressions, a regular expression for which the number of matched text parts agrees closely with the number of desired text parts across the plurality of pieces of document data.</p> |
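The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names (`count_matches`, `agreement`, `select_regex`), the sample documents, and the candidate patterns are all hypothetical, and the "degree of coincidence" is assumed here to be the fraction of documents whose match count exactly equals the desired count. The abstract does not specify the actual measure.

```python
import re


def count_matches(pattern, text):
    """Count the non-overlapping text parts in `text` matched by `pattern`."""
    return sum(1 for _ in re.finditer(pattern, text))


def agreement(pattern, documents, desired_counts):
    """Fraction of documents whose match count equals the desired count.

    Assumption: this exact-equality ratio stands in for the abstract's
    unspecified "degree of coincidence".
    """
    hits = sum(
        1
        for doc, want in zip(documents, desired_counts)
        if count_matches(pattern, doc) == want
    )
    return hits / len(documents)


def select_regex(candidates, documents, desired_counts):
    """Return the candidate with the highest agreement score."""
    return max(candidates, key=lambda p: agreement(p, documents, desired_counts))


# Hypothetical sample data: the desired text parts are ISO-style dates.
documents = [
    "Filed 2020-01-15, granted 2021-03-02.",
    "Ref 12-34-56 filed 2019-07-30.",
    "Call 555-12-34 about 2018-11-05 and 2018-12-01",
]
desired_counts = [2, 1, 2]  # number of desired dates in each document

candidates = [
    r"\d+-\d+-\d+",               # noisy: also matches reference and phone numbers
    r"\d{4}-\d{2}-\d{2}(?=,)",    # leaky: misses dates not followed by a comma
    r"\d{4}-\d{2}-\d{2}",         # matches exactly the desired dates here
]

best = select_regex(candidates, documents, desired_counts)
print(best)
```

Comparing only counts, rather than the matched spans themselves, means the user needs to supply just the number of desired text parts per document instead of annotating every occurrence. The trade-off is that a pattern can reach the right count by matching the wrong text, so checking against several documents matters.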