摘要 |
A text comparison apparatus computes the occurrence count of text elements, stores those text elements that have an occurrence count of at least a occurrence count threshold for storage in a text element storage unit, uses those text elements that have an occurrence count of at least a occurrence count threshold for similarity calculation to calculate similarity, and calculates discrepancy for those text elements for which the difference of occurrence counts is at least a occurrence count threshold for discrepancy calculation.
|