Abstract |
<p>PROBLEM TO BE SOLVED: To provide a method for determining, from a set of documents in a second language, at least one second document having the same textual content as a first document in a first language.</p><p>SOLUTION: The method includes the steps of: generating a first histogram indicative of the textual content of the first document; generating a second histogram for each document of the set of documents, each second histogram being indicative of the textual content of the corresponding document; comparing each second histogram with the first histogram to determine at least one of the second histograms that matches the first histogram; and identifying the at least one second document as the document having the at least one matching histogram.</p><p>COPYRIGHT: (C)2009, JPO&INPIT</p> |
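The steps named in the abstract (generate a histogram per document, compare, identify the matches) can be sketched as follows. This is only an illustrative sketch, not the claimed implementation: the abstract does not say what the histogram counts or how a "match" is decided, so the sketch assumes a histogram over language-independent tokens (digit runs and punctuation marks) and assumes cosine similarity with a threshold as the matching rule. The names `text_histogram`, `cosine_similarity`, `find_matching_documents`, and the `threshold` parameter are hypothetical.

```python
from collections import Counter
import math
import re


def text_histogram(text):
    """Count language-independent tokens (assumption: digit runs and punctuation)."""
    return Counter(re.findall(r"\d+|[^\w\s]", text))


def cosine_similarity(h1, h2):
    """Cosine similarity between two histograms; 0.0 if either is empty."""
    dot = sum(h1[k] * h2[k] for k in h1.keys() & h2.keys())
    norm1 = math.sqrt(sum(v * v for v in h1.values()))
    norm2 = math.sqrt(sum(v * v for v in h2.values()))
    if norm1 == 0 or norm2 == 0:
        return 0.0
    return dot / (norm1 * norm2)


def find_matching_documents(first_document, candidates, threshold=0.9):
    """Return (name, score) pairs for candidates whose histogram matches.

    `candidates` maps a document name to its text. The threshold value is an
    assumption chosen for illustration only.
    """
    first_hist = text_histogram(first_document)
    matches = []
    for name, text in candidates.items():
        score = cosine_similarity(first_hist, text_histogram(text))
        if score >= threshold:
            matches.append((name, score))
    # Best match first.
    return sorted(matches, key=lambda m: m[1], reverse=True)


if __name__ == "__main__":
    first = "Invoice 42 issued in 2009: total 1500, paid."
    second_language_docs = {
        "de": "Rechnung 42 ausgestellt 2009: Summe 1500, bezahlt.",
        "other": "Das Wetter ist heute gut!",
    }
    for name, score in find_matching_documents(first, second_language_docs):
        print(name, round(score, 3))
```

Counting only tokens that survive translation is one possible way to make histograms comparable across languages; any other language-independent feature could be substituted in `text_histogram` without changing the comparison and identification steps.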