摘要 |
Based on an area detection signal, a layer separation section outputs a text component of a document, to a feature point calculating section, and generates four layers from a pictorial component of the document to output the generated layers to the feature point calculating section. The feature point calculating section sums feature points extracted for each component. A features calculating section calculates a hash value based on the feature points. A vote processing section searches a hash table based on the hash value, and votes for a reference image associated with the hash value. Based on the voting result, a similarity determination processing section determines whether the document image is similar to any reference image, and then outputs the determination result. Thus, even if the document contains a photograph, accurate matching can be performed.
|