摘要 |
PROBLEM TO BE SOLVED: To retrieve document images of similar contents faster than the case where character recognition processing is performed on the document images. ! SOLUTION: A second feature extraction part 31 extracts each region circumscribed to a portion corresponding to at least a part of one character from a document image and extracts a feature sequence in which features of a plurality of regions neighboring to each other in a predetermined direction are arranged side by side in an array order of regions. A retrieval part 33 collates each of the plurality of feature sequences extracted from a plurality of registered document images and stored in a hash table 25 correspondingly to identification information of the registered document images with the feature sequence extracted from a target document image. ! COPYRIGHT: (C)2015,JPO&INPIT |