摘要 |
PROBLEM TO BE SOLVED: To realize the retrieval of document image data by using a character index. SOLUTION: A position index preparing part 102 prepares an index for which the result of character recognition processing to a document image by a character recognition part 101 is associated with two-dimensional position information in the document image of the character image of each recognized character, and stores it in an index storage part 103. A retrieval part 104 decides whether or not the arrangement of the characters corresponding to the retrieval character string inputted as retrieval conditions exists in the document image based on the two-dimensional position information of each character held by an index stored in an index storage part 103. Thus, the document image in which it is decided that the arrangement of the characters corresponding to the retrieval character string exists is outputted to, for example, a retrieval result list as a retrieval result. COPYRIGHT: (C)2006,JPO&NCIPI
|