Abstract |
PROBLEM TO BE SOLVED: To reduce the amount of data by separating image data from the character data generated by recognizing character strings in that image data, and to generate a highly usable output file from which both the image data and the character data can be read. SOLUTION: This image processor is equipped with a data storage part 104, which stores the divided image data produced by an area division part 102, and a structured document generation part 105, which uses XML (eXtensible Markup Language) or SGML (Standard Generalized Markup Language) to generate a structured document that contains, side by side, the storage addresses of the divided image data stored in the data storage part 104 and the corresponding character data generated by a character recognition part 103. |
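
As an illustration only, the structured document described above might be sketched in Python as follows. The abstract does not specify a schema, so the element and attribute names (`document`, `region`, `image`, `src`, `text`), the `Region` type, and both function names are hypothetical, and the storage addresses are placeholder strings rather than real output of the data storage part 104.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Region:
    """One divided image area (hypothetical type, not from the source)."""
    address: str  # storage address of the divided image data
    text: str     # character data recognized in that area; may be empty


def build_structured_document(regions):
    """Serialize regions as XML, pairing each address with its text."""
    root = ET.Element("document")
    for index, region in enumerate(regions, start=1):
        node = ET.SubElement(root, "region", id=str(index))
        # Only the storage address is written, not the image bytes.
        ET.SubElement(node, "image", src=region.address)
        ET.SubElement(node, "text").text = region.text
    return ET.tostring(root, encoding="unicode")


def read_structured_document(xml_text):
    """Recover the address/text pairs from the XML produced above."""
    root = ET.fromstring(xml_text)
    return [
        Region(node.find("image").get("src"), node.find("text").text or "")
        for node in root.findall("region")
    ]
```

Because the document holds only addresses and recognized text, it stays small, while a reader can still follow each `src` value back to the stored image area.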