Abstract |
<p>An image processing apparatus comprises a character recognition unit and a generation unit. The character recognition unit executes character recognition on character images in a document image to obtain character codes corresponding to the respective character images. The generation unit generates a digital document that includes the document image, the character codes obtained by the character recognition unit, glyph data indicating a character shape to be used for rendering characters corresponding to the character codes, and a description designating the glyph data to be used for each of the character codes. The description designates identical glyph data for different character codes, so that the identical glyph data is used in common when rendering the characters corresponding to those different character codes. The description further includes a designation to render the identical glyph data for the character codes at positions corresponding to ends of the respective character images in the document image. (Fig. 6)</p> |
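
The structure described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `CharBox`, `DigitalDocument`, `build_document`, and `SHARED_GLYPH_ID` are hypothetical, the glyph shape is a placeholder string, and the choice of the lower edge of each character box as the "end" position is an assumption, since the abstract does not say which end is meant.

```python
from dataclasses import dataclass

# Hypothetical id of the single glyph shared by every character code.
SHARED_GLYPH_ID = 0


@dataclass
class CharBox:
    """One recognized character: its code and its bounding box in the image."""
    code: str
    x: int
    y: int
    width: int
    height: int


@dataclass
class DigitalDocument:
    """Container mirroring the parts listed in the abstract."""
    image_name: str          # the document image
    codes: list              # character codes from recognition
    glyphs: dict             # glyph id -> shape data (placeholder here)
    glyph_for_code: dict     # description: character code -> glyph id
    placements: list         # (code, glyph id, x, y) render positions


def build_document(image_name, boxes):
    """Build a document in which all character codes share one glyph."""
    # Only one glyph shape is stored, regardless of how many codes exist.
    glyphs = {SHARED_GLYPH_ID: "placeholder-shape"}
    # Every distinct character code is mapped to the same glyph id.
    glyph_for_code = {box.code: SHARED_GLYPH_ID for box in boxes}
    # Assumption: the "end" of a character image is its lower edge.
    placements = [
        (box.code, SHARED_GLYPH_ID, box.x, box.y + box.height)
        for box in boxes
    ]
    codes = [box.code for box in boxes]
    return DigitalDocument(image_name, codes, glyphs, glyph_for_code, placements)
```

Under these assumptions, the stored glyph data stays the same size no matter how many distinct characters are recognized, because each code only references the shared glyph id.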