摘要 |
A document image processing apparatus includes an specifying section, an extracting section, a recognizing section, an interpreting section, an arranging section and a generating section. The specifying section specifies a sentence region including a character row from a document image. The extracting section extracts at least one of character row images included in the specified sentence region. The recognizing section recognizes respective characters included in the extracted character row image. The interpreting section interprets an original sentence character row comprising the recognized characters and generates an interpreted sentence character row. The arranging section arranges the respective character row images in the sentence region by contracting the respective character row images. The arranging section arranges the generated respective interpreted sentence character rows in a vacant region except a region arranging the respective character row images from the sentence region. |