Abstract |
<p>An apparatus, method, system, and computer program, each capable of applying document layout analysis to a document image with control of a non-character area. A non-character area is extracted from a document image to be processed. A character image is generated by removing the non-character area from the document image. The character image is segmented into a plurality of sections to generate a segmented image. The segmented image is adjusted using a selected component of the non-character image to generate an adjusted segmented image. A segmentation result generated from the adjusted segmented image is output.</p> |
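The steps in the abstract can be illustrated with a toy sketch. It is not the claimed implementation: it assumes a binary image stored as a list of rows, treats long vertical runs of foreground pixels (ruled lines) as the non-character area, and segments by blank-column gaps. All function names and the `min_run` and `min_gap` parameters are hypothetical, introduced only for this example.

```python
# Toy sketch of the abstract's pipeline. Assumptions (not from the source):
# binary image as a list of rows, vertical ruled lines as the non-character
# area, and column-gap segmentation. All names here are hypothetical.

def find_rule_columns(image, min_run):
    """Extract the non-character area: columns holding a vertical
    foreground run of at least min_run pixels."""
    rule_cols = set()
    for x in range(len(image[0])):
        run = 0
        for row in image:
            run = run + 1 if row[x] else 0
            if run >= min_run:
                rule_cols.add(x)
                break
    return rule_cols

def remove_columns(image, rule_cols):
    """Generate the character image by blanking the non-character columns."""
    return [[0 if x in rule_cols else px for x, px in enumerate(row)]
            for row in image]

def segment_columns(char_image, min_gap):
    """Segment into (start, end) column ranges, end exclusive. Ranges
    separated by fewer than min_gap blank columns are merged."""
    occupied = [any(row[x] for row in char_image)
                for x in range(len(char_image[0]))]
    sections, start, last_ink = [], None, None
    for x, ink in enumerate(occupied):
        if not ink:
            continue
        if start is None:
            start = x
        elif x - last_ink - 1 >= min_gap:
            sections.append((start, last_ink + 1))
            start = x
        last_ink = x
    if start is not None:
        sections.append((start, last_ink + 1))
    return sections

def adjust_sections(sections, rule_cols):
    """Adjust the segmentation: split any section that a rule column crosses."""
    adjusted = []
    for start, end in sections:
        cut_from = start
        for x in sorted(rule_cols):
            if start < x < end:
                if x > cut_from:
                    adjusted.append((cut_from, x))
                cut_from = x + 1
        if cut_from < end:
            adjusted.append((cut_from, end))
    return adjusted

def layout_analysis(image, min_run, min_gap):
    """Run the four steps in order and return the adjusted sections."""
    rule_cols = find_rule_columns(image, min_run)
    char_image = remove_columns(image, rule_cols)
    sections = segment_columns(char_image, min_gap)
    return adjust_sections(sections, rule_cols)

# Two blocks of character pixels separated by a vertical rule in column 3.
IMAGE = [
    [1, 1, 0, 1, 0, 1, 1],
    [0, 1, 0, 1, 0, 1, 0],
    [1, 0, 0, 1, 0, 0, 1],
    [1, 1, 0, 1, 0, 1, 1],
]
```

With `min_gap=4`, segmentation of the character image alone merges both text blocks into one section, because only three blank columns separate them. Using the extracted rule column in the adjustment step splits that section back into two, which is the role the abstract assigns to the selected non-character component.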