摘要 |
A method and apparatus for segmenting a document which has both text and image regions. The method and apparatus implement a technique in which large text pixels and image pixels are identified in a document having a relatively low resolution. The method and apparatus then detect dark text pixels on a light background region of a document and assign segmentation labels to each pixel. The pixel labels are post-processed using a plurality of syntactic rules to correct mislabeling of pixels. This process does not change the visual perception of the image regions in the document. Pixels identified as being in the background region of the document are assigned a white label and pixels identified as being in the text region are assigned a black label. The resulting processed document contains sharp black text and white background, resulting in improved perceptual quality and efficient ink utilization during a printing process.
|