Abstract |
A method of analyzing a source image to separate text from graphics, comprising: (a) scanning and digitizing the source image to obtain a binary image including black and white objects; (b) filtering noise out of the binary image; (c) extracting the contours of the black objects and the white objects from the binary image; (d) evaluating inclusion relationships between the objects and generating a tree-like structure of those relationships; (e) using the contours to measure the objects and obtain the shape properties of each object; (f) classifying the objects as graphics or text according to the measured shape properties and then generating a tree-like structure of the inclusion relationships; and (g) using the source image and the classification of the objects to generate outputs representing graphics and text, respectively. |
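
Steps (b) through (g) can be illustrated with a minimal Python sketch. It is not the claimed method, and several simplifications are assumptions made here: 8-connected component labeling stands in for contour extraction, bounding-box containment stands in for the inclusion relationships, only black objects are analyzed, and the classification rule is a single size threshold. The names `components`, `analyze`, `render`, `demo_image`, `min_pixels`, and `max_text_size` are hypothetical.

```python
from collections import deque

def components(img, value=1):
    """Return the 8-connected components of pixels equal to `value` as pixel lists."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if img[y][x] != value or seen[y][x]:
                continue
            seen[y][x] = True
            queue, pixels = deque([(y, x)]), []
            while queue:
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and not seen[ny][nx] and img[ny][nx] == value):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            comps.append(pixels)
    return comps

def bbox(pixels):
    """Bounding box as (top, left, bottom, right), inclusive."""
    ys = [p[0] for p in pixels]
    xs = [p[1] for p in pixels]
    return min(ys), min(xs), max(ys), max(xs)

def contains(outer, inner):
    """True if bounding box `outer` encloses a different bounding box `inner`."""
    return (outer != inner and outer[0] <= inner[0] and outer[1] <= inner[1]
            and outer[2] >= inner[2] and outer[3] >= inner[3])

def area(box):
    return (box[2] - box[0] + 1) * (box[3] - box[1] + 1)

def analyze(img, min_pixels=2, max_text_size=4):
    """Filter noise, measure objects, classify them, and record inclusion parents."""
    objs = []
    for pixels in components(img):
        if len(pixels) < min_pixels:      # (b) drop tiny specks as noise
            continue
        box = bbox(pixels)                # (e) shape property: bounding box
        size = max(box[2] - box[0] + 1, box[3] - box[1] + 1)
        label = "text" if size <= max_text_size else "graphics"   # (f) assumed rule
        objs.append({"pixels": pixels, "bbox": box, "label": label, "parent": None})
    for obj in objs:                      # (d) parent = smallest enclosing object
        enclosing = [j for j, other in enumerate(objs)
                     if contains(other["bbox"], obj["bbox"])]
        if enclosing:
            obj["parent"] = min(enclosing, key=lambda j: area(objs[j]["bbox"]))
    return objs

def render(img, objs, label):
    """(g) Build an output image holding only the objects with the given label."""
    out = [[0] * len(img[0]) for _ in img]
    for obj in objs:
        if obj["label"] == label:
            for y, x in obj["pixels"]:
                out[y][x] = 1
    return out

def demo_image():
    """8x10 test image: a rectangular frame, a 2x2 blob inside it, and one noise pixel."""
    img = [[0] * 10 for _ in range(8)]
    for x in range(10):
        img[0][x] = img[7][x] = 1
    for y in range(8):
        img[y][0] = img[y][9] = 1
    for y, x in [(2, 2), (2, 3), (3, 2), (3, 3), (5, 6)]:
        img[y][x] = 1
    return img
```

Calling `analyze(demo_image())` and then `render(...)` with `"text"` or `"graphics"` yields the two separated output images. A fuller treatment would trace actual contours and analyze the white objects as well, as the abstract describes.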