摘要 |
PROBLEM TO BE SOLVED: To make it possible to deal with any type of chart and book without preliminarily presuming the format of a document. SOLUTION: A run histogram calculating part 5 calculates the histogram of an extracted black run, and a ruled line parameter extracting part 7 extracts a parameter(the threshold value of the black run) for the extraction of a ruled line based on the histogram. A rectangle extracting part 8 extracts connected components (a rectangle) constituted of the black runs more than the threshold value, and a solid ruled line extracting part 10 synthesizes the adjacent rectangles, and extracts a solid ruled line. A ruled line parameter extracting part 14 extracts a parameter (the threshold value of ruled line length) necessary for the extraction of the ruled line from a ruled line histogram. |