Abstract |
PROBLEM TO BE SOLVED: To provide a method for extracting characters from a color document image that clusters the color information to separate the background color from the character color, so that characters can be extracted even from a complex and varied background. SOLUTION: The method extracts only the character-color portions of a color document image with a complex and varied background. It removes dithering from the color document image by smoothing, converts the color values of the dither-removed image from the RGB system to the L*u*v* system, and prepares a histogram of them. It then clusters the color information by fuzzy clustering, prepares color-separated images (binary images) according to the degree of membership, removes noise from the binary images, and labels the black pixels and white pixels. Finally, it selects a binary image suitable for character extraction and extracts a line of characters. |
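The color conversion, fuzzy clustering, and color-separation steps can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: the RGB input is taken to be 8-bit sRGB with a D65 white point, the fuzzy clustering is taken to be standard fuzzy c-means with fuzzifier `m = 2`, and a pixel is assigned to a color-separated binary image when its membership degree exceeds a hypothetical threshold of 0.5. The function names (`rgb_to_luv`, `fuzzy_c_means`, `binary_masks`) are illustrative only. Smoothing, the histogram, noise removal, labeling, and line extraction are omitted.

```python
import math

def rgb_to_luv(r, g, b):
    """Convert one 8-bit sRGB pixel to CIE L*u*v* (D65 white point assumed)."""
    def linearize(c):
        c /= 255.0
        return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4
    rl, gl, bl = linearize(r), linearize(g), linearize(b)
    # Linear sRGB -> CIE XYZ
    x = 0.4124 * rl + 0.3576 * gl + 0.1805 * bl
    y = 0.2126 * rl + 0.7152 * gl + 0.0722 * bl
    z = 0.0193 * rl + 0.1192 * gl + 0.9505 * bl
    xn, yn, zn = 0.95047, 1.0, 1.08883  # D65 reference white
    un = 4 * xn / (xn + 15 * yn + 3 * zn)
    vn = 9 * yn / (xn + 15 * yn + 3 * zn)
    d = x + 15 * y + 3 * z
    if d == 0:  # pure black: chromaticity is undefined, return the origin
        return (0.0, 0.0, 0.0)
    up, vp = 4 * x / d, 9 * y / d
    yr = y / yn
    lightness = 116 * yr ** (1 / 3) - 16 if yr > (6 / 29) ** 3 else (29 / 3) ** 3 * yr
    return (lightness, 13 * lightness * (up - un), 13 * lightness * (vp - vn))

def membership_degrees(points, centers, m=2.0):
    """Fuzzy membership of every point in every cluster; each row sums to 1."""
    rows = []
    for p in points:
        # Clamp distances so a point sitting exactly on a center does not divide by zero
        d = [max(math.dist(p, c), 1e-12) for c in centers]
        rows.append([1.0 / sum((dk / dj) ** (2 / (m - 1)) for dj in d) for dk in d])
    return rows

def fuzzy_c_means(points, centers, m=2.0, iters=50):
    """Standard fuzzy c-means: alternate membership and center updates."""
    dims = len(points[0])
    for _ in range(iters):
        u = membership_degrees(points, centers, m)
        new_centers = []
        for k in range(len(centers)):
            w = [row[k] ** m for row in u]
            total = sum(w)
            new_centers.append(tuple(
                sum(wi * p[a] for wi, p in zip(w, points)) / total
                for a in range(dims)))
        centers = new_centers
    return centers, membership_degrees(points, centers, m)

def binary_masks(memberships, width, threshold=0.5):
    """One binary image per cluster: 1 where the membership degree exceeds threshold."""
    masks = []
    for k in range(len(memberships[0])):
        flat = [1 if row[k] > threshold else 0 for row in memberships]
        masks.append([flat[i:i + width] for i in range(0, len(flat), width)])
    return masks

# Toy 4x2 image: dark "character" pixels alternating with a light background.
pixels = [(250, 250, 240), (20, 20, 30), (245, 248, 238), (25, 18, 28),
          (30, 25, 35), (252, 251, 243), (22, 21, 29), (248, 249, 241)]
luv = [rgb_to_luv(*p) for p in pixels]
# Deterministic start: first light pixel and first dark pixel as initial centers.
centers, memberships = fuzzy_c_means(luv, [luv[0], luv[1]])
masks = binary_masks(memberships, width=4)
for row in masks[1]:  # binary image of the dark (character) color
    print(row)
```

In this toy case the second mask marks the dark pixels, which is the kind of color-separated binary image the later selection step would choose from. A real implementation would likely cluster the histogram bins rather than every pixel, which is presumably why the method prepares a histogram first.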