Abstract |
A character region extracting method adapted to character recognition comprises the following three steps. The first step is to segment a first region (BL1) from a document image by extracting a projection of the document image in a first direction parallel to a character string and detecting a range in which the projection is continuous in a second direction perpendicular to the first direction. This range defines the first region. The second step is to segment a second region (sBL1) from the first region by extracting a projection of the first region in the second direction and detecting a range in which the projection is continuous in the first direction. This range defines the second region. The third step is to segment a third region (BL2) from the second region by extracting a projection of the second region in the first direction and detecting a range in which the projection is continuous in the second direction. This range defines the third region. Third regions may be joined together so as to form a fourth region containing characters or a character string.
|
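
The three projection steps can be illustrated with a minimal sketch. It is not the claimed implementation and rests on several assumptions: the document image is a binary NumPy array in which nonzero pixels are ink, the character string runs horizontally (so the first direction is the x axis and the second is the y axis), and "continuous" is read as an unbroken run of nonzero projection values. The names `runs` and `extract_regions` are hypothetical, and the optional joining of third regions into a fourth region is left out.

```python
import numpy as np

def runs(profile):
    """Return (start, stop) pairs, stop exclusive, of unbroken nonzero runs."""
    # Pad with False so runs touching either end still produce two edges.
    mask = np.concatenate(([False], profile > 0, [False]))
    edges = np.flatnonzero(mask[1:] != mask[:-1])
    return list(zip(edges[0::2].tolist(), edges[1::2].tolist()))

def extract_regions(img):
    """Return third regions as (top, bottom, left, right), bottom/right exclusive.

    Assumes img is 2-D, nonzero = ink, and text lines are horizontal.
    """
    regions = []
    # Step 1: project along x (one value per row), split along y -> BL1.
    for y0, y1 in runs(img.sum(axis=1)):
        bl1 = img[y0:y1]
        # Step 2: project along y (one value per column), split along x -> sBL1.
        for x0, x1 in runs(bl1.sum(axis=0)):
            sbl1 = bl1[:, x0:x1]
            # Step 3: project along x again, split along y -> BL2.
            for r0, r1 in runs(sbl1.sum(axis=1)):
                regions.append((y0 + r0, y0 + r1, x0, x1))
    return regions
```

Under these assumptions, the third pass matters when a column slice contains vertically separated strokes that the first pass could not split because a neighbouring character filled the gap in the row projection.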