Title of invention |
Method and system for preprocessing an image for optical character recognition |
Abstract |
<p>A method and system for preprocessing an image that includes a plurality of columns, or regions, of text are disclosed. A plurality of components associated with the text is determined. Once the plurality of components has been determined, a line height and a column spacing are determined for the components. Each component is then associated with a column based on the line height and the column spacing. A set of characteristic parameters is calculated for each column, and the plurality of components of each column is merged based on the characteristic parameters to form sub-words and words. A first plurality of words and/or sub-words is merged and processed as a first region, and a second plurality of words and/or sub-words is merged and processed as a second region, wherein at least a portion of the second region vertically overlaps at least a portion of the first region.</p> |
Application publication number |
EP2662802(A1) |
Application publication date |
2013.11.13 |
Application number |
EP20130162939 |
Filing date |
2013.04.09 |
Applicant |
KING ABDULAZIZ CITY FOR SCIENCE & TECHNOLOGY (KACST) |
Inventors |
AL-OMARI, HUSSEIN KHALID;KHORSHEED, MOHAMMAD SULAIMAN |
Classification number |
G06K9/00 |
Main classification number |
G06K9/00 |
Agency |
|
Agent |
|
Main claim |
|
Address |
|
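
The column-association step described in the abstract can be illustrated with a minimal sketch. It is not the method claimed in the application: the record does not state how the line height or the column spacing is computed, so the choices below are assumptions made purely for illustration. The `Box` type, the use of the median component height as the line height, and the `spacing_factor` multiplier that derives a column-gap threshold from the line height are all hypothetical.

```python
from dataclasses import dataclass
from statistics import median


@dataclass(frozen=True)
class Box:
    """Bounding box of one connected component (hypothetical type)."""
    x: int  # left edge
    y: int  # top edge
    w: int  # width
    h: int  # height


def estimate_line_height(boxes):
    # Assumption: the median component height approximates the line height.
    return median(b.h for b in boxes)


def assign_columns(boxes, spacing_factor=2.0):
    """Group components into columns by horizontal gaps.

    Assumption: a gap wider than spacing_factor * line_height between the
    running right edge of a column and the next component starts a new column.
    """
    threshold = spacing_factor * estimate_line_height(boxes)
    columns = []
    right_edge = None
    for box in sorted(boxes, key=lambda b: b.x):
        if right_edge is None or box.x - right_edge > threshold:
            columns.append([])            # open a new column
            right_edge = box.x + box.w
        else:
            right_edge = max(right_edge, box.x + box.w)
        columns[-1].append(box)
    return columns


if __name__ == "__main__":
    sample = [
        Box(10, 10, 20, 12), Box(35, 10, 25, 12),    # left column
        Box(200, 10, 30, 12), Box(235, 12, 20, 10),  # right column
    ]
    for index, column in enumerate(assign_columns(sample)):
        print(index, column)
```

Deriving the gap threshold from the line height keeps the grouping independent of scan resolution, which is one plausible reason to base column association on both quantities. The merging of components into sub-words and words, and the handling of vertically overlapping regions, are not sketched here because the record gives no details on the characteristic parameters involved.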