Title of invention |
Method and system for preprocessing an image for optical character recognition |
Abstract |
The present invention provides a method and system for preprocessing an image including one or more of Arabic text and non-text items for Optical Character Recognition (OCR). The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. A first set of characteristic parameters is then calculated for the plurality of components. The plurality of components are subsequently merged based on the first set of characteristic parameters to form one or more sub-words, one or more words, or both. |
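The three steps named in the abstract (finding components of connected pixels, calculating parameters for them, and merging them) can be illustrated with a minimal sketch. The abstract does not say which characteristic parameters are used or how the merge decision is made, so everything specific here is an assumption: the sketch uses bounding boxes as the parameters and a horizontal-gap threshold (`max_gap`) as the merge rule, and the function names are hypothetical. It is not the patented method.

```python
from collections import deque


def connected_components(image):
    """Return the 8-connected groups of foreground pixels (value 1).

    `image` is a list of rows of 0/1 values. Each component is a list of
    (row, col) tuples, discovered in row-major scan order.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def bounding_box(pixels):
    """Assumed characteristic parameter: (top, left, bottom, right)."""
    ys = [p[0] for p in pixels]
    xs = [p[1] for p in pixels]
    return (min(ys), min(xs), max(ys), max(xs))


def merge_by_horizontal_gap(boxes, max_gap):
    """Assumed merge rule: join boxes separated by at most `max_gap`
    blank columns (overlapping boxes always merge)."""
    merged = []
    for box in sorted(boxes, key=lambda b: b[1]):
        if merged and box[1] - merged[-1][3] - 1 <= max_gap:
            top, left, bottom, right = merged[-1]
            merged[-1] = (min(top, box[0]), left,
                          max(bottom, box[2]), max(right, box[3]))
        else:
            merged.append(box)
    return merged


def group_components(image, max_gap=1):
    """Run the three steps: components, parameters, merge."""
    boxes = [bounding_box(p) for p in connected_components(image)]
    return merge_by_horizontal_gap(boxes, max_gap)
```

With a small gap threshold, a detached mark such as a dot above or below a letter body ends up in the same box as the body, while a stroke several columns away stays separate. A real system would likely need more parameters than a bounding box to tell text from non-text items.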
Publication number |
US8194983(B2) |
Publication date |
2012.06.05 |
Application number |
US20100779152 |
Filing date |
2010.05.13 |
Applicant |
AL-OMARI HUSSEIN KHALID;KHORSHEED MOHAMMAD SULAIMAN |
Inventor |
AL-OMARI HUSSEIN KHALID;KHORSHEED MOHAMMAD SULAIMAN |
Classification |
G06K9/48;G06K9/00;G06K9/18;G06K9/32 |
Main classification |
G06K9/48 |
Agency |
|
Agent |
|
Main claim |
|
Address |
|