发明名称 METHOD AND SYSTEM FOR PREPROCESSING AN IMAGE FOR OPTICAL CHARACTER RECOGNITION
摘要 A method and system for preprocessing an image for Optical Character Recognition (OCR), wherein the image includes a plurality of columns is disclosed. Each column includes one or more of Arabic text and non-text items. The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. On determining the plurality of components, a line height and a column spacing is determined for the plurality of components. The plurality of components are then associated with a column of the plurality of columns based on the line height and the column spacing. Subsequently, a set of characteristic parameters are calculated for each column and the plurality of components of each column are merged based on the set of characteristic parameters to form sub-words and words.
申请公布号 US2011305387(A1) 申请公布日期 2011.12.15
申请号 US20100814448 申请日期 2010.06.12
申请人 AL-OMARI HUSSEIN KHALID;KHORSHEED MOHAMMAD SULAIMAN;KING ABDUL AZIZ CITY FOR SCIENCE AND TECHNOLOGY 发明人 AL-OMARI HUSSEIN KHALID;KHORSHEED MOHAMMAD SULAIMAN
分类号 G06K9/00;G06K9/18 主分类号 G06K9/00
代理机构 代理人
主权项
地址