发明名称 METHODS AND SYSTEMS THAT USE A HIERARCHICALLY ORGANIZED DATA STRUCTURE CONTAINING STANDARD FEATURE SYMBOLS IN ORDER TO CONVERT DOCUMENT IMAGES TO ELECTRONIC DOCUMENTS
摘要 The current application is directed to methods and systems that convert document images, which contain Arabic text and text in other languages in which symbols are joined together to produce continuous words and portions of words, into corresponding electronic documents. In one implementation, a document-image-processing method and system to which the current application is directed employs numerous techniques and features that render efficiently computable an otherwise intractable or impractical document-image-to-electronic-document conversion. These techniques and features include transformation of text-image morphemes and words into feature symbols with associated parameters, efficiently identifying similar morphemes and words in an electronic store of standard-feature-symbol-encoded morphemes and words, and identifying candidate inter-character division points and corresponding traversal paths using the similar morphemes and words identified in the word store.
申请公布号 WO2014204338(A1) 申请公布日期 2014.12.24
申请号 WO2013RU00515 申请日期 2013.06.18
申请人 ABBYY DEVELOPMENT LLC 发明人 CHULININ, YURY GEORGIEVICH
分类号 G06K9/00;G06K9/68;G06K9/72 主分类号 G06K9/00
代理机构 代理人
主权项
地址