Title: Post-processing system and method for correcting machine recognized text
Abstract: A method of post-processing character data from an optical character recognition (OCR) engine and apparatus to perform the method. This exemplary method includes segmenting the character data into a set of initial words. The set of initial words is word level processed to determine at least one candidate word corresponding to each initial word. The set of initial words is segmented into a set of sentences. Each sentence in the set of sentences includes a plurality of initial words and candidate words corresponding to the initial words. A sentence is selected from the set of sentences. The selected sentence is word disambiguity processed to determine a plurality of final words. A final word is selected from the at least one candidate word corresponding to a matching initial word. The plurality of final words is then assembled as post-processed OCR data.
Publication number: US7092567(B2)
Publication date: 2006.08.15
Application number: US20020288645
Filing date: 2002.11.04
Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD.
Inventors: MA YUE; GUO JINHONG KATHERINE; LI MU; TONG YU-KUN; YAO TIAN-SHUN; ZHU JING-BO
Classification: G06K9/34; G06F17/27; G06K9/03; G06K9/72
Main classification: G06K9/34
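The sequence of steps in the abstract (segment into initial words, generate candidate words, segment into sentences, disambiguate, assemble) can be illustrated with a minimal Python sketch. Everything specific here is an assumption for illustration and is not taken from the patent: the toy `LEXICON`, the `BIGRAMS` counts, the use of `difflib.get_close_matches` for candidate generation, and the greedy bigram scoring used for disambiguation. All function names are hypothetical.

```python
import re
from difflib import get_close_matches

# Hypothetical toy resources; a real system would use a full dictionary
# and a trained language model.
LEXICON = ["the", "dog", "sat", "on", "mat", "rat", "oat"]
BIGRAMS = {
    ("<s>", "the"): 6, ("the", "dog"): 5, ("dog", "sat"): 4,
    ("sat", "on"): 4, ("on", "the"): 6, ("the", "mat"): 3,
    ("the", "rat"): 1,
}
SENTENCE_END = {".", "!", "?"}


def segment_words(text):
    """Split raw OCR character data into initial words and end punctuation."""
    return re.findall(r"[A-Za-z0-9]+|[.!?]", text)


def candidates(word, n=5):
    """Word-level step: at least one candidate per initial word."""
    if word in SENTENCE_END:
        return [word]
    found = get_close_matches(word.lower(), LEXICON, n=n, cutoff=0.5)
    return found or [word.lower()]  # fall back to the initial word itself


def word_level(words):
    """Pair each initial word with its candidate words."""
    return [(w, candidates(w)) for w in words]


def segment_sentences(pairs):
    """Group (initial word, candidates) pairs into sentences."""
    sentences, current = [], []
    for pair in pairs:
        current.append(pair)
        if pair[0] in SENTENCE_END:
            sentences.append(current)
            current = []
    if current:
        sentences.append(current)
    return sentences


def disambiguate(sentence):
    """Greedy assumption: pick the candidate with the highest bigram count
    given the previously chosen final word; ties keep the closest match."""
    finals, prev = [], "<s>"
    for _initial, cands in sentence:
        best = max(cands, key=lambda c: BIGRAMS.get((prev, c), 0))
        finals.append(best)
        prev = best
    return finals


def assemble(words):
    """Join final words, attaching end punctuation to the preceding word."""
    out = ""
    for w in words:
        out += w if (w in SENTENCE_END or not out) else " " + w
    return out


def post_process(text):
    finals = []
    for sentence in segment_sentences(word_level(segment_words(text))):
        finals.extend(disambiguate(sentence))
    return assemble(finals)


print(post_process("Tbe dog sat on tbe rnat."))
```

In this toy setup the closest string match for the misrecognized `rnat` is `rat`, but the sentence-level step prefers `mat` because of the assumed bigram counts. This mirrors the abstract's split between word-level candidate generation and sentence-level selection of final words.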