发明名称 Method for identifying and resolving erroneous characters output by an optical character recognition system
摘要 A post-processing method for an optical character recognition (OCR) method for combining different OCR engines to identify and resolve characters and attributes of the characters that are erroneously recognized by multiple optical character recognition engines. The characters can originate from many different types of character environments. OCR engine outputs are synchronized in order to detect matches and mismatches between said OCR engine outputs by using synchronization heuristics. The mismatches are resolved using resolution heuristics and neural networks. The resolution heuristics and neural networks are based on observing many different conventional OCR engines in different character environments to find what specific OCR engine correctly identifies a certain character having particular attributes. The results are encoded into the resolution heuristics and neural networks to create an optimal OCR post-processing solution.
申请公布号 US5418864(A) 申请公布日期 1995.05.23
申请号 US19940272451 申请日期 1994.07.11
申请人 MOTOROLA, INC. 发明人 MURDOCK, MICHAEL C.;NEWMAN, MARC A.
分类号 G06K9/62;G06K9/03;G06K9/68;(IPC1-7):G06K9/03;G06K9/72 主分类号 G06K9/62
代理机构 代理人
主权项
地址