Title of invention Low resolution OCR for camera acquired documents
Abstract A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and to take grey-level input, which eliminates the need for binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to achieve high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.
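The abstract mentions combining character recognition with dynamic programming and a language model. As a rough illustration only, and not the method claimed in the patent, the sketch below assumes a hypothetical classifier has already produced log-probabilities for candidate character spans (`scores`), and that the language model is a simple word list with log priors (`lexicon`). All names and data structures here are assumptions made for the example.

```python
import math
from typing import Dict, Optional, Tuple

# Hypothetical classifier output: for a span of pixel columns [start, end),
# the log-probability of each character the span might contain.
SpanScores = Dict[Tuple[int, int], Dict[str, float]]


def best_alignment(word: str, width: int, scores: SpanScores) -> float:
    """Best total log-probability of cutting [0, width) into len(word)
    consecutive spans, where the i-th span is labelled word[i].
    Returns -inf if no such segmentation exists."""
    neg_inf = -math.inf
    # best[i][x]: best score for the first i characters, ending at column x
    best = [[neg_inf] * (width + 1) for _ in range(len(word) + 1)]
    best[0][0] = 0.0
    for i, ch in enumerate(word, start=1):
        for (start, end), char_scores in scores.items():
            prev = best[i - 1][start]
            if prev == neg_inf or ch not in char_scores:
                continue
            candidate = prev + char_scores[ch]
            if candidate > best[i][end]:
                best[i][end] = candidate
    return best[len(word)][width]


def recognize_word(
    width: int, scores: SpanScores, lexicon: Dict[str, float]
) -> Tuple[Optional[str], float]:
    """Pick the lexicon word maximising alignment score plus log prior.
    The lexicon stands in for a (very simple, unigram) language model."""
    best_word, best_score = None, -math.inf
    for word, log_prior in lexicon.items():
        total = best_alignment(word, width, scores) + log_prior
        if total > best_score:
            best_word, best_score = word, total
    return best_word, best_score
```

In this sketch segmentation and recognition are decided jointly: no cut positions are fixed in advance, and the word prior can overrule a locally preferred character. A real system would presumably use a richer language model and a shared search over the lexicon rather than one pass per word.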
Publication No. US2005259866(A1) Publication date 2005.11.24
Application No. US20040850335 Filing date 2004.05.20
Applicant Inventor(s) JACOBS CHARLES E.;RINKER JAMES R.;SIMARD PATRICE Y.;VIOLA PAUL A.
Classification G06K9/20;G06K9/34;G06K9/46;G06K9/66;G06K9/72;G06N3/00;(IPC1-7):G06K9/62;G06K9/18 Main classification G06K9/20
Agency Agent
Principal claim
Address