发明名称 SYSTEMS AND METHODS FOR HANDLING AND DISTINGUISHING BINARIZED, BACKGROUND ARTIFACTS IN THE VICINITY OF DOCUMENT TEXT AND IMAGE FEATURES INDICATIVE OF A DOCUMENT CATEGORY
摘要 A method of enhancing electronic documents received from a plurality of users by a document analysis system for improving automatic recognition and classification of the received electronic documents, is provided. For each page of a received electronic document, the method filters the page to infer binarized-background artifacts resulting from the binarization of the original grayscale or color image source document and which reside in the vicinity of binarized text and binarized image features in the page, so that the binarized text and binarized images may be distinguished from the binarized-background artifacts and extracted from the document. The method then uses the extracted features from the filtered document to automatically recognized and classify a document into a document category.
申请公布号 US2009119296(A1) 申请公布日期 2009.05.07
申请号 US20080266465 申请日期 2008.11.06
申请人 COPANION, INC. 发明人 NEOGI DEPANKAR;LADD STEVEN K.;AHMED DILNAWAJ;SHESHARAM LOHITH C.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址