发明名称 OCR-based image compression
摘要 A method for compressing a digitized image of a document using optical character recognition (OCR). The method includes performing optical character recognition (OCR) on the digitized image, identifying, based, at least in part, on a result of the performing step, a plurality of classes of characters comprised in the image, each the class of characters having an associated character value and comprising at least one character, pruning each class of characters, thereby producing information describing the plurality of classes of characters and a residual image, and utilizing the information describing the plurality of classes of characters and the residual image as a compressed digitized image in further processing.Related methods and apparatus are also disclosed.
申请公布号 US6487311(B1) 申请公布日期 2002.11.26
申请号 US19990304861 申请日期 1999.05.04
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 GAL YANIV;SORIN ALEXANDER;HEILPER ANDREI;WALLACH EUGENE
分类号 H04N1/411;(IPC1-7):G06K9/68;G06K9/20;G06K9/46;G06K9/62;H04N1/32 主分类号 H04N1/411
代理机构 代理人
主权项
地址