Title of invention: WORD SEARCHABLE DATABASE FROM HIGH VOLUME SCANNING OF NEWSPAPER DATA
Abstract: A process for digitizing newsprint information from a newspaper includes scanning the information into a digital image format and then processing the image to produce searchable text. The processing includes removing data stamps and other marks that are written over the newsprint, enhancing the image using a library of image processing functions, and performing voting-OCR to select an optimal OCR output. The OCR output yields highly accurate text, which can be word-searched using adaptive pattern recognition processing, fuzzy logic, morphology, and other techniques to provide a word-searchable database of newsprint information from newspapers. The process is software-controlled so that the workflow, both electronic and non-electronic, between the various processes or stations is tracked and sequenced, and the appropriate data is collected and stored.
Publication number: WO0113279(A2) Publication date: 2001.02.22
Application number: WO2000US22492 Filing date: 2000.08.17
Applicants: PROGRESSIVE TECHNOLOGY FEDERAL SYSTEMS, INC.;YOKLEY, JOHN, R.;NISSEN, DON;SCHWARTZ, ERIK;KORNELE, BRYAN;LEE, ED;KAPEL, KEVIN Inventors: YOKLEY, JOHN, R.;NISSEN, DON;SCHWARTZ, ERIK;KORNELE, BRYAN;LEE, ED;KAPEL, KEVIN
Classification: G06F17/30;G06K9/00;(IPC1-7):G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address:
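
The abstract mentions voting-OCR followed by fuzzy word search but does not say how either step is carried out. The following is only a minimal sketch of the general idea, under stated assumptions: the outputs of several OCR engines are assumed to be already aligned word by word, a simple majority vote picks each word, and Python's standard `difflib` stands in for the adaptive pattern recognition and fuzzy-logic search named in the abstract. The function names `vote_ocr` and `fuzzy_find` and the sample strings are hypothetical and do not come from the patent.

```python
from collections import Counter
from difflib import get_close_matches
from itertools import zip_longest


def vote_ocr(outputs):
    """Word-level majority vote over OCR outputs (assumed pre-aligned)."""
    token_lists = [text.split() for text in outputs]
    voted = []
    for candidates in zip_longest(*token_lists, fillvalue=""):
        # On a tie, most_common keeps first-seen order, so earlier engines win.
        word, _ = Counter(candidates).most_common(1)[0]
        if word:
            voted.append(word)
    return " ".join(voted)


def fuzzy_find(query, text, cutoff=0.8):
    """Return words in text that approximately match query (difflib stand-in)."""
    words = sorted(set(text.lower().split()))
    return get_close_matches(query.lower(), words, n=5, cutoff=cutoff)


# Hypothetical outputs from three OCR engines for the same line of newsprint.
engine_outputs = [
    "The city council met on Tuesday",
    "The c1ty council met on Tuesday",
    "The city council rnet on Tuesday",
]
voted_text = vote_ocr(engine_outputs)

# A misspelled query still finds the intended word in the voted text.
matches = fuzzy_find("councel", voted_text)
print(voted_text)
print(matches)
```

A production system would need a real alignment step (engines rarely agree on token counts) and likely character-level or confidence-weighted voting; the word-level vote here is chosen only because it is easy to follow.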