发明名称 Automated method for extracting highlighted regions in scanned source
摘要 An automated method for extracting highlighted regions in a scanned text documents includes color masking of highlight regions, extracting text from highlighted regions, recognizing the characters in extracted text optically and inserting the recognized characters to new document in order to easily identify highlighted text in scanned images. Using a two-layer multi-mask compression technology configured in a scanned export image path, edges and text regions can be extracted and together with the use of mask coordinates and associated mask colors, all highlighted texts can be easily identified and extracted. Optical Character Recognition (OCR) can then be utilized to appropriate summarization of different extracted highlighted texts.
申请公布号 US2007253620(A1) 申请公布日期 2007.11.01
申请号 US20060414053 申请日期 2006.04.27
申请人 XEROX CORPORATION 发明人 NAGARAJAN RAMESH;CAMPANELLI MICHAEL R.;SIMMONS ISAIAH
分类号 G06K9/34 主分类号 G06K9/34
代理机构 代理人
主权项
地址