发明名称 Method and apparatus for separating text from images
摘要 The invention described herein provides a method and apparatus for document processing that efficiently separates and interrelates single modalities, such as text, handwriting, and images. In particular, the present invention starts with the recognition of text characters and words for the efficient separation of text paragraphs from images by maintaining their relationships for a possible reconstruction of the original page. The text separation and extraction is based on a hierarchical framing process. The process starts with the framing of a single character, after its recognition, continues with the recognition and framing of a word, and ends with the framing of all text lines. The method and apparatus described herein can process different types of documents, such as typed, handwritten, skewed, mixed, but not half-tone ones.
申请公布号 US7082219(B2) 申请公布日期 2006.07.25
申请号 US20020314483 申请日期 2002.12.05
申请人 发明人
分类号 G06K9/00;G06K9/20;G06K9/34 主分类号 G06K9/00
代理机构 代理人
主权项
地址