Title of invention: SEGMENTATION OF A WORD BITMAP INTO INDIVIDUAL CHARACTERS OR GLYPHS DURING AN OCR PROCESS
Abstract: An image processing apparatus is provided that includes a character chopper component, which segments words into individual characters in the bitmap of a textual image undergoing an OCR process. The character chopper component is configured to produce a set of (possibly curved) chop-lines that divide the bitmap of any given word into its individual character or glyph candidates. When an input bitmap contains two separate words, the component marks the place where those words should be split. The character segmentation algorithm computes the set of vertically oriented, curved chop-lines by considering the glyph and background colors in a given word bitmap. The set is then filtered using various heuristics to preserve the lines that do separate a word's glyphs and to minimize the number of those that do not.
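The abstract does not say how the chop-lines are computed. As an illustration only, the sketch below uses a generic minimum-cost path search (similar to seam carving) on a binarized word bitmap, where 1 marks a glyph pixel and 0 marks background. The names `chop_line` and `keeps_glyphs_intact`, the constants `INK_COST` and `SHIFT_COST`, and the rule that a line may drift at most one column per row are all assumptions made for this example. They are not taken from the patent.

```python
from typing import List, Tuple

INK_COST = 100   # hypothetical penalty for cutting through a glyph pixel
SHIFT_COST = 1   # hypothetical penalty per sideways step (favors near-vertical lines)


def chop_line(bitmap: List[List[int]], start_col: int) -> Tuple[float, List[int]]:
    """Find the cheapest top-to-bottom path that starts at (row 0, start_col).

    bitmap[r][c] is 1 for a glyph pixel and 0 for background. The path may move
    at most one column left or right per row, so it can curve around glyphs.
    Returns (total cost, list holding the path's column index for each row).
    """
    rows, cols = len(bitmap), len(bitmap[0])
    inf = float("inf")
    cost = [[inf] * cols for _ in range(rows)]   # cheapest cost to reach (r, c)
    back = [[0] * cols for _ in range(rows)]     # predecessor column in row r - 1
    cost[0][start_col] = bitmap[0][start_col] * INK_COST

    for r in range(1, rows):
        for c in range(cols):
            for dc in (0, -1, 1):
                p = c + dc  # candidate predecessor column
                if 0 <= p < cols and cost[r - 1][p] < inf:
                    step = SHIFT_COST if dc else 0
                    cand = cost[r - 1][p] + step + bitmap[r][c] * INK_COST
                    if cand < cost[r][c]:
                        cost[r][c] = cand
                        back[r][c] = p

    # Pick the cheapest end point in the last row, then walk the predecessors back up.
    end = min(range(cols), key=lambda c: cost[rows - 1][c])
    path = [end]
    for r in range(rows - 1, 0, -1):
        path.append(back[r][path[-1]])
    path.reverse()
    return cost[rows - 1][end], path


def keeps_glyphs_intact(bitmap: List[List[int]], path: List[int]) -> bool:
    """Example filtering heuristic: keep only lines that never touch a glyph pixel."""
    return not any(bitmap[r][c] for r, c in enumerate(path))


# Two glyphs separated by a slanted gap: no single column is entirely background.
word = [
    [1, 1, 0, 1, 1, 1],
    [1, 1, 0, 1, 1, 1],
    [1, 1, 1, 0, 1, 1],
    [1, 1, 1, 0, 1, 1],
]
line_cost, line = chop_line(word, start_col=2)
print(line_cost, line, keeps_glyphs_intact(word, line))
```

In this toy bitmap a straight vertical cut would slice through ink, whereas the curved path follows the slanted gap. A real filtering stage would presumably combine several heuristics. The single check here only stands in for that idea.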
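Publication number: US2011274354(A1) Publication date: 2011.11.10
Application number: US20100776576 Filing date: 2010.05.10
Applicant: MICROSOFT CORPORATION Inventor: NIJEMCEVIC DJORDJE
Classification: G06K9/34 Main classification: G06K9/34
Agency: Agent:
Main claim:
Address: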