Title of invention: Method for identifying word bounding boxes in text
Abstract: A method for determining the boundaries of text or character strings represented in an array of image data by shape, without a requirement for individually detecting and/or identifying the character or characters making up the strings. The method relies upon the detection of connected components within words to first determine text line boundaries and to isolate the connected components into text rows. Subsequently, the structural relationships between the components within and defining rows (i.e., overlap, inter-character spacing, and inter-word spacing) are used to further combine adjacent sets of connected components into words or similar units of semantic understanding within text rows.
Publication number: US5410611(A) Publication date: 1995.04.25
Application number: US19930169949 Filing date: 1993.12.17
Applicant: XEROX CORPORATION Inventors: HUTTENLOCHER, DANIEL P.; JAQUITH, ERIC W.
Classification: G06K9/20; G06K9/32; G06K9/40; (IPC1-7): G06K9/34 Main classification: G06K9/20
Agency: Agent:
Principal claim:
Address:
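
The two stages described in the abstract (isolating connected components into text rows, then combining adjacent components into words by overlap and spacing) can be illustrated with a minimal sketch. This is not the patented procedure: it assumes the connected components have already been reduced to axis-aligned bounding boxes, and the function names, the `Box` layout, and the fixed `max_gap` threshold are hypothetical choices made for illustration. The record above does not state how the inter-character and inter-word spacing is actually distinguished.

```python
from typing import List, Tuple

# Hypothetical box layout: (x0, y0, x1, y1) in pixels, with x1 and y1 exclusive.
Box = Tuple[int, int, int, int]


def union(a: Box, b: Box) -> Box:
    """Smallest box that encloses both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def group_into_rows(boxes: List[Box]) -> List[List[Box]]:
    """Place each box in the first row whose vertical extent it overlaps."""
    rows: List[List[Box]] = []
    extents: List[Tuple[int, int]] = []  # (top, bottom) of each row so far
    for box in sorted(boxes, key=lambda b: b[1]):
        for i, (top, bottom) in enumerate(extents):
            if box[1] < bottom and box[3] > top:  # vertical overlap with row i
                rows[i].append(box)
                extents[i] = (min(top, box[1]), max(bottom, box[3]))
                break
        else:
            # No existing row overlaps this box, so it starts a new row.
            rows.append([box])
            extents.append((box[1], box[3]))
    return rows


def merge_into_words(row: List[Box], max_gap: int) -> List[Box]:
    """Merge left-to-right neighbours whose horizontal gap is at most max_gap.

    Overlapping boxes have a negative gap and are therefore merged as well.
    max_gap is an assumed, fixed threshold, not a value from the patent.
    """
    words: List[Box] = []
    for box in sorted(row, key=lambda b: b[0]):
        if words and box[0] - words[-1][2] <= max_gap:
            words[-1] = union(words[-1], box)
        else:
            words.append(box)
    return words


def word_boxes(boxes: List[Box], max_gap: int) -> List[List[Box]]:
    """Word bounding boxes, grouped by text row."""
    return [merge_into_words(row, max_gap) for row in group_into_rows(boxes)]
```

For example, `word_boxes([(0, 0, 5, 10), (6, 0, 11, 10), (20, 0, 25, 10)], max_gap=3)` merges the first two components into one word box and leaves the third as a separate word on the same row. A fixed threshold is the simplest possible choice; a threshold derived from the observed gap distribution in each row would be closer to the "inter-character spacing" versus "inter-word spacing" distinction the abstract mentions, but that is likewise an assumption here.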