发明名称 Precise identification of text pixels from scanned document images
摘要 A system or method for identifying text in a document. A group of connected components is created. A plurality of characteristics of different types is calculated for each connected component. Statistics are computed which describe the group of characteristics. Outlier components are identified as connected components whose computed characteristics are outside a statistical range. The outlier components are removed from the group of connected components. Text pixels are identified by segmenting pixels in the group of connected components into a group of text pixels and a group of background pixels.
申请公布号 US7873215(B2) 申请公布日期 2011.01.18
申请号 US20070769467 申请日期 2007.06.27
申请人 SEIKO EPSON CORPORATION 发明人 XIAO JING;BHATTACHARJYA ANOOP K.
分类号 G06K9/34 主分类号 G06K9/34
代理机构 代理人
主权项
地址