发明名称 Neural network based character position detector for use in optical character recognition
摘要 Apparatus, and an accompanying method, for use in an optical character recognition (OCR) system (5) for locating, e.g., center positions ("hearts") of all desired characters within a field (310; 510) of characters such that the desired characters can be subsequently recognized using an appropriate classification process. Specifically, a window (520) is slid in a step-wise convolutional-like fashion (5201, 5202, 5203) across a field of preprocessed, specifically uniformly scaled, characters. Each pixel in the window is applied as an input to a positioning neural network (152) that has been trained to produce an output activation whenever a character "heart" is spatially coincident with a pixel position within an array (430) centrally located within the window. As the window is successively moved across the field, in a stepped fashion, the activation outputs of the neural network are averaged, on a weighted basis, for each different window position and separately for each horizontal pixel position in the field. The resulting averaged activation output values, typically in the form of a Gaussian distribution for each character, are then filtered, thresholded and then used, via a weighted average calculation with horizontal pixel positions being used as the weights, to determine the character "heart" position as being the center pixel position in the distribution.
申请公布号 US5542006(A) 申请公布日期 1996.07.30
申请号 US19940263304 申请日期 1994.06.21
申请人 发明人
分类号 G06K9/34;G06K9/66;(IPC1-7):G06K9/62 主分类号 G06K9/34
代理机构 代理人
主权项
地址