发明名称 Data processing.
摘要 An existing character (42), in a text defined in image form by data such as a two-dimensional array, is copied to add a new character to the text. The existing character is found by performing character recognition on a two-dimensional data array defining an image that includes part of the text, such as a page. The array can be obtained from a scanner. A word (30) that is recognized as including characters of the type needed is tested to determine whether it can be divided into the correct number of characters. The word is divided by finding connected components in the part of the array in which the word was found during recognition. The connected components are grouped into sets, each set being likely to be a character. If the word can be correctly divided, character-size arrays for its characters are obtained and saved. One of the arrays for the character type of the new character is selected and used to produce an array for the word in which it is included. The new word's array is then used to produce an array for a line in which the new word replaces an old word. The characters of the new word are spaced according to the spacing of the characters of the old word. The new character is positioned transverse to the line based on the transverse positioning of the existing character. The interword spaces of the line are adjusted. The line's array is then used to produce data defining a modified version of the text in image form. The old word can be an incorrectly spelled word, detected by spell-checking the character recognition results. Alternative spelling corrections can be presented to the user for selection of a correction to be used.
申请公布号 EP0439951(A2) 申请公布日期 1991.08.07
申请号 EP19900314107 申请日期 1990.12.21
申请人 XEROX CORPORATION 发明人 BAGLEY, STEVEN C.;KAPLAN RONALD M.;HICKS, WAYLAND R.;DAVIES, DANIEL
分类号 G06K9/03;G06F17/21;G06K9/00;G06K9/72;G06T11/60 主分类号 G06K9/03
代理机构 代理人
主权项
地址