Title of invention METHOD FOR WORD RECOGNITION IN CHARACTER SEQUENCES
Abstract The method according to the invention recognizes words in sequences of N characters, one or more of which may be ambiguous, using a memory (15), a display (13), and a processor device (12). The memory stores n-grams (character chains of length n) together with associated frequency values; the frequency value of an n-gram is the total number of all n-grams in a language sample used for word recognition. The display (13) shows selected n-grams and/or recognized words, and the processor device (12) is connected to the memory (15) and the display (13). From an examined character sequence, a list L is prepared of all n-grams with N characters that can be formed from the individual characters of the N-character sequence, taking into account the ambiguity of the characters in that sequence. All n-gram combinations with a word probability of zero are removed from the list L of possible n-gram combinations, where the word probability $p = \prod p_n$ is determined from the n-grams included in the character sequence with n = 1 to N-1. The words (14) represented by the n-gram combinations remaining in the list L are displayed.
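The filtering step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The keypad-style ambiguity map `KEYPAD`, the function names, the use of relative frequencies as the n-gram probabilities, and the reading of "n = 1 to N-1" as "all substrings of length 1 to N-1" are assumptions made only for this illustration.

```python
from itertools import product

# Hypothetical ambiguity map: each key stands for several letters.
KEYPAD = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
          "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}


def ngram_counts(sample_words, max_n):
    """Count every n-gram (n = 1..max_n) occurring in the sample words."""
    counts = {}
    for word in sample_words:
        for n in range(1, max_n + 1):
            for i in range(len(word) - n + 1):
                gram = word[i:i + n]
                counts[gram] = counts.get(gram, 0) + 1
    return counts


def word_probability(candidate, counts, totals):
    """Multiply the relative frequencies of all n-grams in the candidate.

    Assumption: n runs from 1 to N-1 (n = 1 only for a one-character
    candidate). An unseen n-gram makes the product zero.
    """
    length = len(candidate)
    p = 1.0
    for n in range(1, max(length, 2)):
        total = totals.get(n, 0)
        if total == 0:
            return 0.0
        for i in range(length - n + 1):
            p *= counts.get(candidate[i:i + n], 0) / total
            if p == 0.0:
                return 0.0
    return p


def recognize(keys, sample_words):
    """Return the candidate words whose word probability is non-zero."""
    if not keys:
        return []
    max_n = max(len(keys) - 1, 1)
    counts = ngram_counts(sample_words, max_n)
    totals = {}  # number of n-grams of each length n in the sample
    for gram, count in counts.items():
        totals[len(gram)] = totals.get(len(gram), 0) + count
    # List L: every string obtainable from the ambiguous input;
    # characters without an entry in KEYPAD are treated as unambiguous.
    candidates = ("".join(chars)
                  for chars in product(*(KEYPAD.get(k, k) for k in keys)))
    scored = [(word_probability(c, counts, totals), c) for c in candidates]
    survivors = [(p, c) for p, c in scored if p > 0.0]
    survivors.sort(key=lambda pc: (-pc[0], pc[1]))  # most probable first
    return [c for _, c in survivors]
```

With the toy sample `["good", "home", "gone", "hood"]`, the call `recognize("4663", sample)` keeps exactly those four words; a candidate such as "hone" is dropped because the trigram "hon" never occurs in the sample.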
Publication number WO2008116843(A3) Publication date 2009.01.29
Application number WO2008EP53430 Filing date 2008.03.20
Applicant DEINZER, FRANK Inventor DEINZER, FRANK
Classification G06F17/27;G06F3/023 Main classification G06F17/27
Agency Agent
Principal claim
Address