发明名称 Document image decoding using an integrated stochastic language model
摘要 A text recognition system represents the decoded message of a document image as a path through an image network. A method for integrating a language model into the network selectively expands the network to accommodate the language model only for certain ones of the paths in the network, effectively managing the memory storage requirements and computational complexities of integrating the language model efficiently into the network. The language model generates probability distributions indicating the probability of a certain character occurring in a string, given one or more previous characters in the string. Selectively expanding the image network is achieved by initially using upper bounds on the language model probabilities on the branches of an unexpanded image network. A best path search operation is then performed to determine an estimated best path through the image network using these upper bound scores. After decoding, only the nodes on the estimated best path are expanded with new nodes and with branches incoming to the new nodes that accommodate new language model scores reflecting actual character histories in place of the upper bound scores. Decoding and selectively expanding the image network are repeated until the final output transcription of the text image has been produced.
申请公布号 US6678415(B1) 申请公布日期 2004.01.13
申请号 US20000570730 申请日期 2000.05.12
申请人 XEROX CORPORATION 发明人 POPAT ASHOK C.;BLOOMBERG DAN S.;GREENE DANIEL H.
分类号 G06F17/10;G06K9/62;G06K9/70;G06K9/72;G06T7/00;H03M7/30;(IPC1-7):G06K9/62 主分类号 G06F17/10
代理机构 代理人
主权项
地址