发明名称 Handwriting and speech recognizer
摘要 A method and system for recognizing user input information including cursive handwriting and spoken words. A time-delayed neural network having an improved architecture is trained at the word level with an improved method, which, along with preprocessing improvements, results in a recognizer with greater recognition accuracy. Preprocessing is performed on the input data and, for example, may include resampling the data with sample points based on the second derivative to focus the recognizer on areas of the input data where the slope change per time is greatest. The input data is segmented, featurized and fed to the time-delayed neural network which outputs a matrix of character scores per segment. The neural network architecture outputs a separate score for the start and the continuation of a character. A dynamic time warp (DTW) is run against dictionary words to find the most probable path through the output matrix for that word, and each word is assigned a score based on the least costly path that can be traversed through the output matrix. The word (or words) with the overall lowest score (or scores) are returned. A DTW is similarly used in training, whereby the sample ink only need be labeled at the word level.
申请公布号 AU2193100(A) 申请公布日期 2000.07.24
申请号 AU20000021931 申请日期 1999.12.16
申请人 MICROSOFT CORPORATION 发明人 ANGSHUMA GUHA;PATRICK M. HALUPTZOK;JAMES A. PITTMAN
分类号 G06K9/00;G06K9/22;G06K9/72 主分类号 G06K9/00
代理机构 代理人
主权项
地址