发明名称 CONTINUOUS SPEECH RECOGNITION
摘要 <p>IMPROVEMENTS IN CONTINUOUS SPEECH RECOGNITION An improved speech recognition method and apparatus for recognizing keywords in a continuous audio signal are disclosed. The keywords, generally either a word or a string of words, are each represented by an element template defined by a plurality of target patterns. Each target pattern is represented by a plurality of statistics describing the expected behavior of a group of spectra selected from plural short-term spectra generated by processing of the incoming audio. The incoming audio spectra are processed to enhance the separation between the spectral pattern classes during later analysis. The processed audio spectra are grouped into multi-frame spectral patterns and are compared, using likelihood statistics, with the target patterns of the element templates. Each multi-frame pattern is forced to contribute to each of a plurality of pattern scores as represented by the element templates. The method and apparatus use speaker independent word models during the training stage to generate, automatically, improved target patterns. The apparatus and method further employ grammatical syntax during the training stage for identifying the boundaries of unknown keywords. During the recognition process, improved performance is achieved by use of alternate spellings for "silence" and memory requirements and the computational load is reduced using an augmented grammatical syntax. A concatenation technique is employed, using dynamic programming techniques, to determine the correct identity of the word string.</p>
申请公布号 CA1182223(A) 申请公布日期 1985.02.05
申请号 CA19820412844 申请日期 1982.10.05
申请人 EXXON CORPORATION 发明人 BAHLER, LAWRENCE G.
分类号 G10L11/00;G10L15/00;G10L15/04;G10L15/06;G10L15/10;G10L15/12;G10L15/18;G10L15/28;(IPC1-7):G10L1/04 主分类号 G10L11/00
代理机构 代理人
主权项
地址