摘要 |
<p>METHOD AND APPARATUS FOR CONTINUOUS WORD STRING-RECOGNITION A speech recognition method and apparatus for recognizing word strings in a continuous audio signal are disclosed. The word strings are made up of a plurality of elements, and each element, generally a word, is represented by an element template defined by a plurality of target patterns. Each target pattern is represented by a plurality of statistics describing the expected behavior of a group of spectra selected from plural short-term spectra generated by processing of the incoming audio. The incoming audio spectra are processed to enhance the separation between the spectral pattern classes during later analysis. The processed audio spectra are grouped into multi-frame spectral patterns and are compared, using likelihood statistics, with the target patterns of the element templates. Each multi-frame pattern is forced to contribute to each of a pluarality of pattern scores as represented by the element templates. A concatenation technique is employed, using dynamic programming techniques, to determine the correct identity of the word string.</p> |