Title of invention: Use of instantaneous and transitional spectral information in speech recognizers
Abstract: Any of several prior-art word recognizers is improved by changing its speech analysis portion to include a more comprehensive spectral representation that can be characterized as two-dimensional. The first dimension is the "freeze-frame", or instantaneous, sample of the spectral characteristics, derived from a typical 45 millisecond sample of the speech to be recognized. The second dimension, in a sense orthogonal to the first, spans several such time frames and yields what amounts to a time derivative of the spectral properties obtained in the current time frame. When the input speech is compared with the reference patterns, essentially equal weight is given to both parts of the two-dimensional reference spectral pattern and to both parts of the input speech spectral pattern. That is, the essentially instantaneous information and the time-derivative information, which spans a plurality of neighboring time frames including the frame of interest, are treated as equally important. Also disclosed are the use of cepstral information and time-derivative cepstral information derived by linear prediction coefficient methods, and the specific use in connected-word recognizers that employ level-building concepts or two-stage processing.
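Illustrative sketch (not part of the published record): the two-part representation described in the abstract can be pictured as a per-frame feature vector, such as cepstral coefficients, paired with an estimate of its time derivative taken over neighboring frames. The abstract does not say how the derivative is estimated or how the distance is formed, so the least-squares slope over a window of `2 * half_window + 1` frames, the edge-frame replication, the squared Euclidean distance, and the names `delta_features`, `combined_distance`, `half_window`, and `delta_weight` are all assumptions made for this example. Setting `delta_weight` to 1.0 is one possible reading of "essentially equal weight".

```python
from typing import List, Sequence

Frame = List[float]


def delta_features(frames: Sequence[Sequence[float]], half_window: int = 2) -> List[Frame]:
    """Estimate a per-coefficient time derivative for every frame.

    Assumption: the derivative is the least-squares slope over
    2 * half_window + 1 neighboring frames. Frames requested beyond
    either end of the utterance are replaced by the nearest edge frame.
    """
    if half_window < 1:
        raise ValueError("half_window must be at least 1")
    n = len(frames)
    # Normalizer of the regression slope: sum of k^2 over the window.
    norm = float(sum(k * k for k in range(-half_window, half_window + 1)))
    deltas: List[Frame] = []
    for t in range(n):
        dim = len(frames[t])
        slope = [0.0] * dim
        for k in range(-half_window, half_window + 1):
            idx = min(max(t + k, 0), n - 1)  # clamp to a valid frame index
            for i in range(dim):
                slope[i] += k * frames[idx][i]
        deltas.append([s / norm for s in slope])
    return deltas


def combined_distance(
    inst_a: Sequence[float],
    delta_a: Sequence[float],
    inst_b: Sequence[float],
    delta_b: Sequence[float],
    delta_weight: float = 1.0,
) -> float:
    """Frame-to-frame distance using both parts of the representation.

    Assumption: squared Euclidean distance on each part, summed, with
    delta_weight = 1.0 standing in for "essentially equal weight".
    """
    instantaneous = sum((x - y) ** 2 for x, y in zip(inst_a, inst_b))
    transitional = sum((x - y) ** 2 for x, y in zip(delta_a, delta_b))
    return instantaneous + delta_weight * transitional
```

In a template-matching recognizer, a distance of this kind would stand in for the purely instantaneous frame distance inside the time-alignment step. In practice the two parts may need to be scaled to comparable ranges before one weight can be called equal; that scaling is omitted here.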
Publication number: EP0316112(A2) Publication date: 1989.05.17
Application number: EP19880310337 Filing date: 1988.11.03
Applicant: AMERICAN TELEPHONE AND TELEGRAPH COMPANY Inventors: RABINER, LAWRENCE RICHARD; SOONG, FRANK KAO-PING; WILPON, JAY GORDON
Classification: G10L15/10; G10L15/00 Main classification: G10L15/10
Agency: Agent:
Main claim:
Address: