发明名称 Method and apparatus for speech analysis and speech recognition.
摘要 <p>A method and apparatus are disclosed for speech analysis and speech recognition. Each speech utterance under examination in accordance with the method of the present invention is digitally sampled and represented as a temporal sequence of data frames. Each data frame is then analyzed by the application of a Fast Fourier Transform (FFT) to obtain an indication of the energy content of each data frame in a plurality of frequency bands or bins. An indication of each of the most significant frequency bands, in terms of energy content, are then plotted by bin number for all data frames and graphically combined to create a power content signature for the speech utterance which is indicative of the movement of audio power through the audio spectrum over time for that utterance. By comparing the power content signature of an unknown speech utterance to a number of previously stored power content signatures, each associated with a known utterance, it is possible to identify an unknown speech utterance with a high degree of accuracy. In one preferred embodiment of the present invention, comparisons of power content signatures from unknown speech utterances are made with stored power content signatures utilizing a least squares fit or other suitable technique.</p>
申请公布号 EP0485315(A2) 申请公布日期 1992.05.13
申请号 EP19910480157 申请日期 1991.10.10
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 JACKSON, JOHN W.
分类号 G10L11/00;G10L15/00;G10L15/02;G10L15/10;G10L21/06 主分类号 G10L11/00
代理机构 代理人
主权项
地址