发明名称 SPEECH PROCESSING SYSTEM
摘要 A speech processing system (10) incorporates an analogue to digital converter (16) to digitize input speech signals for Fourier transformation to produce short-term spectral cross-sections. These cross-sections are compared with one hundred and fifty reference patterns in a store (34), the patterns having respective stored sets of formant frequencies assigned thereto by a human expert. Six stored patterns most closely matching each input cross-section are selected for further processing by dynamic programming, which indicates the pattern which is a best match to the input cross-section by using frequency-scale warping to achieve alignment. The stores formant frequencies of the best matching pattern are modified by the frequency warping, and the results are used as formant frequency estimates for the input cross-section. The frequencies are further refined on the basis of the shape of the input cross-section near to the chosen formants. Formant amplitudes are produced from input cross-section amplitudes at estimated formant frequencies. The formant frequencies and amplitudes are used with a computer (25) to provide speech indications or with a Hidden Markov Model word matcher (24) to provide word recognition.
申请公布号 EP0938727(A1) 申请公布日期 1999.09.01
申请号 EP19970945008 申请日期 1997.10.13
申请人 QINETIQ LIMITED 发明人 HOLMES, JOHN, NICHOLAS
分类号 G10L11/00;G10L15/02;G10L15/10;G10L15/12;G10L15/14;(IPC1-7):G10L9/06 主分类号 G10L11/00
代理机构 代理人
主权项
地址