Title of invention |
Noise adaptation system of speech model, noise adaptation method, and noise adaptation program for speech recognition |
Abstract |
An object of the present invention is to enable optimal clustering for many types of noise data and to improve the accuracy of estimation of a speech model sequence of input speech. Noise is added to speech in accordance with signal-to-noise ratio conditions to generate noise-added speech (step S1), the mean value of the speech cepstrum is subtracted from the generated noise-added speech (step S2), a Gaussian distribution model of each piece of noise-added speech is created (step S3), and the likelihoods of the pieces of noise-added speech are calculated to generate a likelihood matrix (step S4), from which a clustering result is obtained. An optimum model is selected (step S7) and linear transformation is performed to provide a maximized likelihood (step S8). Because noise-added speech is used consistently in both clustering and model learning, optimal clustering for many types of noise data and accurate estimation of a speech model sequence can be achieved.
|
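Steps S1 to S4 of the abstract can be illustrated with a rough sketch. This is not the patented implementation: the function names, the frame parameters, the plain FFT-based real cepstrum, the diagonal-covariance Gaussian, and the choice to subtract each utterance's own cepstral mean are all assumptions made for illustration. The clustering of the likelihood matrix, model selection (step S7), and linear transformation (step S8) are not shown.

```python
import numpy as np

def add_noise_at_snr(speech, noise, snr_db):
    """Step S1 (sketch): scale the noise to the requested SNR in dB and add it."""
    noise = np.resize(noise, speech.shape)  # repeat or truncate noise to match length
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2)
    gain = np.sqrt(p_speech / (p_noise * 10.0 ** (snr_db / 10.0)))
    return speech + gain * noise

def real_cepstrum(signal, frame_len=256, hop=128, n_coef=13):
    """Frame the signal and return the first n_coef real-cepstrum coefficients per frame.
    Assumed feature extractor; the record does not specify one."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] for i in range(n_frames)])
    spectrum = np.abs(np.fft.rfft(frames * np.hanning(frame_len), axis=1))
    return np.fft.irfft(np.log(spectrum + 1e-10), axis=1)[:, :n_coef]

def cepstral_mean_subtraction(cepstra):
    """Step S2 (sketch): subtract the per-coefficient mean over all frames."""
    return cepstra - cepstra.mean(axis=0, keepdims=True)

def fit_diag_gaussian(features, var_floor=1e-6):
    """Step S3 (sketch): one diagonal-covariance Gaussian per piece of noise-added speech."""
    return features.mean(axis=0), features.var(axis=0) + var_floor

def avg_log_likelihood(features, model):
    """Average per-frame log-likelihood of features under a diagonal Gaussian."""
    mean, var = model
    ll = -0.5 * (np.log(2.0 * np.pi * var) + (features - mean) ** 2 / var).sum(axis=1)
    return ll.mean()

def likelihood_matrix(feature_sets):
    """Step S4 (sketch): entry [i, j] scores feature set i under the model of set j."""
    models = [fit_diag_gaussian(f) for f in feature_sets]
    return np.array([[avg_log_likelihood(f, m) for m in models] for f in feature_sets])
```

In a sketch like this, each noise type and SNR condition would yield one feature set, and the resulting square matrix could then be passed to a clustering procedure of the reader's choice.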
Publication number |
US7552049(B2) |
Publication date |
2009.06.23 |
Application number |
US20040796283 |
Filing date |
2004.03.10 |
Applicant |
NTT DOCOMO, INC.;SADAOKI FURUI |
Inventors |
ZHANG ZHIPENG;OTSUJI KIYOTAKA;SUGIMURA TOSHIAKI;FURUI SADAOKI |
Classification codes |
G10L15/00;G10L15/06;G10L15/02;G10L15/14;G10L15/20 |
Main classification code |
G10L15/00 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|