Title of invention: Noise adaptation system of speech model, noise adaptation method, and noise adaptation program for speech recognition
Abstract: An object of the present invention is to enable optimal clustering for many types of noise data and to improve the accuracy of estimation of a speech model sequence of input speech. Noise is added to speech in accordance with noise-to-signal ratio conditions to generate noise-added speech (step S1); the mean value of the speech cepstrum is subtracted from the generated noise-added speech (step S2); a Gaussian distribution model of each piece of noise-added speech is created (step S3); and the likelihoods of the pieces of noise-added speech are calculated to generate a likelihood matrix (step S4), from which a clustering result is obtained. An optimum model is selected (step S7), and a linear transformation is performed so as to maximize the likelihood (step S8). Because noise-added speech is used consistently both in clustering and in model learning, clustering for many types of noise data and accurate estimation of a speech model sequence can be achieved.
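Steps S1 to S4 of the abstract can be illustrated with a minimal NumPy sketch. It is not the patented implementation: the function names, the FFT-based real-cepstrum front end, the single diagonal-covariance Gaussian per noise-added set, the expression of the mixing condition as a conventional SNR in dB, and the synthetic signals are all assumptions made for illustration.

```python
import numpy as np

def add_noise_at_snr(speech, noise, snr_db):
    """Step S1 (sketch): scale noise to a target SNR in dB and add it to speech."""
    noise = np.resize(noise, speech.shape)  # repeat or truncate to match length
    gain = np.sqrt(np.mean(speech ** 2) / (np.mean(noise ** 2) * 10.0 ** (snr_db / 10.0)))
    return speech + gain * noise

def cepstra(signal, frame_len=256, hop=128, n_ceps=12):
    """Hypothetical front end: low-order real cepstrum of each Hamming-windowed frame."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] for i in range(n_frames)])
    log_mag = np.log(np.abs(np.fft.rfft(frames * np.hamming(frame_len), axis=1)) + 1e-10)
    return np.fft.irfft(log_mag, axis=1)[:, 1:n_ceps + 1]

def subtract_cepstral_mean(feats):
    """Step S2: remove the per-utterance mean of each cepstral coefficient."""
    return feats - feats.mean(axis=0, keepdims=True)

def fit_gaussian(feats):
    """Step S3 (sketch): one diagonal-covariance Gaussian per noise-added set."""
    return feats.mean(axis=0), np.maximum(feats.var(axis=0), 1e-8)

def avg_log_likelihood(feats, mean, var):
    """Average per-frame log-likelihood of feats under a diagonal Gaussian."""
    ll = -0.5 * (np.log(2.0 * np.pi * var) + (feats - mean) ** 2 / var)
    return ll.sum(axis=1).mean()

def likelihood_matrix(feature_sets):
    """Step S4: entry [i, j] scores noise-added set i under the model of set j."""
    models = [fit_gaussian(f) for f in feature_sets]
    return np.array([[avg_log_likelihood(f, m, v) for m, v in models] for f in feature_sets])

# Synthetic stand-ins for real recordings (assumptions for the demo only).
rng = np.random.default_rng(0)
speech = rng.standard_normal(8000)
white = rng.standard_normal(8000)
noises = {"white": white, "smoothed": np.convolve(white, np.ones(8) / 8.0, mode="same")}
snrs_db = [5.0, 15.0]

feature_sets = [
    subtract_cepstral_mean(cepstra(add_noise_at_snr(speech, n, snr)))
    for n in noises.values() for snr in snrs_db
]
L = likelihood_matrix(feature_sets)
print(L.shape)
```

Each row of `L` describes how well one noise-added condition is explained by the models of all the others, so rows with similar profiles are natural candidates for the same cluster. The clustering itself, the model selection (step S7), and the likelihood-maximizing linear transformation (step S8) are not sketched here, since the abstract does not specify them in enough detail.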
Publication number: US2004204937(A1)  Publication date: 2004.10.14
Application number: US20040796283  Filing date: 2004.03.10
Applicant: NTT DOCOMO, INC.  Inventors: ZHANG ZHIPENG;OTSUJI KIYOTAKA;SUGIMURA TOSHIAKI;FURUI SADAOKI
Classification: G10L15/06;G10L15/02;G10L15/14;G10L15/20;(IPC1-7):G10L15/00  Main classification: G10L15/06
Agency:  Agent:
Principal claim:
Address: