发明名称
摘要 An object of the present invention is to enable optimal clustering for many types of noise data and to improve the accuracy of estimation of a speech model sequence of input speech. <??>Noise is added to speech in accordance with noise-to-signal ration conditions to generated noise-added speech (step S1), the mean value of speech cepstral is subtracted from the generated, noise-added speech (step S2), a Gaussian distribution model of each piece of noise-added speech is created (step S3), the likelihoods of the pieces of noise-added speech are calculated to generate a likelihood matrix (step S4) to obtain a clustering result. An optimum model is selected (step S7) and linear transformation is performed to provide a maximized likelihood (step S8). <??>Because noise-added speech is consistently used both in clustering and model learning, clustering for many types of noise data and an accurate estimation of a speech model sequence can be achieved. <IMAGE>
申请公布号 JP4033299(B2) 申请公布日期 2008.01.16
申请号 JP20030066933 申请日期 2003.03.12
申请人 发明人
分类号 G10L15/06;G10L15/02;G10L15/14;G10L15/20 主分类号 G10L15/06
代理机构 代理人
主权项
地址