发明名称 METHOD FOR SPECTRAL ESTIMATION TO IMPROVE NOISE ROBUSTNESS FOR SPEECH RECOGNITION
摘要 A method is disclosed for use in preprocessing noisy speech to minimize likelihood of error in estimation for use in a recognizer. The computationally-feasible technique, herein called Minimum-Mean-Log-Spectral-Distance (MMLSD) estimation using mixture models and Markov models, comprises the steps of calculating for each vector of speech in the presence of noise corresponding to a single time frame, an estimate of clean speech, where the basic assumptions of the method of the estimator are that the probability distribution of clean speech can be modeled by a mixture of components each representing a different speech class assuming different frequency channels are uncorrelated within each class and that noise at different frequency channels is uncorrelated. In a further embodiment of the invention, the method comprises the steps of calculating for each sequence of vectors of speech in the presence of noise corresponding to a sequence of time frames, an estimate of clean speech, where the basic assumptions of the method of the estimator are that the probability distribution of clean speech can be modeled by a Markov process assuming different frequency channels are uncorrelated within each state of the Markov process and that noise at different frequency channels is uncorrelated.
申请公布号 WO9113430(A1) 申请公布日期 1991.09.05
申请号 WO1991US01333 申请日期 1991.02.25
申请人 SRI INTERNATIONAL 发明人 ERELL, ADORAM;WEINTRAUB, MITCHEL
分类号 G10L15/02;G10L15/20;G10L21/02;H04B15/00 主分类号 G10L15/02
代理机构 代理人
主权项
地址