发明名称 Speech recognition method, storage medium storing speech recognition program, and speech recognition apparatus
摘要 The present invention provides a speech recognition method for achieving a high recognition rate even under an environment where plural types of noise exist. Noise is eliminated by the spectral subtraction noise elimination method from each of speech data on which different types of noise are superposed, and acoustic models corresponding to each of the noise types are created based on the feature vectors obtained by analyzing the features of each of the speech data which have undergone the noise elimination. When a speech recognition is performed, a first speech feature analysis is performed on speech data to be recognized, and it is determined whether the speech data is a noise segment or a speech segment. When a noise segment is detected, the feature data thereof is stored, and when a speech segment is detected, the type of the noise is determined based on the feature data which has been stored, and a corresponding acoustic model is selected based on the result thereof. The noise is eliminated by the spectral subtraction noise elimination method from the speech data to be recognized, and a second feature analysis is performed on the speech data which has undergone the noise elimination to obtain a feature vector to be used in speech recognition.
申请公布号 US2002049587(A1) 申请公布日期 2002.04.25
申请号 US20010981996 申请日期 2001.10.19
申请人 SEIKO EPSON CORPORATION 发明人 MIYAZAWA YASUNAGA
分类号 G10L15/06;G10L15/20;G10L21/02;(IPC1-7):G10L15/20;G10L15/00 主分类号 G10L15/06
代理机构 代理人
主权项
地址