发明名称 SPEAKER NORMALIZING PROCESSOR AND VOICE RECOGNITION DEVICE
摘要 PROBLEM TO BE SOLVED: To generate an acoustic model and to recognize a voice by highly accurately estimating a frequency warping function, executing speaker normalization and learning an initial HMM. SOLUTION: A speaker normalization processing part 12 estimates the featured value of a vocal path shape to be the anatomical shape of each learning speaker's vocal path based on the vocal waveform data of each learning speaker while referring to correspondence relation between a vocal path shape parameter determined based on the vocal path model of a reference speaker and formant frequency, and also estimates the formant frequency based on the estimated result and generates a frequency warping function by straight interpolating the estimated formant frequency and the formant frequency corresponding to the reference speaker in relation to the frequency warping function. After applying speaker normalization to the voice waveform data of each learning speaker by using the frequency warping function, the processing part 12 extracts an acoustic feature parameter and learns an initial HMM based on text data corresponding to the acoustic feature parameter to generate a normalized HMM. Thus voice recognition is executed by using the HMM.
申请公布号 JPH11327592(A) 申请公布日期 1999.11.26
申请号 JP19990011720 申请日期 1999.01.20
申请人 ATR ONSEI HONYAKU TSUSHIN KENKYUSHO:KK 发明人 NAITO MASAKI;RI DEN;KOSAKA YOSHINORI
分类号 G10L15/02;G10L15/06;G10L15/10;G10L15/14;G10L15/20;G10L21/02;(IPC1-7):G10L3/02;G10L3/00 主分类号 G10L15/02
代理机构 代理人
主权项
地址