Title of invention: Method of speaker normalization for speech recognition using frequency conversion and speech recognition apparatus applying the preceding method
Abstract: An input speech utterance is segmented into frames of a predetermined time length, and an acoustic feature parameter is extracted from each frame. The acoustic feature parameter is frequency-converted using a plurality of predefined frequency conversion coefficients. Using all combinations of the resulting post-conversion feature parameters and at least one standard phonemic model, a plurality of similarities or distances between the post-conversion feature parameters of each frame and the standard phonemic model are computed. A frequency converting condition for normalizing the input utterance is determined from these similarities or distances, and the input utterance is normalized using that condition. With this method, even when the speaker changes, individual differences in the input utterance can be corrected, thereby improving speech recognition performance.
Publication number: US2004117181(A1)
Publication date: 2004.06.17
Application number: US20030670636
Filing date: 2003.09.24
Applicant: MORII KEIKO; NAKATOH YOSHIHISA; KUWANO HIROYASU
Inventor: MORII KEIKO; NAKATOH YOSHIHISA; KUWANO HIROYASU
Classification: G10L15/00; G10L17/00; (IPC1-7): G10L15/00
Main classification: G10L15/00
Agency:
Agent:
Main claim:
Address:
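
The procedure outlined in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption, not taken from the patent: the function names (`split_into_frames`, `magnitude_spectrum`, `warp_frequency`, `choose_warp`) are hypothetical, a naive DFT magnitude stands in for the acoustic feature parameter, the frequency conversion is modeled as a linear warp of the frequency axis, the "standard phonemic model" is a list of spectral templates, and the converting condition is chosen by summing per-frame Euclidean distances and taking the smallest total.

```python
import math

def split_into_frames(samples, frame_len):
    """Cut the signal into consecutive, non-overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, frame_len)]

def magnitude_spectrum(frame):
    """Naive DFT magnitude for bins 0..N/2 (assumed stand-in for the feature)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * t / n) for t, x in enumerate(frame))
        im = sum(x * math.sin(2 * math.pi * k * t / n) for t, x in enumerate(frame))
        spectrum.append(math.hypot(re, im))
    return spectrum

def warp_frequency(spectrum, alpha):
    """Assumed linear frequency conversion: output bin k reads input position k * alpha."""
    last = len(spectrum) - 1
    warped = []
    for k in range(len(spectrum)):
        pos = min(k * alpha, last)      # clamp to the highest available bin
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        # Linear interpolation between the two neighbouring input bins.
        warped.append((1 - frac) * spectrum[lo] + frac * spectrum[hi])
    return warped

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def choose_warp(samples, frame_len, alphas, model_templates):
    """Score every candidate coefficient against the templates; return the best one.

    For each coefficient, each frame is warped and compared with every template;
    the distance to the nearest template is accumulated over all frames.
    Picking the smallest total is an assumed decision rule.
    """
    features = [magnitude_spectrum(f) for f in split_into_frames(samples, frame_len)]
    totals = {}
    for alpha in alphas:
        total = 0.0
        for feat in features:
            warped = warp_frequency(feat, alpha)
            total += min(euclidean(warped, t) for t in model_templates)
        totals[alpha] = total
    best = min(totals, key=totals.get)
    return best, totals
```

Under these assumptions, calling `choose_warp(signal, 32, [0.8, 1.0, 1.25], [template])` returns the selected coefficient together with the accumulated distance per candidate; normalizing the utterance then amounts to applying `warp_frequency` with the selected coefficient to each frame's feature. A real system would likely use cepstral features and statistical phoneme models rather than raw spectra and fixed templates.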