发明名称 SPEECH RECOGNITION METHOD OF SELF LEARNING SPEAKER ADAPTATION TYPE
摘要 PURPOSE: To reduce unsupervised segmentation error and to facilitate a succeeding phone model adaptive execution by eliminating an acoustic spectrum fluctuation source casing recognition performance deterioration by decomposing the spectrum fluctuation source. CONSTITUTION: In a training side 10, a spectrum bias (h) is subtracted from a training speech spectrum Xt of the speaker in a logarithmic domain to generate a set of a normalized spectrum, and is made into a model in a process 26 to generate the models M2, M3 of a normalized unspecified speaker. The normalized phone models M2, M3 are supplied to a decoder 30, and are used for decoding the test speech of the speaker (q). Before the speaker (q) recognized a sentence, short generation of a proofreading speech Xc is supplied to an h- estimater 24, and the estimated spectrum bias h<(q)> for speaker is generated, and it is subtracted from the training speech spectrum Xt . A bias parameter generates the normalized spectrum, and the normalized spectrum is supplied to the decoder 30 to constitute a word line.
申请公布号 JPH0863182(A) 申请公布日期 1996.03.08
申请号 JP19950206511 申请日期 1995.07.19
申请人 MATSUSHITA ELECTRIC IND CO LTD 发明人 YANKIN TSUAO
分类号 G10L15/04;G10L15/06;G10L15/10;G10L15/14;G10L15/20;G10L21/02;(IPC1-7):G10L3/00;G10L3/00;G10L3/02 主分类号 G10L15/04
代理机构 代理人
主权项
地址