Title of invention: RAPIDLY TRAINING A SPEECH RECOGNIZER TO A SUBSEQUENT SPEAKER GIVEN TRAINING DATA OF A REFERENCE SPEAKER
Abstract: Apparatus and method for training the statistics of a Markov model speech recognizer to a subsequent speaker who utters part of a training text, after the recognizer has been trained for the statistics of a reference speaker who utters a full training text. Where labels generated by an acoustic processor in response to uttered speech serve as outputs for Markov models, the present apparatus and method determine label output probabilities at transitions in the Markov models corresponding to the subsequent speaker where there is sparse training data. Specifically, label output probabilities for the subsequent speaker are re-parameterized based on confusion matrix entries having values indicative of the similarity between an l-th label output of the subsequent speaker and a k-th label output of the reference speaker. The label output probabilities based on re-parameterized data are combined with initialized label output probabilities to form "smoothed" label output probabilities, which feature smoothed probability distributions. Based on label outputs generated when the subsequent speaker utters the shortened training text, "basic" label output probabilities computed by conventional methodology are linearly averaged against the smoothed label output probabilities to produce improved label output probabilities.
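The three steps named in the abstract (re-parameterizing through a confusion matrix, smoothing against initialized probabilities, and linearly averaging with the "basic" estimates) can be illustrated for a single Markov-model transition. The following is a minimal sketch under stated assumptions, not the method as claimed in the patent. The function names, the three-label alphabet, the numeric values, and both mixing weights are hypothetical. The abstract says only that the re-parameterized and initialized probabilities are "combined", so the weighted sum used for that step is an assumption. The confusion matrix is assumed to be row-normalized, with entry `confusion[k][l]` read as the probability that the subsequent speaker produces label `l` where the reference speaker produced label `k`.

```python
def reparameterize(ref_probs, confusion):
    """Map reference-speaker label probabilities onto the subsequent
    speaker's labels: p[l] = sum over k of ref_probs[k] * confusion[k][l]."""
    n_out = len(confusion[0])
    return [
        sum(ref_probs[k] * confusion[k][l] for k in range(len(ref_probs)))
        for l in range(n_out)
    ]


def interpolate(first, second, weight):
    """Linear average: weight * first + (1 - weight) * second, per label."""
    return [weight * a + (1.0 - weight) * b for a, b in zip(first, second)]


def improved_label_probs(ref_probs, confusion, init_probs, basic_probs,
                         smooth_weight=0.5, basic_weight=0.5):
    """Hypothetical pipeline for one transition (weights are assumptions)."""
    reparam = reparameterize(ref_probs, confusion)
    # Assumed form of "combined": a weighted sum with the initialized values.
    smoothed = interpolate(reparam, init_probs, smooth_weight)
    # Linear averaging of the sparse "basic" estimate against the smoothed one.
    return interpolate(basic_probs, smoothed, basic_weight)


# Illustrative values only (not taken from the patent).
ref_probs = [0.7, 0.3, 0.0]        # reference speaker, full training text
confusion = [                      # rows: reference label k, cols: label l
    [0.8, 0.2, 0.0],
    [0.1, 0.8, 0.1],
    [0.0, 0.2, 0.8],
]
init_probs = [1.0 / 3.0] * 3       # initialized (here uniform) probabilities
basic_probs = [1.0, 0.0, 0.0]      # sparse estimate from the short text

reparam = reparameterize(ref_probs, confusion)
improved = improved_label_probs(ref_probs, confusion, init_probs, basic_probs)
print(improved)
```

In this sketch the sparse "basic" estimate assigns zero probability to two labels, while the averaged result gives every label a nonzero probability and still sums to one, which is the practical point of smoothing when training data is scarce.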
Publication number: EP0303022(A3)   Publication date: 1989.08.23
Application number: EP19880109620   Application date: 1988.06.16
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION   Inventors: BAHL, LALIT RAI; MERCER, ROBERT LEROY; NAHAMOO, DAVID
Classification: G10L11/00; G10L15/06; G10L15/14; (IPC1-7): G10L5/06   Main classification: G10L11/00
Agency:   Agent:
Principal claim:
Address: