发明名称 Low complexity, high accuracy clustering method for speech recognizer
摘要 The clustering technique produces a low complexity and yet high accuracy speech representation for use with speech recognizers. The task database comprising the test speech to be modeled is segmented into subword units such as phonemes and labeled to indicate each phoneme in its left and right context (triphones). Hidden Markov Models are constructed for each context-independent phoneme and trained. Then the center states are tied for all phonemes of the same class. Triphones are trained and all poorly-trained models are eliminated by merging their training data with the nearest well-trained model using a weighted divergence computation to ascertain distance. Before merging, the threshold for each class is adjusted until the number of good models for each phoneme class is within predetermined upper and lower limits. Finally, if desired, the number of mixture components used to represent each model may be increased and the models retrained. This latter step increases the accuracy.
申请公布号 US5806030(A) 申请公布日期 1998.09.08
申请号 US19960642767 申请日期 1996.05.06
申请人 JUNQUA, JEAN-CLAUDE 发明人 JUNQUA, JEAN-CLAUDE
分类号 G10L15/02;G10L15/04;G10L15/06;G10L15/14;(IPC1-7):G10L5/00 主分类号 G10L15/02
代理机构 代理人
主权项
地址