Title: System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling
Abstract: The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes receiving symbolic input as labeled speech data, overgenerating potential pronunciations based on the symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential pronunciations in a lexicon. Overgenerating potential pronunciations can include establishing a set of conversion rules for short sequences of letters, converting portions of the symbolic input into a number of possible lexical pronunciation variants based on the set of conversion rules, modeling the possible lexical pronunciation variants in one of a weighted network and a list of phoneme lists, and iteratively retraining the set of conversion rules based on improved pronunciations. Symbolic input can include multiple examples of the same spoken word. Speech data can be labeled explicitly or implicitly and can include words as text and recorded audio.
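The overgeneration and selection steps described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `RULES` table, the phoneme symbols, and the functions `overgenerate`, `select`, and `build_lexicon` do not come from the patent. The "speech recognition context" is replaced by a simple stand-in that keeps only variants matching phoneme strings observed in the labeled data, and the variants are modeled as a list of phoneme lists rather than a weighted network. Iterative retraining of the rules is not shown.

```python
# Hypothetical conversion rules: short letter sequences -> possible phoneme lists.
# The entries and phoneme symbols are illustrative only, not taken from the patent.
RULES = {
    "c": [["k"], ["s"]],
    "a": [["ae"], ["ey"]],
    "t": [["t"]],
    "ch": [["ch"], ["k"]],
}
MAX_RULE_LEN = 2  # longest letter sequence that appears as a rule key


def overgenerate(word):
    """Return every pronunciation (a list of phonemes) the rules allow for `word`."""
    if not word:
        return [[]]  # exactly one way to pronounce nothing: the empty list
    variants = []
    for n in range(1, min(MAX_RULE_LEN, len(word)) + 1):
        head, tail = word[:n], word[n:]
        for phones in RULES.get(head, []):
            for rest in overgenerate(tail):
                variants.append(phones + rest)
    return variants


def select(variants, observed):
    """Stand-in for the recognition step: keep variants seen in labeled data."""
    observed_set = {tuple(o) for o in observed}
    return [v for v in variants if tuple(v) in observed_set]


def build_lexicon(labeled):
    """Map each word to the overgenerated variants that survive selection."""
    return {
        word: select(overgenerate(word), observed)
        for word, observed in labeled.items()
    }


if __name__ == "__main__":
    # Assumed labeled data: the word as text plus one observed phoneme string.
    labeled = {"cat": [["k", "ae", "t"]]}
    print(len(overgenerate("cat")))  # number of candidate pronunciations
    print(build_lexicon(labeled))
```

With these assumed rules, "cat" expands to four candidates (two choices for "c", two for "a", one for "t"), and the selection step keeps only the one that matches the observed data. A real system would score candidates with a recognizer against recorded audio instead of requiring an exact match.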
Publication number: US8095365(B2) Publication date: 2012.01.10
Application number: US20080328436 Filing date: 2008.12.04
Applicant: CONKIE ALISTAIR D.;GILBERT MAZIN;LJOLJE ANDREJ;AT&T INTELLECTUAL PROPERTY I, L.P. Inventor: CONKIE ALISTAIR D.;GILBERT MAZIN;LJOLJE ANDREJ
Classification: G10L13/08 Main classification: G10L13/08
Agency: Agent:
Principal claim:
Address: