Title of invention |
System and method for increasing recognition rates of in-vocabulary words by improving pronunciation modeling |
Abstract |
The present disclosure relates to systems, methods, and computer-readable media for generating a lexicon for use with speech recognition. The method includes overgenerating potential pronunciations based on symbolic input, identifying potential pronunciations in a speech recognition context, and storing the identified potential pronunciations in a lexicon. Overgenerating potential pronunciations can include establishing a set of conversion rules for short sequences of letters, converting portions of the symbolic input into a number of possible lexical pronunciation variants based on the set of conversion rules, modeling the possible lexical pronunciation variants in one of a weighted network and a list of phoneme lists, and iteratively retraining the set of conversion rules based on improved pronunciations. Symbolic input can include multiple examples of a same spoken word. Speech data can be labeled explicitly or implicitly and can include words as text and recorded audio. |
Publication number |
US8892441(B2) |
Publication date |
2014.11.18 |
Application number |
US201113311512 |
Filing date |
2011.12.05 |
Applicant |
AT&T Intellectual Property I, L.P. |
Inventors |
Conkie Alistair D.;Gilbert Mazin;Ljolje Andrej |
Classification codes |
G10L15/187;G10L15/06 |
Main classification code |
G10L15/187 |
Agency |
|
Agent |
|
Main claim |
1. A method comprising:
overgenerating potential pronunciations by converting portions of symbolic input into a number of possible lexical pronunciation variants based on an established set of conversion rules, wherein the symbolic input comprises labeled speech data;
identifying, via a processor of a computing device, potential pronunciations in a speech recognition context to yield identified potential pronunciations; and
storing the identified potential pronunciations in a lexicon. |
Address |
Atlanta GA US |
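
The three steps named in the abstract and main claim (overgenerate, identify, store) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the rule table `RULES`, the ARPAbet-style phoneme symbols, and the function names are hypothetical, and the "speech recognition context" is replaced by a simple stand-in that compares each candidate against phoneme sequences assumed to have been observed for labeled audio, using edit distance.

```python
# Hypothetical letter-sequence -> phoneme rules (illustrative only, not from the patent).
RULES = {
    "t": [["T"]],
    "o": [["AH"], ["OW"], ["AA"]],
    "m": [["M"]],
    "a": [["AE"], ["EY"], ["AA"]],
    "sh": [["SH"]],  # keys may span more than one letter
}


def overgenerate(word, rules):
    """Return every pronunciation reachable by segmenting `word` into rule keys."""
    results = []

    def expand(pos, prefix):
        if pos == len(word):
            results.append(tuple(prefix))
            return
        for key, variants in rules.items():
            if word.startswith(key, pos):
                for phonemes in variants:
                    expand(pos + len(key), prefix + phonemes)

    expand(0, [])
    return sorted(set(results))


def edit_distance(a, b):
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def identify(candidates, observed, max_distance=0):
    """Stand-in for recognition: keep candidates close to an observed sequence."""
    return [
        c for c in candidates
        if any(edit_distance(c, o) <= max_distance for o in observed)
    ]


def build_lexicon(labeled, rules, max_distance=0):
    """Store the identified pronunciations for each labeled word."""
    lexicon = {}
    for word, observed in labeled.items():
        kept = identify(overgenerate(word, rules), observed, max_distance)
        if kept:
            lexicon[word] = kept
    return lexicon


# Assumed labeled data: a word plus phoneme sequences observed for its recordings.
labeled = {
    "tomato": [
        ("T", "AH", "M", "EY", "T", "OW"),
        ("T", "AH", "M", "AA", "T", "OW"),
    ]
}
lexicon = build_lexicon(labeled, RULES)
```

With these assumed rules, `overgenerate("tomato", RULES)` yields 27 candidates, and only the two that match the observed sequences are kept in `lexicon`. A real system would score candidates with a speech recognizer against the audio rather than against hand-written phoneme sequences, and the abstract also mentions retraining the rules iteratively, which this sketch omits.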