Title of Invention |
System and method for unsupervised and active learning for automatic speech recognition |
Abstract |
A system and method are provided for combining active and unsupervised learning for automatic speech recognition. This process reduces the amount of human supervision required for training acoustic and language models and increases performance given the transcribed and un-transcribed data. |
Publication Number |
US8914283(B2) |
Publication Date |
2014.12.16 |
Application Number |
US201313959351 |
Filing Date |
2013.08.05 |
Applicant |
AT&T Intellectual Property II, L.P. |
Inventors |
Hakkani-Tur Dilek Zeynep;Riccardi Giuseppe |
Classification Number |
G10L15/26 |
Main Classification Number |
G10L15/26 |
Patent Agency |
|
Patent Agent |
|
Main Claim |
1. A method comprising:
identifying, in a database of utterances, transcribed utterances and un-transcribed utterances;
ordering transcription candidate utterances from the un-transcribed utterances based on confidence scores of the transcription candidate utterances, to yield a selectively sampled order;
transcribing, via a processor, a top n utterances from the selectively sampled order, to yield additional transcribed utterances and remainder un-transcribed utterances, wherein the remainder un-transcribed utterances are the un-transcribed utterances without the additional transcribed utterances;
receiving human-transcribed utterances, wherein the human-transcribed utterances are selected from the remainder un-transcribed utterances for human transcription based on the confidence scores; and
adding the additional transcribed utterances and the human-transcribed utterances to the database of utterances. |
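The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Utterance` type, the `update_database` function, the parameter `k`, and the `human_transcribe` callback are hypothetical names introduced only for illustration. The claim says only that ordering and selection are "based on confidence scores", so the sketch assumes that the top n utterances are those with the highest confidence (their recognizer output is accepted as the transcript) and that the k lowest-confidence utterances of the remainder are the ones sent for human transcription.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class Utterance:
    utt_id: str
    confidence: float                 # hypothetical recognizer confidence in [0, 1]
    asr_hypothesis: str               # hypothetical recognizer output text
    transcript: Optional[str] = None  # None means the utterance is un-transcribed


def update_database(
    database: List[Utterance],
    n: int,
    k: int,
    human_transcribe: Callable[[Utterance], str],
) -> Tuple[List[Utterance], List[Utterance]]:
    """Transcribe the top n utterances automatically and k more by hand."""
    # Identify the un-transcribed utterances in the database.
    untranscribed = [u for u in database if u.transcript is None]

    # Order candidates by confidence, highest first (assumed direction).
    ordered = sorted(untranscribed, key=lambda u: u.confidence, reverse=True)

    # Accept the recognizer output for the top n utterances.
    machine = ordered[:n]
    remainder = ordered[n:]
    for u in machine:
        u.transcript = u.asr_hypothesis

    # The remainder is sorted high to low, so its last k entries have the
    # lowest confidence; those are handed to a human (assumed policy).
    for_human = remainder[-k:] if k > 0 else []
    for u in for_human:
        u.transcript = human_transcribe(u)

    # The utterances already live in `database`; filling in `transcript`
    # is what adds them to the transcribed set in this sketch.
    return machine, for_human
```

In this sketch, `human_transcribe` stands in for whatever annotation interface supplies the human transcript. Any utterance that is neither in the top n nor selected for a human stays un-transcribed and remains a candidate for a later round.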
地址 |
Atlanta GA US |