发明名称 System and method for unsupervised and active learning for automatic speech recognition
摘要 A system and method is provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for training acoustic and language models and an increase in the performance given the transcribed and un-transcribed data.
申请公布号 US8914283(B2) 申请公布日期 2014.12.16
申请号 US201313959351 申请日期 2013.08.05
申请人 AT&T Intellectual Property II, L.P. 发明人 Hakkani-Tur Dilek Zeynep;Riccardi Giuseppe
分类号 G10L15/26 主分类号 G10L15/26
代理机构 代理人
主权项 1. A method comprising: identifying, in a database of utterances, transcribed utterances and un-transcribed utterances; ordering transcription candidate utterances from the un-transcribed utterances based on confidence scores of the transcription candidate utterances, to yield a selectively sampled order; transcribing, via a processor, a top n utterances from the selectively sampled order, to yield additional transcribed utterances and remainder un-transcribed utterances, wherein the remainder un-transcribed utterances are the un-transcribed utterances without the additional transcribed utterances; receiving human-transcribed utterances, wherein the human-transcribed utterances are selected from the remainder un-transcribed utterances for human transcription based on the confidence scores; and adding the additional transcribed utterances and the human-transcribed utterances to the database of utterances.
地址 Atlanta GA US