发明名称 Method of active learning for automatic speech recognition
摘要 State-of-the-art speech recognition systems are trained using transcribed utterances, preparation of which is labor-intensive and time-consuming. The present invention is an iterative method for reducing the transcription effort for training in automatic speech recognition (ASR). Active learning aims at reducing the number of training examples to be labeled by automatically processing the unlabeled examples and then selecting the most informative ones with respect to a given cost function for a human to label. The method comprises automatically estimating a confidence score for each word of the utterance and exploiting the lattice output of a speech recognizer, which was trained on a small set of transcribed data. An utterance confidence score is computed based on these word confidence scores; then the utterances are selectively sampled to be transcribed using the utterance confidence scores.
申请公布号 US7149687(B1) 申请公布日期 2006.12.12
申请号 US20020329139 申请日期 2002.12.24
申请人 AT&T CORP. 发明人 GORIN ALLEN LOUIS;HAKKANI-TUR DILEK Z.;RICCARDI GIUSEPPE
分类号 G10L15/06 主分类号 G10L15/06
代理机构 代理人
主权项
地址