发明名称 LEARNING IN AUTOMATIC SPEECH RECOGNITION
摘要 Utterance data that includes at least a small amount of manually transcribed data is provided. Automatic speech recognition is performed on ones of the utterance data not having a corresponding manual transcription to produce automatically transcribed utterances. A model is trained using all of the manually transcribed data and the automatically transcribed utterances. A predetermined number of utterances not having a corresponding manual transcription are intelligently selected and manually transcribed. Ones of the automatically transcribed data as well as ones having a corresponding manual transcription are labeled. In another aspect of the invention, audio data is mined from at least one source, and a language model is trained for call classification from the mined audio data to produce a language model.
申请公布号 HK1095013(A1) 申请公布日期 2009.05.29
申请号 HK20070101337 申请日期 2007.02.05
申请人 AT&T CORP. 发明人 HAKKANI-TUR, DILEK Z.;RAHIM, MAZIN G.;TUR, GOKHAN;RICCARDI, GIUSEPPE
分类号 G10L 主分类号 G10L
代理机构 代理人
主权项
地址