发明名称 IDENTIFYING KEYWORD OCCURRENCES IN AUDIO DATA
摘要 Occurrences of one or more keywords in audio data are identified using a speech recognizer employing a language model to derive a transcript of the keywords. The transcript is converted into a phoneme sequence. The phonemes of the phoneme sequence are mapped to the audio data to derive a time-aligned phoneme sequence that is searched for occurrences of keyword phoneme sequences corresponding to the phonemes of the keywords. Searching includes computing a confusion matrix. The language model used by the speech recognizer is adapted to keywords by increasing the likelihoods of the keywords in the language model. For each potential occurrences keywords detected, a corresponding subset of the audio data may be played back to an operator to confirm whether the potential occurrences correspond to actual occurrences of the keywords.
申请公布号 CA2690174(C) 申请公布日期 2014.10.14
申请号 CA20102690174 申请日期 2010.01.13
申请人 CRIM (CENTRE DE RECHERCHE INFORMATIQUE DE MONTREAL) 发明人 GUPTA, VISHWA NATH;BOULIANNE, GILLES
分类号 G10L15/22;G10L15/02 主分类号 G10L15/22
代理机构 代理人
主权项
地址