发明名称 Audio-only backoff in audio-visual speech recognition system
摘要 Techniques for performing audio-visual speech recognition, with improved recognition performance, in a degraded visual environment. For example, in one aspect of the invention, a technique for use in accordance with an audio-visual speech recognition system for improving a recognition performance thereof includes the steps/operations of: (i) selecting between an acoustic-only data model and an acoustic-visual data model based on a condition associated with a visual environment; and (ii) decoding at least a portion of an input spoken utterance using the selected data model. Advantageously, during periods of degraded visual conditions, the audio-visual speech recognition system is able to decode (recognize) input speech data using audio-only data, thus avoiding recognition inaccuracies that may result from performing speech recognition based on acoustic-visual data models and degraded visual data.
申请公布号 US2004260554(A1) 申请公布日期 2004.12.23
申请号 US20030601350 申请日期 2003.06.23
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 CONNELL JONATHAN H.;HAAS NORMAN;MARCHERET ETIENNE;NETI CHALAPATHY VENKATA;POTAMIANOS GERASIMOS
分类号 G10L15/24;(IPC1-7):G10L21/00 主分类号 G10L15/24
代理机构 代理人
主权项
地址