Title of invention: Methods and apparatus for audio-visual speech detection and recognition
Abstract: In a first aspect of the invention, methods and apparatus for providing speech recognition comprise the steps of processing a video signal associated with an arbitrary content video source, processing an audio signal associated with the video signal, and decoding the processed audio signal in conjunction with the processed video signal to generate a decoded output signal representative of the audio signal. In a second aspect of the invention, methods and apparatus for providing speech detection in accordance with a speech recognition system comprise the steps of processing a video signal associated with a video source to detect whether one or more features associated with the video signal are representative of speech, and processing an audio signal associated with the video signal in accordance with the speech recognition system to generate a decoded output signal representative of the audio signal when the one or more features associated with the video signal are representative of speech. Speech detection may also be performed using information from both the video path and the audio path simultaneously.
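The second aspect described in the abstract (decoding audio only when video features indicate speech) can be illustrated with a minimal sketch. It is not the patented implementation. The function name `gated_decode`, the per-frame visual speech score, the `threshold` value, and the `decode_audio` callback are all hypothetical names introduced here for illustration; the abstract does not specify how the visual features are computed or compared.

```python
from typing import Callable, Iterable, List, Tuple

# Hypothetical frame representation: (visual speech score, audio chunk).
# The score stands in for "features associated with the video signal";
# how it is computed is outside this sketch.
Frame = Tuple[float, str]


def gated_decode(
    frames: Iterable[Frame],
    decode_audio: Callable[[str], str],
    threshold: float = 0.5,  # assumed cutoff, not taken from the patent
) -> List[str]:
    """Decode audio only for frames whose visual score suggests speech."""
    outputs: List[str] = []
    for visual_score, audio_chunk in frames:
        # Video-based speech detection gates the audio decoder.
        if visual_score >= threshold:
            outputs.append(decode_audio(audio_chunk))
    return outputs


if __name__ == "__main__":
    demo = [(0.9, "hello"), (0.1, "noise"), (0.7, "world")]
    # str.upper is a stand-in for a real speech decoder.
    print(gated_decode(demo, str.upper))
```

In this sketch the audio chunk with a low visual score is never passed to the decoder, which mirrors the abstract's condition that a decoded output is generated "when the one or more features associated with the video signal are representative of speech".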
Publication number: US6594629(B1) Publication date: 2003.07.15
Application number: US19990369707 Filing date: 1999.08.06
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors: BASU SANKAR;DE CUETOS PHILIPPE CHRISTIAN;MAES STEPHANE HERMAN;NETI CHALAPATHY VENKATA;SENIOR ANDREW WILLIAM
Classification: G10L11/02;G10L15/24;(IPC1-7):G10L15/00 Main classification: G10L11/02
Agency: Agent:
Main claim:
Address: