发明名称 NOISE REDUCTION AND AUDIO-VISUAL SPEECH ACTIVITY DETECTION
摘要 The present invention generally relates to the field of noise reduction systems which are equipped with an audio-visual user interface, in particular to an audio-visual speech activity recognition system (200b/c) of a video-enabled telecommunication device which runs a real-time lip tracking application that can advantageously be used for a near-speaker detection algorithm in an environment where a speaker's voice is interfered by a statistically distributed background noise (n'(t)) including both environmental noise (n(t)) and surrounding persons' voices ( SIGMA j aj.sj(t-Tj) with j NOTEQUAL i). Said real-time lip tracking application combines a visual feature vector (o nu ,nT) that comprises features extracted from a digital video sequence ( nu (nT)) showing the speaker's face by detecting and analyzing lip movements and facial expressions of said speaker (Si) with an audio feature vector (oa,nT) which comprises features extracted from a recorded analog audio sequence (s(t)) representing the voice of said speaker (Si) interfered by said background noise (n'(t)). <IMAGE> <IMAGE>
申请公布号 IL169550(A) 申请公布日期 2009.09.01
申请号 IL20050169550 申请日期 2005.07.06
申请人 SONY ERICSSON MOBILE COMMUNICATIONS AB 发明人
分类号 G10L11/02;G10L15/20;G10L15/24;G10L21/02 主分类号 G10L11/02
代理机构 代理人
主权项
地址