发明名称 METHOD FOR DETECTING VOICE SECTION FROM TIME-SPACE BY USING AUDIO AND VIDEO INFORMATION AND APPARATUS THEREOF
摘要 The present invention relates to a method for detecting a voice section in time-space by using audio and video information. According to an embodiment of the present invention, a method for detecting a voice section from time-space by using audio and video information comprises the steps of: detecting a voice section in an audio signal which is inputted into a microphone array; verifying a speaker from the detected voice section; sensing the face of the speaker by using a video signal which is inputted into a camera if the speaker is successfully verified, and then estimating the direction of the face of the speaker; and determining the detected voice section as the voice section of the speaker if the estimated face direction corresponds to a reference direction which is previously stored.
申请公布号 WO2010098546(A2) 申请公布日期 2010.09.02
申请号 WO2010KR00833 申请日期 2010.02.10
申请人 KOREA UNIVERSITY INDUSTRIAL & ACADEMIC COLLABORATION FOUNDATION;YOOK, DONGSUK;LEE, HYUB-WOO 发明人 YOOK, DONGSUK;LEE, HYUB-WOO
分类号 G10L21/00;G10L17/00;G10L21/02;G10L21/0216;G10L25/78 主分类号 G10L21/00
代理机构 代理人
主权项
地址