发明名称 |
METHOD FOR DETECTING VOICE SECTION FROM TIME-SPACE BY USING AUDIO AND VIDEO INFORMATION AND APPARATUS THEREOF |
摘要 |
The present invention relates to a method for detecting a voice section in time-space by using audio and video information. According to an embodiment of the present invention, a method for detecting a voice section from time-space by using audio and video information comprises the steps of: detecting a voice section in an audio signal which is inputted into a microphone array; verifying a speaker from the detected voice section; sensing the face of the speaker by using a video signal which is inputted into a camera if the speaker is successfully verified, and then estimating the direction of the face of the speaker; and determining the detected voice section as the voice section of the speaker if the estimated face direction corresponds to a reference direction which is previously stored. |
申请公布号 |
WO2010098546(A2) |
申请公布日期 |
2010.09.02 |
申请号 |
WO2010KR00833 |
申请日期 |
2010.02.10 |
申请人 |
KOREA UNIVERSITY INDUSTRIAL & ACADEMIC COLLABORATION FOUNDATION;YOOK, DONGSUK;LEE, HYUB-WOO |
发明人 |
YOOK, DONGSUK;LEE, HYUB-WOO |
分类号 |
G10L21/00;G10L17/00;G10L21/02;G10L21/0216;G10L25/78 |
主分类号 |
G10L21/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|