发明名称 |
SYSTEM AND METHOD FOR JOINT SPEAKER AND SCENE RECOGNITION IN A VIDEO/AUDIO PROCESSING ENVIRONMENT |
摘要 |
An example method is provided and includes receiving a media file that includes video data and audio data; determining an initial scene sequence in the media file; determining an initial speaker sequence in the media file; and updating a selected one of the initial scene sequence and the initial speaker sequence in order to generate an updated scene sequence and an updated speaker sequence respectively. The initial scene sequence is updated based on the initial speaker sequence, and wherein the initial speaker sequence is updated based on the initial scene sequence.
|
申请公布号 |
US2013300939(A1) |
申请公布日期 |
2013.11.14 |
申请号 |
US201213469886 |
申请日期 |
2012.05.11 |
申请人 |
CHOU JIM CHEN;KAJAREKAR SACHIN;CATCHPOLE JASON J.;SANKAR ANANTH;CISCO TECHNOLOGY, INC. |
发明人 |
CHOU JIM CHEN;KAJAREKAR SACHIN;CATCHPOLE JASON J.;SANKAR ANANTH |
分类号 |
H04N5/14 |
主分类号 |
H04N5/14 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|