Title of invention: SYSTEM AND METHOD FOR JOINT SPEAKER AND SCENE RECOGNITION IN A VIDEO/AUDIO PROCESSING ENVIRONMENT
Abstract: An example method is provided that includes receiving a media file that includes video data and audio data; determining an initial scene sequence in the media file; determining an initial speaker sequence in the media file; and updating a selected one of the initial scene sequence and the initial speaker sequence in order to generate an updated scene sequence and an updated speaker sequence, respectively. The initial scene sequence is updated based on the initial speaker sequence, and the initial speaker sequence is updated based on the initial scene sequence.
Publication number: US2013300939(A1) Publication date: 2013.11.14
Application number: US201213469886 Filing date: 2012.05.11
Applicant: CHOU JIM CHEN; KAJAREKAR SACHIN; CATCHPOLE JASON J.; SANKAR ANANTH; CISCO TECHNOLOGY, INC. Inventor: CHOU JIM CHEN; KAJAREKAR SACHIN; CATCHPOLE JASON J.; SANKAR ANANTH
Classification: H04N5/14 Main classification: H04N5/14
Agency: Agent:
Main claim:
Address: