Title of invention: METHOD AND APPARATUS FOR DISCOVERING AND LABELING SPEAKERS IN A LARGE AND GROWING COLLECTION OF VIDEOS WITH MINIMAL USER EFFORT
Abstract: In one embodiment, an audio stream is partitioned into a plurality of segments such that the plurality of segments are clustered into one or more clusters, each of the one or more clusters identifying a subset of the plurality of segments in the audio stream and corresponding to one of a first set of one or more speaker models, each speaker model in the first set of speaker models representing one of a first set of hypothetical speakers. The speaker models in the first set of speaker models are compared with a second set of one or more speaker models, where each speaker model in the second set of speaker models represents one of a second set of hypothetical speakers. Labels associated with one or more speaker models in the second set of speaker models are propagated to one or more speaker models in the first set of speaker models according to a result of the comparing step.
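The compare-and-propagate step in the abstract can be illustrated with a minimal sketch. It is not the claimed method: it assumes, purely for illustration, that each speaker model is a plain numeric vector, that models are compared by cosine similarity, and that a label is propagated only when the best match meets a fixed threshold. The names `cosine`, `propagate_labels`, and `threshold` are hypothetical and do not come from the record.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def propagate_labels(first_set, second_set, labels, threshold=0.9):
    """Copy labels from the second set of speaker models to the first set.

    first_set, second_set: dict mapping model id -> vector (assumed representation).
    labels: dict mapping second-set model id -> label; unlabeled models are skipped.
    Returns a dict mapping each first-set model id to a label, or None if no
    labeled model in the second set reaches the similarity threshold.
    """
    result = {}
    for new_id, new_vec in first_set.items():
        best_id, best_score = None, threshold
        for known_id, known_vec in second_set.items():
            if known_id not in labels:
                continue  # nothing to propagate from an unlabeled model
            score = cosine(new_vec, known_vec)
            if score >= best_score:
                best_id, best_score = known_id, score
        result[new_id] = labels[best_id] if best_id is not None else None
    return result


# Hypothetical data: two models from a newly clustered audio stream,
# and two previously stored models, only one of which has a label.
first_set = {"s1": [1.0, 0.0], "s2": [0.0, 1.0]}
second_set = {"k1": [0.9, 0.1], "k2": [-1.0, 0.0]}
labels = {"k1": "Alice"}
propagated = propagate_labels(first_set, second_set, labels)
```

Here `s1` is close enough to the labeled model `k1` to inherit its label, while `s2` matches nothing and stays unlabeled; in a real system the comparison measure and threshold would depend on how the speaker models are actually built.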
Publication number: US2013144414(A1) Publication date: 2013.06.06
Application number: US201113312800 Filing date: 2011.12.06
Applicant: KAJAREKAR SACHIN;SANKAR ANANTH;GANNU SATTISH;KHARE APARNA;CISCO TECHNOLOGY, INC. Inventor: KAJAREKAR SACHIN;SANKAR ANANTH;GANNU SATTISH;KHARE APARNA
Classification: G06F17/00 Main classification: G06F17/00
Agency: Agent:
Principal claim:
Address: