Title of invention Methods and apparatus for tracking speakers in an audio stream
Abstract Speakers are automatically identified in an audio (or video) source. The audio information is processed to identify potential segment boundaries. Homogeneous segments are clustered substantially concurrently with the segmentation routine, and a cluster identifier is assigned to each identified segment. A segmentation subroutine identifies potential segment boundaries using the Bayesian Information Criterion (BIC) for model selection. A clustering subroutine likewise uses the BIC model selection criterion to assign a cluster identifier to each of the identified segments. If the difference between the BIC values of the two competing models is positive, the two clusters are merged.
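The merge test described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything below is an assumption: each cluster is modelled as a single full-covariance Gaussian, the difference is taken as BIC(one merged Gaussian) minus BIC(two separate Gaussians), and the names `delta_bic`, `should_merge` and `penalty_weight` are hypothetical.

```python
import numpy as np


def _logdet_ml_cov(x: np.ndarray) -> float:
    """Log-determinant of the maximum-likelihood covariance of x (rows are frames)."""
    cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True))
    sign, logdet = np.linalg.slogdet(cov)
    if sign <= 0:
        raise ValueError("singular covariance; more frames than dimensions are needed")
    return float(logdet)


def delta_bic(x1, x2, penalty_weight: float = 1.0) -> float:
    """BIC(merged model) - BIC(separate models) for two sets of feature frames.

    Assumption (not stated in the source): Gaussian models with full covariance.
    x1 and x2 are arrays of shape (frames, dimensions).
    """
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    n1, d = x1.shape
    n2 = x2.shape[0]
    n = n1 + n2

    # Difference of maximised log-likelihoods; the constant terms cancel
    # because n = n1 + n2.
    log_lik_diff = (
        -0.5 * n * _logdet_ml_cov(np.vstack([x1, x2]))
        + 0.5 * n1 * _logdet_ml_cov(x1)
        + 0.5 * n2 * _logdet_ml_cov(x2)
    )

    # The two-Gaussian model has one extra mean vector and covariance matrix,
    # so it pays a larger complexity penalty.
    extra_params = d + d * (d + 1) / 2
    penalty = 0.5 * penalty_weight * extra_params * np.log(n)

    return log_lik_diff + penalty


def should_merge(x1, x2, penalty_weight: float = 1.0) -> bool:
    """Merge the two clusters when the BIC difference is positive."""
    return delta_bic(x1, x2, penalty_weight) > 0.0
```

Under these assumptions, two segments drawn from the same distribution give a positive difference and are merged, while segments with clearly different statistics give a negative difference and keep separate cluster identifiers. The weight `penalty_weight` is a tuning knob introduced for this sketch only.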
Publication number GB2351592(B) Publication date 2003.05.21
Application number GB20000015194 Filing date 2000.06.22
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor SCOTT SHAONBING CHEN;ALAIN CHARLES LOUIS TRITSCHLER;MAHESH VISWANATHAN
Classification G06F3/16;G10L15/04;G10L15/10;G10L17/00;G10L21/02;(IPC1-7):G10L17/00 Main classification G06F3/16
Agency Agent
Main claim
Address