发明名称 |
Methods and apparatus for tracking speakers in an audio stream |
摘要 |
Speakers are automatically identified in an audio (or video) source. The audio information is processed to identify potential segment boundaries. Homogeneous segments are clustered substantially concurrently with the segmentation routine, and a cluster identifier is assigned to each identified segment. A segmentation subroutine identifies potential segment boundaries using the BIC model selection criterion. A clustering subroutine uses a BIC model selection criterion to assign a cluster identifier to each of the identified segments. If the difference of BIC values for each model is positive, the two clusters are merged. |
申请公布号 |
GB2351592(B) |
申请公布日期 |
2003.05.21 |
申请号 |
GB20000015194 |
申请日期 |
2000.06.22 |
申请人 |
* INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
SCOTT SHAONBING * CHEN;ALAIN CHARLES LOUIS * TRITSCHLER;MAHESH * VISWANATHAN;MAHESH * VISWANATHAN;ALAIN CHARLES LOUIS * TRITSCHLER;SCOTT SHAONBING * CHEN |
分类号 |
G06F3/16;G10L15/04;G10L15/10;G10L17/00;G10L21/02;(IPC1-7):G10L17/00 |
主分类号 |
G06F3/16 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|