Title of invention |
Unsupervised speaker clustering for automatic speaker indexing of recorded audio data |
Abstract |
A system and method for unsupervised clustering of audio data segments in an audio data recording containing speech from multiple speakers, including the steps of: 1) providing a portion of the audio data containing speech from all of the speakers; 2) forming initial clusters by dividing the portion of the audio data into segments, each of which includes an ordered data set; 3) computing the pairwise distance between each pair of clusters using a likelihood ratio that is independent of the order of data within the segments; and 4) combining the two clusters with the minimum pairwise distance into a new cluster. These steps are repeated until the number of clusters equals the number of speakers.
|
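The clustering loop described in the abstract can be illustrated with a minimal Python sketch. It rests on assumptions that are not stated in this record: each cluster is modeled by a single diagonal-covariance Gaussian, the distance is the negative log of a generalized likelihood ratio, and the names `gaussian_log_likelihood`, `glr_distance`, `cluster_segments`, `segment_len`, and `var_floor` are hypothetical. The patent's own model and distance may differ in detail.

```python
import numpy as np

def gaussian_log_likelihood(frames, var_floor=1e-6):
    """Log-likelihood of frames under a diagonal Gaussian fitted to them.

    Only the sample mean and variance are used, so the value does not
    depend on the order of the frames (assumed model, not from the source).
    """
    n = frames.shape[0]
    var = frames.var(axis=0)
    v = var + var_floor  # floored variance avoids log(0)
    return -0.5 * n * np.sum(np.log(2.0 * np.pi * v) + var / v)

def glr_distance(a, b):
    """Negative log likelihood ratio: separate models versus one joint model."""
    joint = np.vstack([a, b])
    return (gaussian_log_likelihood(a) + gaussian_log_likelihood(b)
            - gaussian_log_likelihood(joint))

def cluster_segments(frames, segment_len, n_speakers):
    """Merge equal-length segments until n_speakers clusters remain."""
    n_segments = len(frames) // segment_len  # trailing remainder is dropped
    clusters = [frames[i * segment_len:(i + 1) * segment_len]
                for i in range(n_segments)]
    members = [[i] for i in range(n_segments)]
    while len(clusters) > n_speakers:
        best = None
        # Find the pair of clusters with the minimum pairwise distance.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = glr_distance(clusters[i], clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        # Combine the closest pair into a new cluster.
        clusters[i] = np.vstack([clusters[i], clusters[j]])
        members[i] += members[j]
        del clusters[j], members[j]
    labels = [0] * n_segments
    for label, segs in enumerate(members):
        for s in segs:
            labels[s] = label
    return labels

# Synthetic example: two "speakers" with different feature means.
rng = np.random.default_rng(0)
spk_a = rng.normal(0.0, 1.0, size=(200, 4))
spk_b = rng.normal(5.0, 1.0, size=(200, 4))
frames = np.vstack([spk_a[:100], spk_b[:100], spk_a[100:], spk_b[100:]])
labels = cluster_segments(frames, segment_len=50, n_speakers=2)
print(labels)
```

The exhaustive pair search recomputes every distance after each merge, which keeps the sketch short; caching distances for unchanged pairs would be the natural optimization for longer recordings.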
Publication number |
US5659662(A) |
Publication date |
1997.08.19 |
Application number |
US19960710013 |
Filing date |
1996.09.09 |
Applicant |
XEROX CORPORATION |
Inventors |
WILCOX, LYNN D.;KIMBER, DONALD G. |
Classification |
G10L15/06;G10L15/10;G10L15/14;G10L17/00;H04R3/00;(IPC1-7):G10L9/00 |
Main classification |
G10L15/06 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|