发明名称 Unsupervised speaker clustering for automatic speaker indexing of recorded audio data
摘要 A system and method for unsupervised clustering of audio data segments in an audio data recording containing speech from multiple speakers including the steps of: 1) providing a portion of the audio data containing speech from all of the speakers; 2) forming initial clusters by dividing the portion of the audio data into segments, each of which includes an ordered data set; 3) computing the pairwise distance between each pair of clusters using a likelihood ration independent of the order of data within the segments; and 4) combining into a new cluster the two clusters with a minimum pairwise distance. These steps are repeated until a number of clusters equal to the number of speakers is obtained.
申请公布号 US5659662(A) 申请公布日期 1997.08.19
申请号 US19960710013 申请日期 1996.09.09
申请人 XEROX CORPORATION 发明人 WILCOX, LYNN D.;KIMBER, DONALD G.
分类号 G10L15/06;G10L15/10;G10L15/14;G10L17/00;H04R3/00;(IPC1-7):G10L9/00 主分类号 G10L15/06
代理机构 代理人
主权项
地址