发明名称 |
Method and apparatus for automatic speaker-based speech clustering |
摘要 |
Reliable speaker-based clustering of speech utterances allows improved speaker recognition and speaker-based speech segmentation. According to at least one example embodiment, an iterative bottom-up speaker-based clustering approach employs voiceprints of speech utterances, such as i-vectors. At each iteration, a clustering confidence score in terms of Silhouette Width Criterion (SWC) values is evaluated, and a pair of nearest clusters is merged into a single cluster. The pair of nearest clusters merged is determined based on a similarity score indicative of similarity between voiceprints associated with different clusters. A final clustering pattern is then determined as a set of clusters associated with an iteration corresponding to the highest clustering confidence score evaluated. The SWC used may further be a modified SWC enabling detection of an early stop of the iterative approach. |
申请公布号 |
EP2808866(A1) |
申请公布日期 |
2014.12.03 |
申请号 |
EP20140170662 |
申请日期 |
2014.05.30 |
申请人 |
NUANCE COMMUNICATIONS, INC. |
发明人 |
COLIBRO, DANIELS ERNESTO;VAIR, CLAUDIO;FARRELL, KEVIN R. |
分类号 |
G10L17/04;G06K9/62 |
主分类号 |
G10L17/04 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|