发明名称 Method and apparatus for automatic speaker-based speech clustering
摘要 Reliable speaker-based clustering of speech utterances allows improved speaker recognition and speaker-based speech segmentation. According to at least one example embodiment, an iterative bottom-up speaker-based clustering approach employs voiceprints of speech utterances, such as i-vectors. At each iteration, a clustering confidence score in terms of Silhouette Width Criterion (SWC) values is evaluated, and a pair of nearest clusters is merged into a single cluster. The pair of nearest clusters merged is determined based on a similarity score indicative of similarity between voiceprints associated with different clusters. A final clustering pattern is then determined as a set of clusters associated with an iteration corresponding to the highest clustering confidence score evaluated. The SWC used may further be a modified SWC enabling detection of an early stop of the iterative approach.
申请公布号 EP2808866(A1) 申请公布日期 2014.12.03
申请号 EP20140170662 申请日期 2014.05.30
申请人 NUANCE COMMUNICATIONS, INC. 发明人 COLIBRO, DANIELS ERNESTO;VAIR, CLAUDIO;FARRELL, KEVIN R.
分类号 G10L17/04;G06K9/62 主分类号 G10L17/04
代理机构 代理人
主权项
地址