摘要 |
A novel and useful method of using labeled training data and machine learning tools to train a speaker diarization system. Intra-speaker variability profiles are created from training data consisting of an audio stream labeled where speaker changes occur (i.e. which participant is speaking at any given time). These intra-speaker variability profiles are then applied to an unlabeled audio stream to segment the audio stream into speaker homogeneous segments and to cluster segments according to speaker identity.
|