Title of Invention |
Blind Diarization of Recorded Calls with Arbitrary Number of Speakers |
Abstract |
In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded. |
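The first three steps of the abstract (utterance models, then clustering) can be illustrated with a minimal sketch. The record does not say what form an utterance model takes or which clustering method is used, so everything here is an assumption made for illustration: each utterance model is just the mean of its feature vectors, the distance is Euclidean, and clustering is plain agglomerative merging that stops at a hypothetical `threshold`, so the number of speakers is not fixed in advance.

```python
from statistics import fmean


def utterance_model(features):
    """Summarize one utterance (a list of equal-length feature vectors)
    by its mean vector. Assumption: the record does not define the model."""
    dims = len(features[0])
    return [fmean(vec[d] for vec in features) for d in range(dims)]


def distance(a, b):
    """Euclidean distance between two utterance models (assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def cluster(models, threshold):
    """Agglomerative clustering: repeatedly merge the two closest clusters
    until the closest pair is farther apart than `threshold`.
    Returns a list of clusters, each a list of utterance indices."""
    clusters = [[i] for i in range(len(models))]
    centroids = [list(m) for m in models]
    dims = len(models[0]) if models else 0
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = distance(centroids[i], centroids[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        if d > threshold:
            break  # remaining clusters are treated as distinct speakers
        merged = clusters[i] + clusters[j]
        centroid = [fmean(models[k][dim] for k in merged) for dim in range(dims)]
        del clusters[j], centroids[j]  # j > i, so index i stays valid
        clusters[i] = merged
        centroids[i] = centroid
    return clusters


# Four toy utterances with 2-dimensional feature vectors.
utterances = [
    [[0.0, 0.0], [0.2, 0.0]],
    [[0.1, 0.1], [0.1, -0.1]],
    [[5.0, 5.0], [5.2, 5.0]],
    [[5.0, 4.9]],
]
models = [utterance_model(u) for u in utterances]
speaker_clusters = cluster(models, threshold=1.0)
```

In this toy input the first two utterances and the last two utterances end up in separate clusters; a speaker model could then be built per cluster, for example from the cluster centroid.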
Publication Number |
US2015025887(A1) |
Publication Date |
2015.01.22 |
Application Number |
US201414319860 |
Filing Date |
2014.06.30 |
Applicant |
VERINT SYSTEMS LTD. |
Inventors |
Sidi Oana;Wein Ron |
Classification Number |
G10L15/06 |
Main Classification Number |
G10L15/06 |
Agency |
|
Agent |
|
Principal Claim |
1. A method of diarization of audio data, the method comprising:
segmenting audio data into a plurality of utterances;
representing each utterance as an utterance model representative of a plurality of feature vectors of each utterance;
clustering the utterance models;
constructing a plurality of speaker models from the clustered utterance models;
constructing a hidden Markov model of the plurality of speaker models; and
decoding a sequence of identified speaker models that best corresponds to the utterances of the audio data. |
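The final two steps of the claim (a hidden Markov model over the speaker models, then decoding the best-matching speaker sequence) can be sketched with Viterbi decoding. The claim names neither the decoder nor the model parameters, so the following are assumptions for illustration only: each speaker model is a mean vector, the emission score is a unit-variance Gaussian log-likelihood up to a constant, the initial distribution is uniform, and a hypothetical `stay_prob` sets the self-transition probability.

```python
import math


def log_emission(utt, speaker_mean):
    """Log-likelihood of an utterance model under a speaker model, assuming
    a unit-variance isotropic Gaussian (additive constant dropped)."""
    return -0.5 * sum((x - m) ** 2 for x, m in zip(utt, speaker_mean))


def viterbi(utterances, speaker_means, stay_prob=0.8):
    """Return the most likely speaker index for each utterance.
    HMM states are speakers; transitions favor staying with one speaker."""
    if not utterances:
        return []
    n = len(speaker_means)
    if n == 1:
        return [0] * len(utterances)
    log_stay = math.log(stay_prob)
    log_switch = math.log((1.0 - stay_prob) / (n - 1))

    # Uniform initial distribution, so only emissions matter at step 0.
    score = [log_emission(utterances[0], m) for m in speaker_means]
    back = []  # back[t][s] = best previous state leading to s at step t + 1
    for utt in utterances[1:]:
        new_score, pointers = [], []
        for s in range(n):
            prev = max(
                range(n),
                key=lambda p: score[p] + (log_stay if p == s else log_switch),
            )
            trans = log_stay if prev == s else log_switch
            new_score.append(score[prev] + trans + log_emission(utt, speaker_means[s]))
            pointers.append(prev)
        score = new_score
        back.append(pointers)

    # Trace the best path backwards from the best final state.
    state = max(range(n), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Two toy speaker models and five utterance models (2-dimensional).
speaker_means = [[0.0, 0.0], [5.0, 5.0]]
utterance_models = [[0.1, 0.0], [0.2, -0.1], [5.1, 5.0], [4.9, 5.2], [0.0, 0.1]]
decoded = viterbi(utterance_models, speaker_means)
print(decoded)  # [0, 0, 1, 1, 0]
```

The transition term is what distinguishes this from labeling each utterance independently: with a high `stay_prob`, an ambiguous utterance tends to inherit the speaker of its neighbors rather than flip back and forth.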
Address |
Herzilya Pituach IL |