发明名称 Blind Diarization of Recorded Calls with Arbitrary Number of Speakers
摘要 In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
申请公布号 US2015025887(A1) 申请公布日期 2015.01.22
申请号 US201414319860 申请日期 2014.06.30
申请人 VERINT SYSTEMS LTD. 发明人 Sidi Oana;Wein Ron
分类号 G10L15/06 主分类号 G10L15/06
代理机构 代理人
主权项 1. A method of diarization of audio data, the method comprising: segmenting audio data into a plurality of utterances; representing each utterance as an utterance model representative of a plurality of feature vectors of each utterance; clustering the utterance models; constructing a plurality of speaker models from the clustered utterance models; constructing a hidden Markov model of the plurality of speaker models; and decoding a sequence of identified speaker models that best corresponds to the utterances of the audio data.
地址 Herzilya Pituach IL