发明名称 Discriminative training for speaker and speech verification
摘要 A method for discriminatively training acoustic models is provided for automated speaker verification (SV) and speech (or utterance) verification (UV) systems. The method includes: defining a likelihood ratio for a given speech segment, whose speaker identity (for SV system) or linguist identity (for UV system) is known, using a corresponding acoustic model, and an alternative acoustic model which represents all other speakers (in SV) or all other linguist identities (in UV); determining an average likelihood ratio score for the likelihood ratio scores over a set of training utterances (referred to as true data set) whose speaker identities (for SV) or linguist identities (for UV) are the same; determining an average likelihood ratio score for the likelihood ratio scores over a competing set of training utterances which excludes the speech data in the true data set (referred to as competing data set); and optimizing a difference between the average likelihood ratio score over the true data set and the average likelihood ratio score over the competing data set, thereby improving the acoustic model.
申请公布号 US7454339(B2) 申请公布日期 2008.11.18
申请号 US20050312981 申请日期 2005.12.20
申请人 PANASONIC CORPORATION 发明人 LIU CHAOJUN;KRYZE DAVID;RIGAZIO LUCA
分类号 G10L15/06;G10L17/00 主分类号 G10L15/06
代理机构 代理人
主权项
地址