Title of Invention Hybrid Approach in Voice Conversion
Abstract A hybrid approach is described for combining frequency warping and Gaussian Mixture Modeling (GMM) to achieve better speaker identity and speech quality. To train the voice conversion GMM, line spectral frequency and other features are extracted from a set of source sounds to generate a source feature vector and from a set of target sounds to generate a target feature vector. The GMM is estimated based on the aligned source feature vector and the target feature vector. A mixture-specific warping function is generated for each set of mixture mean pairs of the GMM, and a warping function is generated based on a weighting of each of the mixture-specific warping functions. The warping function can be used to convert sounds received from a source speaker to approximate speech of a target speaker.
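The weighting step in the abstract can be illustrated with a minimal sketch. It is not the method claimed in the application: the abstract does not say how each mixture-specific warping function is built or how the weights are chosen. The sketch assumes a piecewise-linear warp through each mixture's source and target mean line spectral frequencies (LSFs), endpoints pinned at 0 and pi, diagonal covariances, and weights equal to the GMM posterior probabilities of the source frame. All function names and the toy numbers are hypothetical.

```python
import numpy as np

def mixture_warp(src_mean, tgt_mean, freqs):
    """Piecewise-linear warp that maps one mixture's source mean LSFs
    onto its target mean LSFs (assumed form, not taken from the source)."""
    # Assumption: pin the endpoints so that 0 maps to 0 and pi maps to pi.
    xs = np.concatenate(([0.0], src_mean, [np.pi]))
    ys = np.concatenate(([0.0], tgt_mean, [np.pi]))
    return np.interp(freqs, xs, ys)

def posteriors(x, priors, means, variances):
    """Posterior probability of each diagonal-covariance mixture given x."""
    log_p = (np.log(priors)
             - 0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)
             - 0.5 * np.sum((x - means) ** 2 / variances, axis=1))
    log_p -= log_p.max()          # subtract the max for numerical stability
    p = np.exp(log_p)
    return p / p.sum()

def combined_warp(x, priors, src_means, tgt_means, variances, freqs):
    """Weight the mixture-specific warps by the posteriors of frame x."""
    post = posteriors(x, priors, src_means, variances)
    warps = np.stack([mixture_warp(s, t, freqs)
                      for s, t in zip(src_means, tgt_means)])
    return post @ warps           # convex combination, one value per frequency

# Toy data (hypothetical): 2 mixtures, 2 LSFs per frame, in radians.
priors = np.array([0.5, 0.5])
src_means = np.array([[0.5, 1.5], [1.0, 2.5]])
tgt_means = np.array([[0.6, 1.7], [1.2, 2.6]])
variances = np.full((2, 2), 0.05)
freqs = np.linspace(0.0, np.pi, 9)

frame = np.array([0.55, 1.6])     # a source frame near the first mixture
warped = combined_warp(frame, priors, src_means, tgt_means, variances, freqs)
```

Because the weights are non-negative and sum to one, the combined function stays monotonic and keeps the pinned endpoints whenever each mixture-specific warp does.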
Publication No. US2009171657(A1) Publication Date 2009.07.02
Application No. US20070966255 Filing Date 2007.12.28
Applicant NOKIA CORPORATION Inventors TIAN JILEI;POPA VICTOR;NURMINEN JANI KRISTIAN
Classification G10L19/04 Main Classification G10L19/04
Agency Agent
Principal Claim
Address