发明名称 VOICE FONT SPEAKER AND PROSODY INTERPOLATION
摘要 Multi-voice font interpolation is provided. A multi-voice font interpolation engine allows the production of computer generated speech with a wide variety of speaker characteristics and/or prosody by interpolating speaker characteristics and prosody from existing fonts. Using prediction models from multiple voice fonts, the multi-voice font interpolation engine predicts values for the parameters that influence speaker characteristics and/or prosody for the phoneme sequence obtained from the text to spoken. For each parameter, additional parameter values are generated by a weighted interpolation from the predicted values. Modifying an existing voice font with the interpolated parameters changes the style and/or emotion of the speech while retaining the base sound qualities of the original voice. The multi-voice font interpolation engine allows the speaker characteristics and/or prosody to be transplanted from one voice font to another or entirely new speaker characteristics and/or prosody to be generated for an existing voice font.
申请公布号 EP3111442(A1) 申请公布日期 2017.01.04
申请号 EP20150707242 申请日期 2015.02.23
申请人 Microsoft Technology Licensing, LLC 发明人 LUAN, Jian;HE, Lei;LEUNG, Max
分类号 G10L13/08;G10L13/033 主分类号 G10L13/08
代理机构 代理人
主权项
地址