Title of Invention |
VOICE FONT SPEAKER AND PROSODY INTERPOLATION |
Abstract |
Multi-voice font interpolation is provided. A multi-voice font interpolation engine allows the production of computer-generated speech with a wide variety of speaker characteristics and/or prosody by interpolating speaker characteristics and prosody from existing fonts. Using prediction models from multiple voice fonts, the multi-voice font interpolation engine predicts values for the parameters that influence speaker characteristics and/or prosody for the phoneme sequence obtained from the text to be spoken. For each parameter, additional parameter values are generated by a weighted interpolation from the predicted values. Modifying an existing voice font with the interpolated parameters changes the style and/or emotion of the speech while retaining the base sound qualities of the original voice. The multi-voice font interpolation engine allows the speaker characteristics and/or prosody to be transplanted from one voice font to another, or entirely new speaker characteristics and/or prosody to be generated for an existing voice font. |
Application Publication Number |
EP3111442(A1) |
Application Publication Date |
2017.01.04 |
Application Number |
EP20150707242 |
Filing Date |
2015.02.23 |
Applicant |
Microsoft Technology Licensing, LLC |
Inventors |
LUAN, Jian;HE, Lei;LEUNG, Max |
Classification Numbers |
G10L13/08;G10L13/033 |
Main Classification Number |
G10L13/08 |
Agency |
|
Agent |
|
Independent Claim |
|
Address |
|
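The weighted interpolation described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function name `interpolate_parameter`, the font names, the pitch values, and the choice to normalize the weights so they sum to 1 are all assumptions made for illustration. The sketch takes per-phoneme values that several voice fonts predicted for one parameter and blends them into a new value per phoneme.

```python
from typing import Dict, List


def interpolate_parameter(
    predictions: Dict[str, List[float]],
    weights: Dict[str, float],
) -> List[float]:
    """Blend per-phoneme predictions for ONE parameter across voice fonts.

    predictions: hypothetical mapping of voice font name -> predicted value
                 per phoneme (all lists must have the same length).
    weights:     hypothetical mapping of voice font name -> blend weight.
                 Weights are normalized here (an assumption, not something
                 the abstract specifies).
    """
    if not predictions:
        raise ValueError("at least one voice font prediction is required")
    if set(predictions) != set(weights):
        raise ValueError("predictions and weights must name the same fonts")

    lengths = {len(values) for values in predictions.values()}
    if len(lengths) != 1:
        raise ValueError("all fonts must predict the same phoneme sequence")

    total = sum(weights.values())
    if total == 0:
        raise ValueError("weights must not sum to zero")

    blended = [0.0] * lengths.pop()
    for font, values in predictions.items():
        w = weights[font] / total  # normalized weight for this font
        for i, value in enumerate(values):
            blended[i] += w * value
    return blended


# Illustrative per-phoneme pitch predictions (Hz) from two made-up fonts.
pitch_predictions = {
    "neutral": [100.0, 120.0],
    "excited": [200.0, 160.0],
}
blended_pitch = interpolate_parameter(
    pitch_predictions, {"neutral": 3.0, "excited": 1.0}
)
print(blended_pitch)  # [125.0, 130.0]
```

The same function would be called once per parameter (for example pitch, duration, or a spectral feature), and the blended values would then replace the corresponding values of the base voice font, which is how the abstract describes changing style while keeping the base sound qualities.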