发明名称 Voice personalization of speech synthesizer
摘要 The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be extracted from a short quantity of speech, and the system modifies the base synthesis parameters to more closely resemble those of the new speaker. More specifically, the synthesis parameters may be decomposed into speaker dependent parameters, such as context-independent parameters, and speaker independent parameters, such as context dependent parameters. The speaker dependent parameters are adapted using enrollment data from the new speaker. After adaptation, the speaker dependent parameters are combined with the speaker independent parameters to provide a set of personalized synthesis parameters. To adapt the parameters with a small amount of enrollment data, an eigenspace is constructed and used to constrain the position of the new speaker so that context independent parameters not provided by the new speaker may be estimated.
申请公布号 US6970820(B2) 申请公布日期 2005.11.29
申请号 US20010792928 申请日期 2001.02.26
申请人 MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. 发明人 JUNQUA JEAN-CLAUDE;PERRONNIN FLORENT;KUHN ROLAND;NGUYEN PATRICK
分类号 G10L13/08;G10L13/02;G10L13/04;G10L13/06;G10L21/00;(IPC1-7):G10L13/00 主分类号 G10L13/08
代理机构 代理人
主权项
地址