摘要 |
<p>The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data (18), which can be extracted from a short quantity of speech, and the system modifies the base synthesis parameters (12) to more closely resemble those of the new speaker (36). More specifically, the synthesis parameters (12) may be decomposed into speaker dependent parameters (30), such as context-independent parameters, and speaker independent parameters (32), such as contextindependent parameters, and speaker independent parameters (32), such as context dependent parameters. The speaker dependent parameters (30) are adapted using enrollment data (18) from the new speaker. After adaptation, the speaker dependent parameters (30) are combined with the speaker independent parameters (32) to provide a set of personalized synthesis parameters (42).</p> |