摘要 |
<P>PROBLEM TO BE SOLVED: To obtain a corpus based speech synthesizer capable of synchronizing and outputting a plurality of multiple synthesis speeches. <P>SOLUTION: A rhythm estimation section 20 estimates rhythm of each speech by using a feature data corresponding to a plurality of different speakers or their speaking tones, or both of them, for the same input text. Each estimated rhythm is classified into an estimation result regarding speech length, and an estimation result regarding elements other than that. By exchanging the estimation result regarding speech length among estimation results, a plurality of combinations of the estimation result regarding speech length, and the estimation result regarding elements other than that, are generated. A fragment selection section 30 selects a suitable speech fragment for each combination, and a determination section 100 evaluates quality of the speech fragment for each combination, and outputs speech length corresponding to the most highly evaluated combination to the fragment selection section. The fragment selection section 30 performs speech synthesis by selecting a suitable speech fragment using the speech length. <P>COPYRIGHT: (C)2009,JPO&INPIT |