Abstract |
<p>A simply configured speech synthesis device and the like for producing natural synthetic speech at high speed. When data representing a message template is supplied, a voice piece editor (5) searches a voice piece database (7) for voice piece data whose sound matches that of a voice piece in the message template. The voice piece editor (5) also predicts the cadence of the message template and, according to the cadence prediction result, selects from the retrieved voice piece data the best match for each voice piece in the message template, one at a time. For any voice piece for which no match can be selected, an acoustic processor (41) is instructed to supply waveform data representing the waveform of each unit voice in that voice piece. The selected voice piece data and the waveform data supplied by the acoustic processor (41) are then combined to generate data representing the synthetic speech.</p> |
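The selection-with-fallback flow described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the names `VoicePiece`, `synthesize`, `predict_cadence`, `rule_synthesize`, and `max_pitch_gap` are hypothetical, and reducing "cadence" to a single pitch value with a fixed tolerance is an assumption made for brevity. The abstract does not state how the best match is scored.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class VoicePiece:
    sound: str             # label used to match database entries to template pieces
    pitch: float           # assumed one-number stand-in for the piece's cadence
    waveform: List[float]  # recorded samples for this voice piece


def synthesize(
    template: Sequence[str],
    database: Dict[str, List[VoicePiece]],
    predict_cadence: Callable[[Sequence[str]], List[float]],
    rule_synthesize: Callable[[str], List[float]],
    max_pitch_gap: float = 30.0,  # assumed tolerance; not taken from the source
) -> List[float]:
    """Combine recorded voice pieces with rule-synthesized waveforms."""
    # Predict the cadence of the whole template: one target per voice piece.
    targets = predict_cadence(template)
    output: List[float] = []
    for sound, target in zip(template, targets):
        # Search the database for pieces whose sound matches.
        candidates = database.get(sound, [])
        # Pick the candidate closest to the predicted cadence, if any exists.
        best = min(candidates, key=lambda c: abs(c.pitch - target), default=None)
        if best is not None and abs(best.pitch - target) <= max_pitch_gap:
            output.extend(best.waveform)
        else:
            # No acceptable match: fall back to the acoustic processor stand-in.
            output.extend(rule_synthesize(sound))
    return output
```

In this sketch, `rule_synthesize` plays the role of the acoustic processor (41): it is called only for voice pieces that have no database entry or whose closest entry falls outside the assumed tolerance, so recorded data is used wherever it fits the predicted cadence.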