Abstract |
A speech synthesis device includes: a mouth-opening-degree generation unit which generates, for each of phonemes generated from input text, a mouth-opening-degree corresponding to oral-cavity volume, using information generated from the text and indicating the type and position of the phoneme within the text, such that the generated mouth-opening-degree is larger for a phoneme at the beginning of a sentence in the text than for a phoneme at the end of the sentence; a segment selection unit which selects, for each of the generated phonemes, segment information corresponding to the phoneme from among pieces of segment information stored in a segment storage unit and including phoneme type, mouth-opening-degree, and speech segment data, based on the type of the phoneme and the generated mouth-opening-degree; and a synthesis unit which generates synthetic speech of the text, using the selected pieces of segment information and pieces of prosody information generated from the text.
|
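The two selection steps described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: the mouth-opening-degree is modeled as a linear decline from sentence start to sentence end (the abstract requires only that it be larger at the beginning than at the end, and it also takes phoneme type into account, which this sketch ignores), the segment is chosen by smallest absolute difference in mouth-opening-degree among segments of the same phoneme type, and the names `Segment`, `generate_opening_degrees`, and `select_segment` are hypothetical. Prosody generation and waveform synthesis are omitted.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One entry of the segment storage unit (hypothetical layout)."""
    phoneme_type: str
    mouth_opening_degree: float
    data: bytes  # placeholder for speech segment data


def generate_opening_degrees(phonemes, start=1.0, end=0.5):
    """Assign each phoneme a degree that is larger at the sentence start.

    Assumption: a linear decline from `start` to `end`; the values 1.0 and
    0.5 are illustrative only.
    """
    n = len(phonemes)
    if n <= 1:
        return [start] * n
    return [start + (end - start) * i / (n - 1) for i in range(n)]


def select_segment(phoneme, degree, store):
    """Pick the stored segment of the same phoneme type whose
    mouth-opening-degree is closest to the generated one (assumed criterion).
    """
    candidates = [s for s in store if s.phoneme_type == phoneme]
    if not candidates:
        raise KeyError(f"no segment stored for phoneme {phoneme!r}")
    return min(candidates, key=lambda s: abs(s.mouth_opening_degree - degree))


def select_for_sentence(phonemes, store):
    """Generate degrees for a sentence and select one segment per phoneme."""
    degrees = generate_opening_degrees(phonemes)
    return [select_segment(p, d, store) for p, d in zip(phonemes, degrees)]
```

With a store holding two variants of the same vowel, the variant with the larger mouth-opening-degree would be chosen for that vowel at the start of a sentence and the smaller one at the end, which is the behavior the abstract describes.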