主权项 |
1. An apparatus for editing speech synthesis, comprising:
an acquisition unit, executed by a computer using a program stored in a memory device, configured to analyze a text, and to acquire a phonemic and prosodic information to synthesize a speech corresponding to the text; a display that displays the phonemic and prosodic information; an editing unit, executed by the computer, configured to edit at least a part of the phonemic and prosodic information displayed on the display; a speech synthesis unit, executed by the computer, configured to convert the phonemic and prosodic information in which the part is not edited to a first speech waveform, and to convert the phonemic and prosodic information in which the part is edited to a second speech waveform; a period calculation unit, executed by the computer, configured to specify a partial sequence corresponding to the part not edited in the phonemic and prosodic information, and the part edited in the phonemic and prosodic information respectively, and to calculate a contrast period corresponding to the partial sequence in the first speech waveform and the second speech waveform respectively; a speech generation unit, executed by the computer, configured to generate an output waveform by connecting a first partial waveform and a second partial waveform, the first partial waveform being the contrast period of the first speech waveform, the second partial waveform being the contrast period of the second speech waveform; and a speaker that reproduces the output waveform. |