发明名称 Apparatus and method for editing speech synthesis, and computer readable medium
摘要 An acquisition unit analyzes a text, and acquires phonemic and prosodic information. An editing unit edits a part of the phonemic and prosodic information. A speech synthesis unit converts the phonemic and prosodic information before editing the part to a first speech waveform, and converts the phonemic and prosodic information after editing the part to a second speech waveform. A period calculation unit calculates a contrast period corresponding to the part in the first speech waveform and the second speech waveform. A speech generation unit generates an output waveform by connecting a first partial waveform and a second partial waveform. The first partial waveform contains the contrast period of the first speech waveform. The second partial waveform contains the contrast period of the second speech waveform.
申请公布号 US9020821(B2) 申请公布日期 2015.04.28
申请号 US201113235656 申请日期 2011.09.19
申请人 Kabushiki Kaisha Toshiba 发明人 Nishiyama Osamu
分类号 G10L13/033;G10L13/08 主分类号 G10L13/033
代理机构 Amin, Turocy & Watson, LLP 代理人 Amin, Turocy & Watson, LLP
主权项 1. An apparatus for editing speech synthesis, comprising: an acquisition unit, executed by a computer using a program stored in a memory device, configured to analyze a text, and to acquire a phonemic and prosodic information to synthesize a speech corresponding to the text; a display that displays the phonemic and prosodic information; an editing unit, executed by the computer, configured to edit at least a part of the phonemic and prosodic information displayed on the display; a speech synthesis unit, executed by the computer, configured to convert the phonemic and prosodic information in which the part is not edited to a first speech waveform, and to convert the phonemic and prosodic information in which the part is edited to a second speech waveform; a period calculation unit, executed by the computer, configured to specify a partial sequence corresponding to the part not edited in the phonemic and prosodic information, and the part edited in the phonemic and prosodic information respectively, and to calculate a contrast period corresponding to the partial sequence in the first speech waveform and the second speech waveform respectively; a speech generation unit, executed by the computer, configured to generate an output waveform by connecting a first partial waveform and a second partial waveform, the first partial waveform being the contrast period of the first speech waveform, the second partial waveform being the contrast period of the second speech waveform; and a speaker that reproduces the output waveform.
地址 Tokyo JP
您可能感兴趣的专利