发明名称 Trajectory Tiling Approach for Text-to-Speech
摘要 Hidden Markov Models HMM trajectory tiling (HTT)-based approaches may be used to synthesize speech from text. In operation, a set of Hidden Markov Models (HMMs) and a set of waveform units may be obtained from a speech corpus. The set of HMMs are further refined via minimum generation error (MGE) training to generate a refined set of HMMs. Subsequently, a speech parameter trajectory may be generated by applying the refined set of HMMs to an input text. A unit lattice of candidate waveform units may be selected from the set of waveform units based at least on the speech parameter trajectory. A normalized cross-correlation (NCC)-based search on the unit lattice may be performed to obtain a minimal concatenation cost sequence of candidate waveform units, which are concatenated into a concatenated waveform sequence that is synthesized into speech.
申请公布号 US2012143611(A1) 申请公布日期 2012.06.07
申请号 US20100962543 申请日期 2010.12.07
申请人 QIAN YAO;YAN ZHI-JIE;WU YI-JIAN;SOONG FRANK KAO-PING;MICROSOFT CORPORATION 发明人 QIAN YAO;YAN ZHI-JIE;WU YI-JIAN;SOONG FRANK KAO-PING
分类号 G10L13/00 主分类号 G10L13/00
代理机构 代理人
主权项
地址