Title of invention: METHODS AND APPARATUS FOR PREDICTING PROSODY IN SPEECH SYNTHESIS
Abstract: Techniques for predicting prosody in speech synthesis may make use of a data set of example text fragments with corresponding aligned spoken audio. To predict prosody for synthesizing an input text, the input text may be compared with the data set of example text fragments to select a best matching sequence of one or more example text fragments, with each example text fragment in the sequence paired with a portion of the input text. The selected example text fragment sequence may be aligned with the input text, e.g., at the word level, so that prosody may be extracted from the audio aligned with the example text fragments. The extracted prosody may then be applied to the synthesis of the input text using the alignment between the input text and the example text fragments.
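The flow described in the abstract (select a best matching fragment sequence, align it with the input at the word level, then transfer prosody) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `WordProsody`, `ExampleFragment`, `select_fragment_sequence` and `predict_prosody` are hypothetical, prosody is reduced to one pitch and one duration value per word, a fragment may only cover an input span of the same word count, and the matching score (identical words minus a fixed per-fragment penalty) is a stand-in for whatever comparison the actual method uses.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class WordProsody:
    """Hypothetical per-word prosody, as if extracted from aligned audio."""
    pitch_hz: float
    duration_s: float


@dataclass
class ExampleFragment:
    """Hypothetical example text fragment with one prosody entry per word."""
    words: List[str]
    prosody: List[WordProsody]


# Assumed fixed cost per fragment, so that fewer, longer fragments are preferred.
FRAGMENT_PENALTY = 0.5


def match_score(span: List[str], fragment_words: List[str]) -> int:
    """Count position-wise identical words; unequal lengths never match."""
    if len(span) != len(fragment_words):
        return -1
    return sum(a == b for a, b in zip(span, fragment_words))


def select_fragment_sequence(
    input_words: List[str], examples: List[ExampleFragment]
) -> Optional[List[Tuple[int, int, ExampleFragment]]]:
    """Dynamic programming over input positions.

    best[i] holds (score, sequence) for the best cover of input_words[:i],
    where each sequence item pairs an input span [start, end) with a fragment.
    Returns None when no sequence of fragments covers the whole input.
    """
    n = len(input_words)
    best: List[Optional[Tuple[float, list]]] = [None] * (n + 1)
    best[0] = (0.0, [])
    for end in range(1, n + 1):
        for start in range(end):
            if best[start] is None:
                continue
            span = input_words[start:end]
            for frag in examples:
                matches = match_score(span, frag.words)
                if matches <= 0:  # require at least one identical word
                    continue
                total = best[start][0] + matches - FRAGMENT_PENALTY
                if best[end] is None or total > best[end][0]:
                    best[end] = (total, best[start][1] + [(start, end, frag)])
    return best[n][1] if best[n] is not None else None


def predict_prosody(
    text: str, examples: List[ExampleFragment]
) -> Optional[List[Tuple[str, WordProsody]]]:
    """Pair each input word with the prosody of its aligned example word."""
    input_words = text.lower().split()
    sequence = select_fragment_sequence(input_words, examples)
    if sequence is None:
        return None
    transferred: List[WordProsody] = []
    for _start, _end, frag in sequence:
        # Spans and fragments have equal length, so alignment is positional.
        transferred.extend(frag.prosody)
    return list(zip(input_words, transferred))
```

With a data set containing the fragments "good morning", "how are you" and "good", calling `predict_prosody("Good morning how are you", examples)` would cover the input with the first two fragments rather than the single-word one, because of the per-fragment penalty. A real system would also need a fallback for input spans that no example fragment covers; this sketch simply returns `None` in that case.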
Publication number: US2012191457(A1) Publication date: 2012.07.26
Application number: US201113012740 Filing date: 2011.01.24
Applicant: MINNIS STEPHEN; BREEN ANDREW P.; NUANCE COMMUNICATIONS, INC. Inventor: MINNIS STEPHEN; BREEN ANDREW P.
Classification: G10L13/08 Main classification: G10L13/08
Agency: Agent:
Principal claim:
Address: