ADVANCED RECURRENT NEURAL NETWORK BASED LETTER-TO-SOUND
Abstract
The technology relates to letter-to-sound conversion using recurrent neural networks (RNNs), which may be implemented as RNN modules. The RNN modules receive text input and convert it to the corresponding phonemes. In determining those phonemes, the RNN modules may analyze the letters of the text together with the letters surrounding them, and may also analyze the letters in reverse order. The RNN modules may additionally receive contextual information about the input text, in which case the letter-to-sound conversion may also be based on that information. The determined phonemes may be used to generate synthesized speech from the input text.
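The abstract does not specify a network architecture, so the following is only a minimal sketch of the general idea. It assumes a simple Elman-style recurrence, a forward pass and a reverse-order pass over the letters, and a small context vector appended to each letter's input. The names `letter_to_sound`, `run_rnn`, `PHONEMES`, and `HIDDEN`, the toy phoneme inventory, and the one-phoneme-per-letter alignment are all hypothetical. The weights are random and untrained, so the output is structurally valid but phonetically meaningless.

```python
import math
import random

LETTERS = "abcdefghijklmnopqrstuvwxyz"
# Hypothetical toy phoneme inventory; "_" stands for "no sound".
PHONEMES = ["AH", "B", "K", "D", "EH", "S", "T", "_"]
HIDDEN = 8  # assumed hidden-state size


def one_hot(index, size):
    """Return a list of zeros with a single 1.0 at `index`."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def rand_matrix(rng, rows, cols):
    """Random (untrained) weight matrix as a list of rows."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    """Matrix-vector product for list-of-rows matrices."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def run_rnn(inputs, w_in, w_rec):
    """Elman recurrence: h_t = tanh(W_in x_t + W_rec h_{t-1})."""
    h = [0.0] * HIDDEN
    states = []
    for x in inputs:
        a = matvec(w_in, x)
        b = matvec(w_rec, h)
        h = [math.tanh(p + q) for p, q in zip(a, b)]
        states.append(h)
    return states


def letter_to_sound(word, context, seed=0):
    """Map each letter of `word` (a-z only) to one phoneme label.

    `context` is an assumed list of floats carrying contextual
    information (for example part-of-speech flags) for the whole word.
    """
    rng = random.Random(seed)
    in_dim = len(LETTERS) + len(context)
    w_in_f = rand_matrix(rng, HIDDEN, in_dim)
    w_rec_f = rand_matrix(rng, HIDDEN, HIDDEN)
    w_in_b = rand_matrix(rng, HIDDEN, in_dim)
    w_rec_b = rand_matrix(rng, HIDDEN, HIDDEN)
    w_out = rand_matrix(rng, len(PHONEMES), 2 * HIDDEN)

    # Each input is the letter's one-hot vector plus the context vector.
    inputs = [one_hot(LETTERS.index(ch), len(LETTERS)) + list(context)
              for ch in word.lower()]

    # Forward pass sees preceding letters; the reverse-order pass sees
    # following letters. Its states are flipped back to letter order.
    fwd = run_rnn(inputs, w_in_f, w_rec_f)
    bwd = run_rnn(inputs[::-1], w_in_b, w_rec_b)[::-1]

    phonemes = []
    for f, b in zip(fwd, bwd):
        scores = matvec(w_out, f + b)  # concatenate both directions
        phonemes.append(PHONEMES[scores.index(max(scores))])
    return phonemes
```

A call such as `letter_to_sound("cat", context=[1.0, 0.0])` returns one label from `PHONEMES` per input letter. A practical system would train the weights on a pronunciation lexicon and would need an alignment scheme for letters that map to zero or several phonemes, neither of which this sketch attempts.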
Application publication number
WO2015191651(A1)
Application publication date
2015.12.17
Application number
WO2015US34993
Filing date
2015.06.10
Applicant
MICROSOFT TECHNOLOGY LICENSING, LLC
Inventors
ZHAO, PEI;YAO, KAISHENG;LEUNG, MAX;HWANG, MEI-YUH;ZHAO, SHENG;YAN, BO;ZWEIG, GEOFFREY;ALLEVA, FILENO A.