Title of invention |
Method, device and system for using statistical information to reduce computation and memory requirements of a neural network based speech synthesis system |
Abstract |
A method (400), device and system (300) provide, in response to linguistic information, efficient generation of a parametric representation of speech using a neural network. The method, which provides a refined parametric representation of speech in response to linguistic information, comprises the steps of: A) using a data selection module to retrieve representative parameter vectors for each segment description according to its phonetic segment type and the phonetic segment types included in adjacent segment descriptions; B) interpolating between the representative parameter vectors according to the segment descriptions and duration to provide interpolated statistical parameters; C) converting the interpolated statistical parameters and linguistic information to neural network input parameters; D) using a statistically enhanced neural network, or a neural network with a post-processor, to provide neural network output parameters that correspond to a parametric representation of speech; and E) converting the neural network output parameters to a refined parametric representation of speech.
|
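Steps A and B of the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the patented implementation: the names `select_vectors` and `interpolate`, the lookup table keyed by (previous, current, next) segment type, the `"sil"` padding symbol, and the choice of linear interpolation (the abstract does not say which interpolation scheme is used).

```python
from typing import Dict, List, Sequence, Tuple

Vector = List[float]
Context = Tuple[str, str, str]  # hypothetical key: (previous, current, next) segment type


def select_vectors(phones: Sequence[str], table: Dict[Context, Vector],
                   pad: str = "sil") -> List[Vector]:
    """Step A (sketch): pick one representative vector per segment from its context."""
    padded = [pad, *phones, pad]  # pad both ends so every segment has two neighbours
    return [table[(padded[i - 1], padded[i], padded[i + 1])]
            for i in range(1, len(padded) - 1)]


def interpolate(vectors: Sequence[Vector], durations: Sequence[int]) -> List[Vector]:
    """Step B (sketch): emit one vector per frame, moving linearly toward the next segment."""
    frames: List[Vector] = []
    for i, (vec, n_frames) in enumerate(zip(vectors, durations)):
        # The last segment has no successor, so it is held constant (an assumption).
        nxt = vectors[i + 1] if i + 1 < len(vectors) else vec
        for f in range(n_frames):
            t = f / n_frames  # 0.0 at the start of the segment, approaching 1.0 at its end
            frames.append([(1 - t) * a + t * b for a, b in zip(vec, nxt)])
    return frames


# Toy data: two segments with made-up two-dimensional parameter vectors.
table = {("sil", "a", "b"): [0.0, 2.0], ("a", "b", "sil"): [4.0, 6.0]}
vectors = select_vectors(["a", "b"], table)
frames = interpolate(vectors, durations=[2, 1])  # durations are frame counts
```

In a full system of the kind the abstract describes, the per-frame vectors in `frames` would be combined with the linguistic information to form the neural network input (step C); that conversion is not sketched here.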
Publication number |
US5913194(A) |
Publication date |
1999.06.15 |
Application number |
US19970892295 |
Filing date |
1997.07.14 |
Applicant |
MOTOROLA, INC. |
Inventors |
KARAALI, ORHAN;MASSEY, NOEL;CORRIGAN, GERALD |
Classification |
G10L13/02;(IPC1-7):G10L5/02 |
Main classification |
G10L13/02 |
Patent agency |
|
Patent agent |
|
Main claim |
|
Address |
|