Title: Method, device and system for using statistical information to reduce computation and memory requirements of a neural network based speech synthesis system
Abstract: A method (400), device and system (300) provide, in response to linguistic information, efficient generation of a parametric representation of speech using a neural network. The method provides, in response to linguistic information, efficient generation of a refined parametric representation of speech, comprising the steps of: A) using a data selection module to retrieve representative parameter vectors for each segment description according to the phonetic segment type and the phonetic segment types included in adjacent segment descriptions; B) interpolating between the representative parameter vectors according to the segment descriptions and duration to provide interpolated statistical parameters; C) converting the interpolated statistical parameters and linguistic information to neural network input parameters; D) utilizing a statistically enhanced neural network/neural network with post-processor to provide neural network output parameters that correspond to a parametric representation of speech; and E) converting the neural network output parameters to a refined parametric representation of speech.
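Steps A and B of the abstract can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: the table contents, the fallback to a context-independent entry, the use of each segment's midpoint as the target position, and the choice of linear interpolation. The names `REPRESENTATIVE`, `select_vector` and `interpolate` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Vector = List[float]
# Key: (previous segment type, current segment type, next segment type)
Context = Tuple[Optional[str], str, Optional[str]]

# Hypothetical table of representative parameter vectors (made-up values).
REPRESENTATIVE: Dict[Context, Vector] = {
    (None, "k", "ae"): [1.0, 0.0],
    ("k", "ae", "t"): [3.0, 2.0],
    (None, "t", None): [5.0, 4.0],  # context-independent fallback for "t"
}


def select_vector(prev: Optional[str], cur: str, nxt: Optional[str]) -> Vector:
    """Step A (sketch): look up by segment type and its neighbours.

    Assumption: if the full context is missing, fall back to an entry
    keyed on the current segment type alone.
    """
    for key in ((prev, cur, nxt), (None, cur, None)):
        if key in REPRESENTATIVE:
            return REPRESENTATIVE[key]
    raise KeyError(f"no representative vector for segment type {cur!r}")


def interpolate(segments: List[Tuple[str, int]]) -> List[Vector]:
    """Step B (sketch): one interpolated vector per frame.

    `segments` is a list of (segment type, duration in frames), each
    duration >= 1. Each representative vector is treated as a target at
    the midpoint of its segment; frames between two midpoints are
    linearly interpolated, frames outside them hold the nearest target.
    """
    types = [seg for seg, _ in segments]
    targets: List[Vector] = []
    centers: List[float] = []
    start = 0
    for i, (seg, dur) in enumerate(segments):
        prev = types[i - 1] if i > 0 else None
        nxt = types[i + 1] if i + 1 < len(types) else None
        targets.append(select_vector(prev, seg, nxt))
        centers.append(start + (dur - 1) / 2.0)
        start += dur

    frames: List[Vector] = []
    for t in range(start):
        if t <= centers[0]:
            frames.append(list(targets[0]))
        elif t >= centers[-1]:
            frames.append(list(targets[-1]))
        else:
            # Index of the last midpoint at or before frame t.
            j = max(i for i, c in enumerate(centers) if c <= t)
            w = (t - centers[j]) / (centers[j + 1] - centers[j])
            frames.append(
                [(1 - w) * a + w * b for a, b in zip(targets[j], targets[j + 1])]
            )
    return frames
```

Calling `interpolate([("k", 3), ("ae", 3), ("t", 3)])` yields nine frame vectors that move from the "k" target through the "ae" target to the "t" target. In the pipeline the abstract describes, such per-frame vectors would then be combined with the linguistic information to form the neural network input (step C), which this sketch does not cover.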
Publication number: US5913194(A)
Publication date: 1999.06.15
Application number: US19970892295
Filing date: 1997.07.14
Applicant: MOTOROLA, INC.
Inventors: KARAALI, ORHAN; MASSEY, NOEL; CORRIGAN, GERALD
Classification: G10L13/02; (IPC1-7): G10L5/02
Main classification: G10L13/02
Agency:
Agent:
Main claim:
Address: