Title of invention: METHOD FOR FORMING THE EXCITATION SIGNAL FOR A GLOTTAL PULSE MODEL BASED PARAMETRIC SPEECH SYNTHESIS SYSTEM
Abstract: A method is presented for forming the excitation signal for a glottal pulse model based parametric speech synthesis system. In one embodiment, fundamental frequency values are used to form the excitation signal. The excitation is modeled using a voice source pulse selected from a database of a given speaker. The voice source signal is segmented into glottal segments, which are represented as vectors to identify the glottal pulse used to form the excitation signal. Using a novel distance metric and preserving the original signals extracted from the speaker's voice samples helps capture low-frequency information of the excitation signal. In addition, segment edge artifacts are removed by applying a unique segment joining method, which improves the quality of the synthetic speech while creating a true representation of the speaker's voice quality.
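Publication number: EP3149727(A1) Publication date: 2017.04.05
Application number: EP20140893138 Filing date: 2014.05.28
Applicant: Interactive Intelligence Group, Inc. Inventors: DACHIRAJU, Rajesh; GANAPATHIRAJU, Aravind
Classification: G10L13/00 Main classification: G10L13/00
Agency: Agent:
Main claim:
Address: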
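
The pipeline outlined in the abstract (segment the voice source signal, represent segments as vectors, pick one pulse, place it at the pitch periods given by the fundamental frequency values) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the function names, the boundary indices, and the fixed vector length are hypothetical. The plain Euclidean distance to the centroid stands in for the distance metric of the invention, which this record does not describe, and the segment joining method is not reproduced (pulses are simply concatenated).

```python
import math

def segment(signal, boundaries):
    """Cut the signal into glottal segments at the given boundary indices.
    Assumes the boundaries are strictly increasing (hypothetical input)."""
    return [signal[a:b] for a, b in zip(boundaries, boundaries[1:])]

def resample(seg, n):
    """Linearly interpolate a segment to exactly n samples."""
    if len(seg) == 1 or n == 1:
        return [seg[0]] * n
    out = []
    for i in range(n):
        pos = i * (len(seg) - 1) / (n - 1)
        lo = int(pos)
        hi = min(lo + 1, len(seg) - 1)
        frac = pos - lo
        out.append(seg[lo] * (1 - frac) + seg[hi] * frac)
    return out

def euclidean(u, v):
    """Stand-in metric; NOT the distance metric of the invention."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def select_pulse(segments, dim=32):
    """Return the original segment whose fixed-length vector lies
    closest to the centroid of all segment vectors."""
    vectors = [resample(s, dim) for s in segments]
    centroid = [sum(col) / len(vectors) for col in zip(*vectors)]
    best = min(range(len(vectors)),
               key=lambda i: euclidean(vectors[i], centroid))
    return segments[best]

def build_excitation(pulse, f0_values, sample_rate):
    """Place one copy of the pulse per voiced F0 value, stretched to the
    pitch period. Unvoiced values (f0 <= 0) are skipped in this sketch,
    and no edge smoothing is applied between consecutive pulses."""
    excitation = []
    for f0 in f0_values:
        if f0 <= 0:
            continue
        period = max(1, round(sample_rate / f0))
        excitation.extend(resample(pulse, period))
    return excitation
```

Note that `select_pulse` returns the original, unresampled segment, which mirrors the abstract's point about preserving the signals extracted from the speaker's voice samples; the fixed-length vectors are used only for comparison.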