System and method for voice transformation, Application No. US201213625317 - 传众专利搜索

Title	System and method for voice transformation
Abstract	The present invention is a method and system for converting a speech signal into a parametric representation in terms of timbre vectors, and for recovering the speech signal from that representation. The speech signal is first segmented into non-overlapping frames using glottal closure instant information. Each frame is converted into an amplitude spectrum using a Fourier analyzer, and Laguerre functions are then used to generate a set of coefficients that constitute a timbre vector. A sequence of timbre vectors can be subjected to a variety of manipulations. The new timbre vectors are converted back into voice signals by first transforming them into amplitude spectra using Laguerre functions, then generating phase spectra from the amplitude spectra using Kramers-Kronig relations. A Fourier transformer converts the amplitude spectra and phase spectra into elementary waveforms, which are then superposed to form the output voice. The method and system can be used for voice transformation, speech synthesis, and automatic speech recognition.
Publication No.	US8744854(B1)	Publication date	2014.06.03
Application No.	US201213625317	Filing date	2012.09.24
Applicant		Inventor	Chen Chengjun Julian
Classification	G10L15/04	Main classification	G10L15/04
Agency		Agent
Main claim	1. A method of voice transformation using one or more processors comprising: segmenting the voice-signal into non-overlapping frames, wherein for voiced sections the frames are pitch periods; generating amplitude spectra of the said frames using Fourier analysis; transforming the said amplitude spectra into timbre vectors using orthogonal functions; manipulating the voice parameters of the said timbre vectors to generate a set of new timbre vectors according to the specifications of voice transformation; inverse transforming the new timbre vectors into new amplitude spectra using orthogonal functions; generating new phase spectra from the new amplitude spectra using Kramers-Kronig relations; generating elementary acoustic waves from the new amplitude spectra and the new phase spectra using Fourier transform; producing a new voice waveform by superposing the said elementary acoustic waves according to the timing data given by the new timbre vectors.
Address
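
The analysis and resynthesis steps named in the abstract and the main claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and every function name in it is hypothetical. It rests on several assumptions that the record does not state: the amplitude spectrum is sampled on a dimensionless axis x_k = (k + 0.5) * dx (the mapping from frequency in Hz to x is left open), the timbre vector is obtained by a midpoint-rule inner product with the orthonormal Laguerre functions exp(-x/2) * L_n(x), and the Kramers-Kronig step is approximated by its common discrete analogue, a minimum-phase reconstruction obtained by folding the real cepstrum of the log-amplitude spectrum. Frame segmentation at glottal closure instants is not shown.

```python
import cmath
import math


def laguerre_functions(x, order):
    """Return [phi_0(x), ..., phi_{order-1}(x)] with phi_n(x) = exp(-x/2) * L_n(x)."""
    polys = [1.0, 1.0 - x]
    for n in range(1, order - 1):
        # Recurrence: (n + 1) L_{n+1}(x) = (2n + 1 - x) L_n(x) - n L_{n-1}(x)
        polys.append(((2 * n + 1 - x) * polys[n] - n * polys[n - 1]) / (n + 1))
    damping = math.exp(-x / 2.0)
    return [damping * p for p in polys[:order]]


def to_timbre_vector(amplitude, dx, order):
    """Project an amplitude spectrum sampled at x_k = (k + 0.5) * dx onto the basis."""
    coeffs = [0.0] * order
    for k, a in enumerate(amplitude):
        basis = laguerre_functions((k + 0.5) * dx, order)
        for n in range(order):
            coeffs[n] += a * basis[n] * dx  # midpoint-rule inner product
    return coeffs


def from_timbre_vector(coeffs, dx, count):
    """Rebuild the amplitude spectrum on the same grid from the coefficients."""
    spectrum = []
    for k in range(count):
        basis = laguerre_functions((k + 0.5) * dx, len(coeffs))
        spectrum.append(sum(c * b for c, b in zip(coeffs, basis)))
    return spectrum


def dft(values, sign):
    """Unnormalised DFT with kernel exp(sign * 2*pi*i*k*m/N); O(N^2), for clarity."""
    n = len(values)
    return [sum(v * cmath.exp(sign * 2j * math.pi * k * m / n)
                for m, v in enumerate(values)) for k in range(n)]


def kramers_kronig_phase(amplitude):
    """Minimum-phase phase spectrum from a symmetric, strictly positive amplitude
    spectrum of even length N on the DFT grid (assumed discrete analogue)."""
    n = len(amplitude)
    cepstrum = [z.real / n for z in dft([math.log(a) for a in amplitude], +1)]
    folded = [0.0] * n
    folded[0], folded[n // 2] = cepstrum[0], cepstrum[n // 2]
    for m in range(1, n // 2):
        folded[m] = 2.0 * cepstrum[m]  # keep only the causal part of the cepstrum
    return [z.imag for z in dft(folded, -1)]


def elementary_wave(amplitude):
    """Inverse-transform amplitude plus reconstructed phase into one waveform."""
    phase = kramers_kronig_phase(amplitude)
    spectrum = [a * cmath.exp(1j * p) for a, p in zip(amplitude, phase)]
    return [z.real / len(amplitude) for z in dft(spectrum, +1)]


def superpose(waves, starts, length):
    """Add each elementary wave into the output at its start sample."""
    out = [0.0] * length
    for wave, start in zip(waves, starts):
        for i, v in enumerate(wave):
            if start + i < length:
                out[start + i] += v
    return out
```

In this sketch, a manipulation step would act on the list returned by `to_timbre_vector` before it is passed to `from_timbre_vector`, and the `starts` argument of `superpose` stands in for the timing data that the claim attaches to the new timbre vectors. The quadratic-time `dft` helper is chosen for readability; a fast Fourier transform would normally replace it.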