发明名称 |
SYNTHESIS OF SPEECH FROM PITCH PROTOTYPE WAVEFORMS BY TIME-SYNCHRONOUS WAVEFORM INTERPOLATION |
摘要 |
In a method of synthesizing voiced speech from pitch prototype waveforms by time-synchronous waveform interpolation (TSWI), one or more pitch prototypes is extracted from a speech signal or a residue signal (300). The extraction process is performed in such a way that the prototype has minimum energy at the boundary. Each prototype is circularly shifted so as to be time-synchronous with the original signal. A linear phase shift is applied to each extracted prototype relative to the previously extracted prototype so as to maximize the cross-correlation between successive extracted prototypes (302). A two-dimensional prototype-evolving surface is constructed by unsampling the prototypes to every sample point (303). The two-dimensional prototype-evolving surface is re-sampled to generate a one-dimensional, synthesized signal frame with sample points defined by piecewise continuous cubic phase contour functions computed from the pitch lags and the phase shifts added to the extracted prototypes (305). A pre-selection filter may be applied to determine whether to abandon the TSWI technique in favor of another algorithm for the current frame. A post-selection performance measure may be obtained and compared with a predetermined threshold to determine whether the TSWI algorithm is performing adequately.
|
申请公布号 |
WO0030073(A1) |
申请公布日期 |
2000.05.25 |
申请号 |
WO1999US26849 |
申请日期 |
1999.11.12 |
申请人 |
QUALCOMM INCORPORATED |
发明人 |
DAS, AMITAVA;CHOY, EDDIE, L., T. |
分类号 |
G10L19/00;G10L11/04;G10L13/00;G10L19/02;G10L19/12;(IPC1-7):G10L19/02 |
主分类号 |
G10L19/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|