Title of invention: Coding, modification and synthesis of speech segments
Abstract: The invention relates to a method for speech signal analysis, modification and synthesis comprising a phase for the location of analysis windows by means of an iterative process for the determination of the phase of the first sinusoidal component and comparison between the phase value of said component and a predetermined value, a phase for the selection of analysis frames corresponding to an allophone and readjustment of the duration and the fundamental frequency according to certain thresholds, and a phase for the generation of synthetic speech from synthesis frames, taking the information of the closest analysis frame as spectral information of the synthesis frame and taking as many synthesis frames as periods that the synthetic signal has. The method allows a coherent location of the analysis windows within the periods of the signal and the exact generation of the synthesis instants in a manner synchronous with the fundamental period.
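The readjustment and synthesis-frame steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a constant target fundamental frequency, a uniform duration scaling factor, and an absolute threshold test as one possible reading of "certain thresholds". The names `readjust`, `synthesis_frames`, `target_f0` and `duration_factor` are hypothetical and do not come from the record.

```python
def readjust(original, target, threshold):
    """Impose the target value only when it differs from the original by
    more than the threshold (assumed reading of the threshold rule)."""
    return target if abs(target - original) > threshold else original


def synthesis_frames(analysis_times, target_f0, duration_factor):
    """Return (synthesis_instant, analysis_frame_index) pairs.

    One synthesis frame is generated per period of the synthetic signal,
    and each one borrows its spectral information from the analysis frame
    closest in time on the original time axis.
    """
    period = 1.0 / target_f0                      # assumed constant target F0
    start = analysis_times[0] * duration_factor
    end = analysis_times[-1] * duration_factor
    frames = []
    k = 0
    while start + k * period <= end:
        t = start + k * period                    # pitch-synchronous instant
        source_time = t / duration_factor         # map back to original time
        nearest = min(
            range(len(analysis_times)),
            key=lambda i: abs(analysis_times[i] - source_time),
        )
        frames.append((t, nearest))
        k += 1
    return frames
```

For example, four analysis frames spaced 10 ms apart, stretched by a factor of 1.5 and resynthesized at 125 Hz, yield six synthesis frames, some of which reuse the same analysis frame.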
Publication number: US8812324(B2)
Publication date: 2014.08.19
Application number: US201013254479
Filing date: 2010.12.21
Applicant: Telefonica, S.A.
Inventors: Rodriguez Crespo Miguel Angel; Escalada Sardina Jose Gregorio; Armenta Lopez de Vicuna Ana
Classification: G10L13/00
Main classification: G10L13/00
Agency: Katten Muchin Rosenman LLP
Agent: Katten Muchin Rosenman LLP
Main claim: 1. Method for speech signal analysis, modification and synthesis comprising:
a. a phase for the location of analysis windows by means of an iterative process for the determination of the phase of the first sinusoidal component of the signal and comparison between the phase value of said component and a predetermined value until finding a position for which the phase difference represents a time shift less than half a speech sample,
b. a phase for the selection of analysis frames corresponding to an allophone and readjustment of the duration and the fundamental frequency according to a model, such that if the difference between the original duration or the original fundamental frequency and those which are to be imposed exceeds certain thresholds, the duration and the fundamental frequency are adjusted to generate synthesis frames,
c. a phase for the generation of synthetic speech from synthesis frames, taking the information of the closest analysis frame as spectral information of the synthesis frame and taking as many synthesis frames as periods that the synthetic signal has.
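Step (a) of the claim can be illustrated with a short Python sketch of an iterative window search. This is an illustration under stated assumptions, not the method as implemented by the applicant. It assumes the fundamental frequency `f0` is already known, uses a Hann-weighted window of about two periods, moves the window in whole samples, and takes zero as the predetermined phase value. The function names and parameters are hypothetical.

```python
import cmath
import math


def first_component_phase(signal, center, f0, fs, half_len):
    """Phase of the sinusoidal component at f0, measured relative to the
    window center, using a Hann-weighted projection onto that frequency."""
    acc = 0j
    for k in range(center - half_len, center + half_len + 1):
        if 0 <= k < len(signal):
            d = k - center
            w = 0.5 + 0.5 * math.cos(math.pi * d / (half_len + 1))  # Hann weight
            acc += w * signal[k] * cmath.exp(-2j * math.pi * f0 * d / fs)
    return cmath.phase(acc)


def locate_window(signal, start, f0, fs, target_phase=0.0, max_iter=20):
    """Move the window center until the phase difference from target_phase
    corresponds to a time shift of less than half a sample."""
    half_len = int(round(fs / f0))  # assumed window: about two periods
    center = start
    for _ in range(max_iter):
        phase = first_component_phase(signal, center, f0, fs, half_len)
        # Wrap the phase difference into [-pi, pi).
        diff = (phase - target_phase + math.pi) % (2 * math.pi) - math.pi
        # Convert the phase difference into a shift in samples.
        shift = -diff * fs / (2 * math.pi * f0)
        if abs(shift) < 0.5:
            return center
        center += int(round(shift))
    return center
```

Because the center moves in whole samples, the loop can alternate between two neighbouring positions when the ideal position lies almost exactly halfway between them; `max_iter` bounds the search in that case.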
Address: Madrid, ES