摘要 |
A representation of an audio signal having a first frame, a second frame following the first frame, and a third frame following the second frame, is derived by estimating first warp information for the first and the second frame and second warp information for the second frame and the third frame, the warp information describing a pitch information of the audio signal. First spectral coefficients for the first and the second frame are derived using the first warp information and a first weighted representation of the first and the second frame, the first weighted representation derived by applying a first window function to the first and the second frames, wherein the first window function depends on the first warp information. Second spectral coefficients for the second and the third frame are derived using the second warp information and a second weighted representation of the second and the third frame, the second weighted representation derived by applying a second window function to the second and the third frames, wherein the second window function depends on the second warp information. The representation of the audio signal is generated including the first and the second spectral coefficients.
|