摘要 |
A speech coder (100) computes scalar statistics (180), ensemble statistics (190), spectral parameters (150), and a normalized excitation waveform (270) which describe a frame of speech samples. The coder (100) encodes the statistics (220, 230), spectral parameters (155), and the normalized waveform (290) for later decoding and synthesis. A speech synthesizer (900) decodes the encoded scalar statistics (570), encoded ensemble statistics (560), encoded spectral parameters (490), and encoded normalized excitation waveform (550). The synthesizer (900) then denormalizes (670) the normalized excitation waveform using the scalar statistics and the ensemble statistics, resulting in a decoded excitation waveform. Speech is synthesized (710) from the decoded excitation waveform and the decoded spectral parameters.
|