Abstract |
The present invention provides a pitch estimation method that uses perception-based analysis-by-synthesis to improve pitch estimation across a variety of input speech conditions. First, pitch candidates are generated for a plurality of sub-ranges within a pitch search range. Next, a residual spectrum is determined for a segment of speech, and a reference speech signal is generated from the residual spectrum using sinusoidal synthesis and linear predictive coding (LPC) synthesis. A synthetic speech signal is then generated for each of the pitch candidates, likewise using sinusoidal and LPC synthesis. Finally, the synthetic speech signal for each pitch candidate is compared with the reference speech signal, and the optimal pitch estimate is taken as the pitch period of the synthetic speech signal that provides the maximum signal-to-noise ratio.
|
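The steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation, and several details are assumptions the abstract does not specify: one candidate per sub-range is taken as the autocorrelation peak in that sub-range, each candidate's synthetic signal is built by sampling the residual spectrum at the candidate's harmonics, and a plain (not perceptually weighted) SNR is used. All function names, including `estimate_pitch` and the synthetic test signal `demo_frame`, are hypothetical.

```python
import cmath
import math

def lpc(frame, order):
    """LPC coefficients [1, a1..ap] via autocorrelation and Levinson-Durbin."""
    n = len(frame)
    r = [sum(frame[i] * frame[i + k] for i in range(n - k)) for k in range(order + 1)]
    r[0] = r[0] * (1 + 1e-9) + 1e-12  # assumption: tiny regularization for stability
    a, err = [1.0] + [0.0] * order, r[0]
    for i in range(1, order + 1):
        k = -(r[i] + sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1 - k * k
    return a

def inverse_filter(x, a):
    """Residual e[n] = sum_j a[j] * x[n-j], with zero history before the frame."""
    return [sum(a[j] * x[n - j] for j in range(len(a)) if n - j >= 0) for n in range(len(x))]

def synthesis_filter(e, a):
    """LPC synthesis (all-pole filter 1/A(z)) with zero initial state."""
    y = []
    for n, en in enumerate(e):
        y.append(en - sum(a[j] * y[n - j] for j in range(1, len(a)) if n - j >= 0))
    return y

def sinusoidal_synthesis(residual, freqs):
    """Sum of sinusoids whose amplitude and phase are read from the residual
    spectrum at the given normalized frequencies (cycles per sample)."""
    n = len(residual)
    out = [0.0] * n
    for f in freqs:
        spec = sum(residual[t] * cmath.exp(-2j * math.pi * f * t) for t in range(n))
        scale = 1.0 if f in (0.0, 0.5) else 2.0  # DC and Nyquist are not doubled
        amp, phase = scale * abs(spec) / n, cmath.phase(spec)
        for t in range(n):
            out[t] += amp * math.cos(2 * math.pi * f * t + phase)
    return out

def snr_db(ref, test):
    sig = sum(v * v for v in ref)
    noise = sum((u - v) ** 2 for u, v in zip(ref, test))
    return 10 * math.log10(max(sig, 1e-12) / max(noise, 1e-12))

def pitch_candidates(frame, lo, hi, n_sub):
    """Assumption: one candidate per sub-range of [lo, hi), chosen as the lag
    with the largest autocorrelation inside that sub-range."""
    n = len(frame)
    def corr(lag):
        return sum(frame[i] * frame[i + lag] for i in range(n - lag))
    edges = [lo + (hi - lo) * i // n_sub for i in range(n_sub + 1)]
    return [max(range(edges[i], edges[i + 1]), key=corr) for i in range(n_sub)]

def estimate_pitch(frame, lo, hi, n_sub, order=10):
    """Return (pitch period in samples, SNR in dB) of the best candidate."""
    n = len(frame)
    a = lpc(frame, order)
    residual = inverse_filter(frame, a)
    # Reference: sinusoidal synthesis from all DFT bins, then LPC synthesis.
    all_bins = [k / n for k in range(n // 2 + 1)]
    ref = synthesis_filter(sinusoidal_synthesis(residual, all_bins), a)
    best = None
    for period in pitch_candidates(frame, lo, hi, n_sub):
        # Candidate: harmonics of 1/period strictly below the Nyquist frequency.
        harmonics = [k / period for k in range(1, period) if k / period < 0.5]
        synth = synthesis_filter(sinusoidal_synthesis(residual, harmonics), a)
        score = snr_db(ref, synth)
        if best is None or score > best[1]:
            best = (period, score)
    return best

def demo_frame(period=40, n=240):
    """Hypothetical test signal: band-limited pulse train through a two-pole resonator."""
    exc = [sum(math.cos(2 * math.pi * k * t / period) for k in range(1, period // 2))
           for t in range(n)]
    r, theta = 0.9, math.pi / 4
    return synthesis_filter(exc, [1.0, -2 * r * math.cos(theta), r * r])
```

For example, `estimate_pitch(demo_frame(period=40), lo=30, hi=60, n_sub=3)` searches three sub-ranges of pitch periods (in samples). The search range in this sketch is deliberately kept narrower than one octave: with an unweighted SNR, a candidate at twice the true period contains every harmonic of the true pitch and can score at least as well, which is one reason a practical system would need the perceptual weighting and candidate logic that the sketch omits.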