摘要 |
The present invention provides a method of pitch estimation which utilizes perception based analysis by synthesis for improved pitch estimation over a variety of input speech conditions. Initially, pitch candidates are generated corresponding to a plurality of sub-ranges within a pitch search range (item 2). Then a residual spectrum is determined for a segment of speech (item 4) and a reference speech signal is generated from the residual spectrum using sinusoidal synthesis (item 8) and linear predictive coding (LPC) synthesis (item 9). A synthetic speech signal is generated for each of the pitch candidates using sinusoidal (item 12) and LPC synthesis (item 13). Finally, the synthetic speech signal for each pitch candidate is compared with the reference residual signal (item 14) to determine an optimal pitch estimate based on a pitch period of a synthetic speech signal that provides a maximum signal to noise ratio.
|