Abstract |
A method of recognizing speech pauses in a speech signal, even when a slowly varying noise signal is superposed on it. Short-time mean values, each an approximate measure of the average power of one of the successive sections of the disturbed signal, are determined from the short-time Fourier coefficients of the disturbed speech signal. The sequence of short-time mean values is then smoothed by a linear digital filter or a median filter. An estimate of the noise signal power, averaged over a few seconds, is also obtained from the sequence of short-time mean values. A speech pause is signaled when the smoothed short-time mean value (output of GL) falls more than once to a threshold that is proportional to the estimated noise power (output of PA). |
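The processing chain described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation; several details are assumptions made only for illustration: the frame length, the median filter width, the use of a trailing minimum as the noise power estimator (the abstract does not say how the estimate is obtained), the proportionality factor, and the reading of "more than once" as at least two consecutive frames at or below the threshold. All function names, including the `synthetic_signal` helper, are hypothetical.

```python
import cmath
import math
import random
from statistics import median


def frame_power(frame):
    """Mean power of one frame, computed from its DFT coefficients.

    By Parseval's theorem, sum(|X_k|^2) / N^2 equals the mean of x^2.
    """
    n = len(frame)
    total = 0.0
    for k in range(n):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame))
        total += abs(coeff) ** 2
    return total / (n * n)


def short_time_means(signal, frame_len=32):
    """Short-time mean values for successive, non-overlapping sections."""
    last_start = len(signal) - frame_len
    return [frame_power(signal[i:i + frame_len])
            for i in range(0, last_start + 1, frame_len)]


def median_smooth(values, width=5):
    """Median filter (the abstract also allows a linear digital filter)."""
    half = width // 2
    return [median(values[max(0, i - half):i + half + 1])
            for i in range(len(values))]


def estimate_noise_power(values, window=50):
    """ASSUMED estimator: minimum over a trailing window of frames.

    The abstract only says the estimate is obtained from the short-time
    mean values and covers a few seconds; the minimum is one common choice.
    """
    return [min(values[max(0, i - window + 1):i + 1])
            for i in range(len(values))]


def detect_pauses(signal, frame_len=32, factor=4.0, min_hits=2):
    """Flag frames as pause when the smoothed mean is at or below
    factor * noise estimate for at least min_hits consecutive frames
    (an assumed reading of "more than once")."""
    means = short_time_means(signal, frame_len)
    smoothed = median_smooth(means)
    noise = estimate_noise_power(means)
    flags, run = [], 0
    for s, n in zip(smoothed, noise):
        run = run + 1 if s <= factor * n else 0
        flags.append(run >= min_hits)
    return flags


def synthetic_signal(frame_len=32, seed=0):
    """Hypothetical test input: 40 noise frames, 20 'speech' frames
    (sine plus noise), then 40 noise frames."""
    rng = random.Random(seed)
    signal = []
    for frame in range(100):
        speech = 40 <= frame < 60
        for i in range(frame_len):
            tone = math.sin(2 * math.pi * 4 * i / frame_len) if speech else 0.0
            signal.append(tone + rng.gauss(0.0, 0.1))
    return signal
```

Calling `detect_pauses(synthetic_signal())` returns one boolean per frame. Because the threshold is tied to the running noise estimate rather than fixed, it follows a slowly drifting noise level, which is the property the abstract emphasizes.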