发明名称 Reduction of background noise for speech enhancement
摘要 Properties of human audio perception are used to perform spectral and time masking to reduce perceived loudness of noise added to the speech signal. A signal is divided temporally into blocks which are then passed through notch filters to remove narrow frequency band components of the noise. Each block is then appended to part of the previous block in a manner which avoids block boundary discontinuities. An FFT is then performed on the resulting larger block, after which the spectral components of the signal are fed to a background noise estimator. Each frequency component of the signal is analyzed with respect to the background noise to determine, within various confidence levels, whether it is pure noise or a noise-and-signal combination. The frequency band's gain function is determined, based on the confidence levels. A spectral valley finder detects and fills in spectral valleys in the frequency component gain function, after which the function is used to modify the magnitude components of the FFT. An inverse FFT then maps the signal back from the frequency domain to the time domain to give a frame of noise-reduced signal. This signal is then multiplied by a temporal window and joined to the previous frame's signal to derive the output.
申请公布号 US5550924(A) 申请公布日期 1996.08.27
申请号 US19950402550 申请日期 1995.03.13
申请人 PICTURETEL CORPORATION 发明人 HELF, BRANT M.;CHU, PETER L.
分类号 G10L13/00;G10L21/02;H03H17/02;H04B1/10;H04B15/00;H04S1/00;(IPC1-7):H04B15/00 主分类号 G10L13/00
代理机构 代理人
主权项
地址