发明名称 Adaptive speech filter for attenuation of ambient noise
摘要 According to a preferred aspect of the instant invention, there is provided a system and method that allows the user to attenuate ambient noise in speech recordings in the audio part of a video recording. The user does not need to define particular sections or samples or individual parameters. The system is automatically analyzing the input signal and in a plurality of individual steps detects the ambient noise, determines an adaptive filter, implements the filter and therewith attenuates the ambient noise accordingly.
申请公布号 US9269370(B2) 申请公布日期 2016.02.23
申请号 US201414569134 申请日期 2014.12.12
申请人 MAGIX AG 发明人 Herberger Tilman;Tost Titus;Flemming Georg
分类号 G10L21/02;G10L21/0208 主分类号 G10L21/02
代理机构 Fellers, Snider, Blakenship, Bailey & Tippens P.C. 代理人 Fellers, Snider, Blakenship, Bailey & Tippens P.C. ;Watt Terry L.
主权项 1. A method of enhancing a speech signal in the presence of noise, comprising: performing, by computer processing hardware, operations of: a. reading an audio signal containing said speech signal therein; b. transforming said audio signal to the frequency domain, thereby forming a transformed audio signal; c. determining via a recursive spectral analysis a plurality of spectral components in the frequency domain that have a most energy; d. identifying at least one null point in the time domain associated with each of said plurality of spectral components; e. determining a gradient of each of said null points; f. determining a variance of each of said determined gradients; g. analyzing the variance of each of said determined gradients to assign each of said determined gradients to a category, wherein said gradient with a high variance is classified as noise, wherein said gradient with a middle variance is classified as part of a tonal part of said speech signal, and wherein said gradient with a low variance is classified as a tonal component not a part of said speech signal; h. determining whether the plurality spectral components with the most energy belong to a harmonic series, wherein frequencies of the plurality spectral components with the most energy are a multiple of a base frequency; i. calculating a transfer function using said analysis of each variance and said determination of belonging to harmonic series of said plurality of spectral components with the most energy; j. applying said transfer function to said transformed audio signal, thereby forming a filtered audio signal; k. inverse transforming said filtered audio signal, thereby forming an enhanced speech signal.
地址 DE