发明名称 Efficient dereverberation in networked audio systems
摘要 Features are disclosed for performing efficient dereverberation of speech signals captured with single- and multi-channel sensors in networked audio systems. Such features could be used in applications requiring automatic recognition of speech captured with sensors. Dereverberation is performed in the sub-band domain, and hence provides improved dereverberation performance in terms of signal quality, algorithmic delay, computational efficiency, and speed of convergence.
申请公布号 US9390723(B1) 申请公布日期 2016.07.12
申请号 US201414568033 申请日期 2014.12.11
申请人 Amazon Technologies, Inc. 发明人 McDonough, Jr. John Walter;Chu Wai Chung;Chhetri Amit Singh;Ayrapetian Robert
分类号 H04R3/00;G10L21/02;G10K11/175 主分类号 H04R3/00
代理机构 Knobbe Martens Olson & Bear LLP 代理人 Knobbe Martens Olson & Bear LLP
主权项 1. A device for reducing reverberation in an audio signal, the device comprising: computer-readable memory storing executable instructions; one or more physical computer processors in communication with the computer-readable memory, wherein the one or more physical computer processors are programmed by the executable instructions to at least: receive an input audio signal;determine a first sub-band sample from the input audio signal, wherein the first sub-band sample corresponds to a first capture time range and a first frequency band, the first capture time range identifying a first period of time during which the first sub-band sample was captured;obtain first dereverberation weights corresponding to the first frequency band;determine a first dereverberated sub-band sample using the first dereverberation weights, the first sub-band sample, and a first plurality of sub-band samples corresponding to a period of time of capture preceding the first capture time range, wherein the first dereverberated sub-band sample corresponds to the first frequency band and the first capture time range, and the first plurality of sub-band samples includes samples having frequencies included in the first frequency band;generate a first dereverberated output audio sample using the first dereverberated sub-band sample;determine a second sub-band sample from the input audio signal, wherein the second sub-band sample corresponds to a second capture time range and the first frequency band, the second capture time range identifying a second period of time of capture occurring after the first capture time range;obtain a first Cholesky factor of a first matrix corresponding to the first dereverberation weights;generate a second Cholesky factor of a second matrix using the second sub-band sample, the first Cholesky factor, and a second plurality of sub-band samples corresponding to a third period of time of capture preceding the second capture time range and including samples having frequencies included in the first frequency band;generate second dereverberation weights using the second Cholesky factor;generate a second dereverberated sub-band sample using the second dereverberation weights, the second sub-band sample, and the second plurality of sub-band samples, wherein the second dereverberated sub-band sample corresponds to the first frequency band and the second capture time range; andgenerate a second dereverberated output audio sample using the second dereverberated sub-band sample.
地址 Seattle WA US