摘要 |
<p>There is disclosed a method of improving audibility of speech in a multi-channel audio 5 signal, comprising: comparing a first characteristic and a second characteristic of the multi-channel audio signal to generate an attenuation factor, wherein the first characteristic corresponds to a first channel of the multi-channel audio signal that contains speech audio and non-speech audio, wherein the first characteristic corresponds to a first power spectrum of a signal in the first channel, wherein the 1o second characteristic corresponds to a second channel of the multi-channel audio signal that contains predominantly non-speech audio, and wherein the second characteristic corresponds to a second power spectrum of a signal in the second channel, wherein comparing the first characteristic and the second characteristic comprises: performing intelligibility prediction based on the first power spectrum and the second power 1s spectrum to generate a predicted intelligibility; adjusting a gain applied to the second power spectrum until the predicted intelligibility meets a criterion; and using the gain, having been adjusted, as the attenuation factor once the predicted intelligibility meets the criterion; adjusting the attenuation factor according to a speech likelihood value to generate an adjusted attenuation factor; and attenuating the second channel using the 20 adjusted attenuation factor.</p> |