Title of invention: Multi-microphone audio source separation based on combined statistical angle distributions
Abstract: Systems, methods, and computer media for separating audio sources in a multi-microphone system are provided. A plurality of audio sample groups can be received. Each audio sample group comprises at least two samples of audio information captured by different microphones during a sample group time interval. For each audio sample group, an angle between an audio source and the multi-microphone system can be estimated based on a phase difference of the samples in the group. The estimated angle can be modeled as a combined statistical distribution that is a mixture of a target audio signal statistical distribution and a noise component statistical distribution. The combined statistical distribution can be analyzed to characterize each sample group as either target audio signal or noise. The target audio signal can then be resynthesized from the samples identified as part of the target audio signal.
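The angle-estimation step in the abstract can be illustrated with a minimal sketch. It is not taken from the patent: it assumes two microphones, a far-field source, a speed of sound of 343 m/s, and an angle measured from the broadside direction (perpendicular to the line joining the microphones). The function name `estimate_angle` and its parameters are hypothetical.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def estimate_angle(x1, x2, freq_hz, mic_spacing_m):
    """Estimate the source angle (radians) from one time-frequency bin.

    x1, x2: complex spectral coefficients of the same bin from microphone 1
    and microphone 2. Assumes microphone 2 hears the source later than
    microphone 1 when the angle is positive (hypothetical sign convention).
    """
    # Phase difference between the two microphones, wrapped to (-pi, pi].
    phase_diff = cmath.phase(x1 * x2.conjugate())
    # Far-field model: phase_diff = 2*pi*f*d*sin(theta)/c, solved for sin(theta).
    sin_theta = SPEED_OF_SOUND * phase_diff / (2.0 * math.pi * freq_hz * mic_spacing_m)
    # Clamp so that noisy bins never push asin outside its domain.
    sin_theta = max(-1.0, min(1.0, sin_theta))
    return math.asin(sin_theta)
```

The estimate is only unambiguous while the true phase difference stays within ±π, which in this model requires the microphone spacing to be at most half a wavelength at the frequency of interest.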
Publication number: US9131295(B2) Publication date: 2015.09.08
Application number: US201213569092 Filing date: 2012.08.07
Applicant: Microsoft Technology Licensing, LLC Inventors: Kim Chanwoo;Khawand Charbel
Classification: H04R3/00;G10L21/0272;H04R27/00 Main classification: H04R3/00
Agency: Agent: Webster Bryan;Drakos Kate;Minhas Micky
Principal claim: 1. One or more computer-readable memory or storage devices storing instructions that, when executed by a computing device having a processor, perform a method of separating audio sources in a multi-microphone system, the method comprising: receiving audio sample groups, with an audio sample group comprising at least two samples of audio information, the at least two samples captured by different microphones during a sample group time interval; and for a plurality of audio sample groups: estimating, for the corresponding sample group time interval, an angle between a first reference line extending from an audio source to the multi-microphone system and a second reference line extending through the multi-microphone system, the estimated angle being based on a phase difference between the at least two samples in the audio sample group; modeling the estimated angle as a combined statistical distribution, the combined statistical distribution being a mixture of a target audio signal statistical distribution and a noise component statistical distribution; and determining whether the audio sample group is part of a target audio signal or a noise component based at least in part on the combined statistical distribution.
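The modeling and determining steps of the claim can be sketched as a two-component mixture fitted to the estimated angles. The claim does not name the component distributions or the fitting procedure, so the Gaussian components, the expectation-maximization loop, the initial values, and the names `fit_mixture` and `is_target` below are all assumptions made for illustration only.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution (assumed component shape)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def fit_mixture(angles, target_mean=0.0, iterations=50):
    """Fit a two-component Gaussian mixture to angles (radians) with EM.

    Component 0 is treated as the target and starts near target_mean;
    component 1 is treated as noise and starts broad (hypothetical choices).
    """
    weights = [0.5, 0.5]
    means = [target_mean, sum(angles) / len(angles)]
    stds = [0.2, 1.0]
    for _ in range(iterations):
        # E-step: responsibility of each component for each angle.
        resp = []
        for a in angles:
            p = [weights[k] * gaussian_pdf(a, means[k], stds[k]) for k in range(2)]
            total = sum(p) or 1e-300  # guard against numerical underflow
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weight, mean, and spread of each component.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # component received no mass; keep old parameters
            weights[k] = nk / len(angles)
            means[k] = sum(r[k] * a for r, a in zip(resp, angles)) / nk
            var = sum(r[k] * (a - means[k]) ** 2 for r, a in zip(resp, angles)) / nk
            stds[k] = max(math.sqrt(var), 1e-3)  # floor keeps the density finite
    return weights, means, stds


def is_target(angle, weights, means, stds):
    """True if the target component is at least as likely as the noise one."""
    p_target = weights[0] * gaussian_pdf(angle, means[0], stds[0])
    p_noise = weights[1] * gaussian_pdf(angle, means[1], stds[1])
    return p_target >= p_noise
```

In this sketch each sample group would be passed through `is_target` after fitting, and only groups classified as target would be kept for resynthesis. A different component family or a fixed decision threshold could be substituted without changing the overall structure.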
Address: Redmond WA US