Title THRESHOLD ADAPTATION IN TWO-CHANNEL NOISE ESTIMATION AND VOICE ACTIVITY DETECTION
Abstract A method for adapting a threshold used in multi-channel audio voice activity detection. Strengths of primary and secondary sound pick up channels are computed. A separation, being a measure of difference between the strengths of the primary and secondary channels, is also computed. An analysis of the peaks in separation is performed, e.g., using a leaky peak capture function that captures a peak in the separation and then decays over time, or using a sliding window min-max detector. A threshold that is to be used in a voice activity detection (VAD) process is adjusted in accordance with the analysis of the peaks. Other embodiments are also described and claimed.
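The leaky peak capture named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent text: the function name `leaky_peak_capture`, the linear decay in dB per frame, and the default decay rate are assumptions chosen only to show the capture-then-decay behaviour.

```python
def leaky_peak_capture(separation_db, decay_db_per_frame=0.05):
    """Track peaks in a per-frame separation sequence (values in dB).

    Hypothetical sketch: the held peak leaks downward by a fixed amount
    each frame (an assumed decay law) and is recaptured whenever the
    incoming separation exceeds it.
    """
    peak = float("-inf")  # nothing captured yet
    tracked = []
    for s in separation_db:
        # Decay the held value first, then capture a new peak if the input is higher.
        peak = max(s, peak - decay_db_per_frame)
        tracked.append(peak)
    return tracked


# Usage: a 10 dB peak is captured, then decays by 1 dB per frame.
print(leaky_peak_capture([0.0, 10.0, 2.0, 2.0], decay_db_per_frame=1.0))
# prints [0.0, 10.0, 9.0, 8.0]
```

A VAD threshold could then be placed some margin below the tracked peak; how the threshold is actually derived from the peak analysis is not stated in this record.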
Publication No. US2015221322(A1) Publication Date 2015.08.06
Application No. US201414170136 Filing Date 2014.01.31
Applicant Apple Inc. Inventors Iyengar Vasu;Lindahl Aram M.
Classification G10L25/84;G10L21/0208 Main Classification G10L25/84
Agency Agent
Principal Claim 1. A method for adapting a threshold used in multi-channel audio noise estimation, comprising: computing strength of a primary sound pick up channel; computing strength of a secondary sound pick up channel; computing separation versus time, being a measure of difference between the strengths of the primary and secondary channels; analyzing a plurality of peaks in the separation versus time; and adjusting a threshold that is to be used in an audio noise estimation process in accordance with the analysis of the peaks.
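The steps recited in the claim can be sketched end to end as follows. This is an illustrative assumption, not the claimed implementation: strength is taken as per-frame mean-square power in dB, separation as the dB difference, the peak analysis as a sliding-window min-max, and the threshold as a point a fixed fraction of the way between the windowed minimum and maximum. The names `frame_strength_db` and `adapt_threshold` and the parameters `window` and `fraction` are hypothetical.

```python
import math
from collections import deque


def frame_strength_db(frame, eps=1e-12):
    """Mean-square power of one frame of samples, in dB (assumed strength measure)."""
    power = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(power + eps)  # eps avoids log10(0) on silent frames


def adapt_threshold(primary_frames, secondary_frames, window=50, fraction=0.5):
    """Return one adapted threshold (dB) per frame pair.

    Hypothetical rule: threshold = min + fraction * (max - min), where min and
    max are taken over the last `window` separation values.
    """
    recent = deque(maxlen=window)  # sliding window of separation values
    thresholds = []
    for p, s in zip(primary_frames, secondary_frames):
        # Separation: difference between primary and secondary strengths.
        separation = frame_strength_db(p) - frame_strength_db(s)
        recent.append(separation)
        lo, hi = min(recent), max(recent)
        thresholds.append(lo + fraction * (hi - lo))
    return thresholds


# Usage: the second primary frame is 20 dB stronger than the secondary one.
primary = [[1.0, 1.0], [10.0, 10.0]]
secondary = [[1.0, 1.0], [1.0, 1.0]]
print(adapt_threshold(primary, secondary))
```

With these inputs the separation goes from 0 dB to about 20 dB, so the second threshold lands near 10 dB, halfway between the windowed extremes.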
Address Cupertino CA US