Title of invention |
THRESHOLD ADAPTATION IN TWO-CHANNEL NOISE ESTIMATION AND VOICE ACTIVITY DETECTION |
Abstract |
A method for adapting a threshold used in multi-channel audio voice activity detection. Strengths of primary and secondary sound pick up channels are computed. A separation, being a measure of difference between the strengths of the primary and secondary channels, is also computed. An analysis of the peaks in separation is performed, e.g. using a leaky peak capture function that captures a peak in the separation and then decays over time, or using a sliding window min-max detector. A threshold that is to be used in a voice activity detection (VAD) process is adjusted, in accordance with the analysis of the peaks. Other embodiments are also described and claimed. |
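The two peak-analysis options named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the linear per-sample decay in dB, and the window length are all assumptions chosen for illustration.

```python
from collections import deque


def leaky_peak_capture(separation, decay=0.5):
    """Track peaks in a separation sequence (dB).

    The tracker jumps up to any new peak and otherwise leaks downward
    by `decay` dB per sample (hypothetical decay law).
    """
    tracked = []
    peak = float("-inf")
    for s in separation:
        # Let the held peak decay, then capture the new value if it is higher.
        peak = max(s, peak - decay)
        tracked.append(peak)
    return tracked


def sliding_min_max(separation, window=4):
    """Return the (min, max) of the most recent `window` samples at each step."""
    recent = deque(maxlen=window)  # old samples fall out automatically
    extremes = []
    for s in separation:
        recent.append(s)
        extremes.append((min(recent), max(recent)))
    return extremes


# Illustrative separation values in dB (made up, not from the patent).
separation_db = [2.0, 10.0, 3.0, 1.0, 8.0, 2.0]
peaks = leaky_peak_capture(separation_db, decay=1.0)
extremes = sliding_min_max(separation_db, window=3)
```

With `decay=1.0`, the tracker holds the 10 dB peak and lets it fall by 1 dB per sample until a larger value arrives; the sliding window instead forgets the peak entirely once it is more than `window` samples old.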
Publication number |
US2015221322(A1) |
Publication date |
2015.08.06 |
Application number |
US201414170136 |
Filing date |
2014.01.31 |
Applicant |
Apple Inc. |
Inventors |
Iyengar Vasu; Lindahl Aram M. |
Classification |
G10L25/84;G10L21/0208 |
Main classification |
G10L25/84 |
Agency |
|
Agent |
|
Main claim |
1. A method for adapting a threshold used in multi-channel audio noise estimation, comprising:
computing strength of a primary sound pick up channel;
computing strength of a secondary sound pick up channel;
computing separation versus time, being a measure of difference between the strengths of the primary and secondary channels;
analyzing a plurality of peaks in the separation versus time; and
adjusting a threshold that is to be used in an audio noise estimation process in accordance with the analysis of the peaks. |
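The five steps of claim 1 can be read as a small processing pipeline. The sketch below is one possible reading, not the claimed method itself: the claim does not say how strength is measured, how peaks are analyzed, or how the threshold is derived from them. Frame power in dB, local-maximum peak picking, and the `fraction` and `floor_db` parameters are all assumptions.

```python
import math


def frame_strength_db(frame):
    """Mean power of one frame in dB (a small floor avoids log of zero)."""
    power = sum(x * x for x in frame) / len(frame)
    return 10.0 * math.log10(power + 1e-12)


def adapt_threshold(primary_frames, secondary_frames, fraction=0.5, floor_db=3.0):
    """Return (threshold_db, separation) for two equally framed channels."""
    # Steps 1-3: per-frame strengths and their difference (separation versus time).
    separation = [
        frame_strength_db(p) - frame_strength_db(s)
        for p, s in zip(primary_frames, secondary_frames)
    ]
    # Step 4: collect local maxima of the separation track (assumed peak analysis).
    peaks = [
        separation[i]
        for i in range(1, len(separation) - 1)
        if separation[i - 1] < separation[i] >= separation[i + 1]
    ]
    if not peaks:
        return floor_db, separation
    # Step 5: place the threshold at a fraction of the average peak (assumed rule).
    threshold = max(floor_db, fraction * sum(peaks) / len(peaks))
    return threshold, separation


# Synthetic frames: the primary channel alternates between quiet and loud,
# while the secondary channel stays at a constant level.
primary = [[a] * 4 for a in (1.0, 10.0, 1.0, 10.0, 1.0)]
secondary = [[1.0] * 4 for _ in range(5)]
threshold_db, separation_db = adapt_threshold(primary, secondary)
```

In this synthetic case the separation peaks sit near 20 dB, so the adapted threshold lands near 10 dB. A downstream noise estimator could then treat frames whose separation exceeds the threshold as speech-dominated, which is again an assumption about how the threshold is consumed.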
Address |
Cupertino CA US |