发明名称 Voice activity identiftication for speaker tracking in a packet based conferencing system with distributed processing
摘要 A distributed conferencing system has a plurality of conferencing nodes to connect groups of participants to a conference. Each of the conferencing nodes provides for the connection of one or more participants to the conference. Each node includes a DSP for distributed signal processing. The node DSP includes: A signal measuring device for measuring features of the signals from each of the participants such as power, zero crossing rate and short term energy. The nodes include voice activity determination and a communication device for communicating the measured signal characteristics for a plurality of participant input signals to all other conferencing nodes. Muting means for muting individual participant input signals so that only selected signals are transmitted over the conference bus to the other participants. The voice activity detection utilizes a state machine with three states, voice state, transition state and noise state, dependant upon the measured energy level, zero crossing rate and other features of the signals. A high threshold and a low energy threshold; zero crossing rates; average energies; energy level means and variances and other features are used in differentiating voice and noise. The state machine will not move directly from voice to noise state but will move to a transition state first, to reduce the likelihood of missclassification of a weak voice signal as noise and to avoid frequent clipping which can be caused if the state machine moves to noise state during brief pauses in voice.
申请公布号 US7020257(B2) 申请公布日期 2006.03.28
申请号 US20020123483 申请日期 2002.04.17
申请人 TEXAS INSTRUMENTS INCORPORATED 发明人 LI DUNLING
分类号 G10L21/02;H04M3/56;H04M7/00 主分类号 G10L21/02
代理机构 代理人
主权项
地址