发明名称 |
Voice Activity Detection (VAD) for a Coded Speech Bitstream without Decoding |
摘要 |
A system, method and computer program product are described for voice activity detection (VAD) within a digitally encoded bitstream. A parameter extraction module is configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech. A VAD classifier is configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames. |
申请公布号 |
US2015154981(A1) |
申请公布日期 |
2015.06.04 |
申请号 |
US201314094025 |
申请日期 |
2013.12.02 |
申请人 |
Nuance Communications, Inc. |
发明人 |
Barreda Daniel A.;Lainez Jose E.G.;Sharma Dushyant;Naylor Patrick |
分类号 |
G10L25/78;G10L19/00 |
主分类号 |
G10L25/78 |
代理机构 |
|
代理人 |
|
主权项 |
1. A system for voice activity detection (VAD) within a digitally encoded bitstream, the system comprising:
a parameter extraction module configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech; and a VAD classifier configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames. |
地址 |
Burlington MA US |