发明名称 Voice Activity Detection (VAD) for a Coded Speech Bitstream without Decoding
摘要 A system, method and computer program product are described for voice activity detection (VAD) within a digitally encoded bitstream. A parameter extraction module is configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech. A VAD classifier is configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.
申请公布号 US2015154981(A1) 申请公布日期 2015.06.04
申请号 US201314094025 申请日期 2013.12.02
申请人 Nuance Communications, Inc. 发明人 Barreda Daniel A.;Lainez Jose E.G.;Sharma Dushyant;Naylor Patrick
分类号 G10L25/78;G10L19/00 主分类号 G10L25/78
代理机构 代理人
主权项 1. A system for voice activity detection (VAD) within a digitally encoded bitstream, the system comprising: a parameter extraction module configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech; and a VAD classifier configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.
地址 Burlington MA US
您可能感兴趣的专利