发明名称 Method and apparatus for detecting voice activity by using signal and noise power prediction values
摘要 A robust method and apparatus to detect voice activity based on the power level of an audio frame. The method may include performing primary active/non-active voice period determination of an input audio frame according to a power level of the audio frame, extracting a noise power prediction value and a signal power prediction value by referring to power levels of current and previous audio frames according to a primary active/non-active voice period determination value, and performing secondary active/non-active voice period determination for the input audio frame by comparing the extracted signal power prediction value with the extracted noise power prediction value.
申请公布号 US8744842(B2) 申请公布日期 2014.06.03
申请号 US20080127942 申请日期 2008.05.28
申请人 Samsung Electronics Co., Ltd. 发明人 Cho Jae-youn
分类号 G10L25/93;G10L21/00;G10L25/90 主分类号 G10L25/93
代理机构 代理人
主权项 1. A method of detecting voice activity, the method comprising: performing primary active/non-active voice period determination of an input audio frame according to a power level of a current audio frame to generate a primary active/non-active voice period determination value indicating whether the current audio frame has an active or non-active voice period; extracting a noise power prediction value and a signal power prediction value of the input audio frame by referring to power levels of current and previous audio frames according to the primary active/non-active voice period determination value; performing secondary active/non-active voice period determination of the input audio frame by comparing the extracted signal power prediction value with the extracted noise power prediction value; and filtering the secondary active/non-active voice period determination values to smooth consecutive periods between frames in which the active/non-active voice change.
地址 Suwon-si KR