Title of Invention |
VOICE ACTIVITY DETECTION SYSTEM, METHOD, AND PROGRAM PRODUCT |
Abstract |
A voice activity detection method for low-SNR environments. Voice activity detection is performed by extracting a long-term spectrum variation component and a harmonic structure as feature vectors from a speech signal, and by increasing the difference in feature vectors between speech and non-speech using either (i) the long-term spectrum variation component feature alone or (ii) both long-term spectrum variation component extraction and harmonic structure feature extraction. The correct rate and the accuracy rate of the voice activity detection are improved over conventional methods by using a long-term spectrum variation component whose window length exceeds the average phoneme duration of an utterance in the speech signal. The voice activity detection system and method provide speech processing, automatic speech recognition, and speech output capable of very accurate voice activity detection.
|
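The long-term spectrum variation component named in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: it assumes the component is approximated by a regression-style delta of the log power spectrum taken over a long window. The function names, the 16 kHz sampling rate, the 25 ms frame, the 10 ms hop, and the 21-frame (about 210 ms) window are all illustrative assumptions chosen so that the window is longer than a typical phoneme. The harmonic structure feature is not covered here.

```python
import numpy as np


def log_power_spectrogram(signal, frame_len=400, hop=160):
    """Return log power spectra, shape (n_frames, frame_len // 2 + 1).

    frame_len=400 and hop=160 correspond to 25 ms / 10 ms at 16 kHz
    (assumed values, not taken from the patent).
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop: i * hop + frame_len] * window for i in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    return np.log(power + 1e-10)  # small floor avoids log(0)


def long_term_delta(features, half_width=10):
    """Regression slope of each feature over 2 * half_width + 1 frames.

    With a 10 ms hop, half_width=10 spans about 210 ms (hypothetical
    choice intended to exceed an average phoneme duration).
    """
    n_frames = features.shape[0]
    # Repeat the edge frames so every frame has a full window.
    padded = np.pad(features, ((half_width, half_width), (0, 0)), mode="edge")
    denom = 2.0 * sum(k * k for k in range(1, half_width + 1))
    delta = np.zeros(features.shape, dtype=float)
    for k in range(1, half_width + 1):
        ahead = padded[half_width + k: half_width + k + n_frames]
        behind = padded[half_width - k: half_width - k + n_frames]
        delta += k * (ahead - behind)
    return delta / denom


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    noise = rng.standard_normal(16000)  # one second of stand-in audio
    spec = log_power_spectrogram(noise)
    variation = long_term_delta(spec, half_width=10)
    print(spec.shape, variation.shape)
```

A detector built on this sketch would pass `variation`, optionally together with a harmonic structure feature, to a speech/non-speech classifier. The classifier itself is outside the scope of the sketch.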
Publication Number |
US2009222258(A1) |
Publication Date |
2009.09.03 |
Application Number |
US20090394631 |
Application Date |
2009.02.27 |
Applicant |
FUKUDA TAKASHI;ICHIKAWA OSAMU;NISHIMURA MASAFUMI |
Inventor |
FUKUDA TAKASHI;ICHIKAWA OSAMU;NISHIMURA MASAFUMI |
Classification Number |
G10L19/02 |
Main Classification Number |
G10L19/02 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|