发明名称 |
System for detecting speech with background voice estimates and noise estimates |
摘要 |
A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a digital converter that converts a time-varying input signal into a digital-domain signal. A window function passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range when multiplied by an output of the digital converter. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a criterion based on an output of the background voice detector and an output of the noise estimator. |
申请公布号 |
US8311819(B2) |
申请公布日期 |
2012.11.13 |
申请号 |
US20080079376 |
申请日期 |
2008.03.26 |
申请人 |
HETHERINGTON PHILLIP A.;FALLAT MARK;QNX SOFTWARE SYSTEMS LIMITED |
发明人 |
HETHERINGTON PHILLIP A.;FALLAT MARK |
分类号 |
G10L15/20;G10L15/04 |
主分类号 |
G10L15/20 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|