发明名称 |
System for detecting speech with background voice estimates and noise estimates |
摘要 |
A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a window function that passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a maximum of an output of the background voice detector and an output of the noise estimator.
|
申请公布号 |
US8457961(B2) |
申请公布日期 |
2013.06.04 |
申请号 |
US201213566603 |
申请日期 |
2012.08.03 |
申请人 |
HETHERINGTON PHILLIP ALAN;FALLAT MARK RYAN;QNX SOFTWARE SYSTEMS LIMITED |
发明人 |
HETHERINGTON PHILLIP ALAN;FALLAT MARK RYAN |
分类号 |
G10L15/20;G10L15/04 |
主分类号 |
G10L15/20 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|