发明名称 System for detecting speech with background voice estimates and noise estimates
摘要 A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a digital converter that converts a time-varying input signal into a digital-domain signal. A window function passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range when multiplied by an output of the digital converter. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a criterion based on an output of the background voice detector and an output of the noise estimator.
申请公布号 US8311819(B2) 申请公布日期 2012.11.13
申请号 US20080079376 申请日期 2008.03.26
申请人 HETHERINGTON PHILLIP A.;FALLAT MARK;QNX SOFTWARE SYSTEMS LIMITED 发明人 HETHERINGTON PHILLIP A.;FALLAT MARK
分类号 G10L15/20;G10L15/04 主分类号 G10L15/20
代理机构 代理人
主权项
地址