发明名称 System for detecting speech with background voice estimates and noise estimates
摘要 A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a window function that passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a maximum of an output of the background voice detector and an output of the noise estimator.
申请公布号 US8457961(B2) 申请公布日期 2013.06.04
申请号 US201213566603 申请日期 2012.08.03
申请人 HETHERINGTON PHILLIP ALAN;FALLAT MARK RYAN;QNX SOFTWARE SYSTEMS LIMITED 发明人 HETHERINGTON PHILLIP ALAN;FALLAT MARK RYAN
分类号 G10L15/20;G10L15/04 主分类号 G10L15/20
代理机构 代理人
主权项
地址