Title of invention Energy post qualification for phrase spotting
Abstract In one embodiment, a computing device can detect an utterance of a target phrase within an acoustic input signal. The computing device can further determine a first estimate of cumulative signal and noise energy for the detected utterance in the acoustic input signal with respect to a first time period spanning the duration of the detected utterance, and a second estimate of noise energy in the acoustic input signal with respect to a second time period preceding (or following) the first time period. The computing device can then calculate a signal-to-noise ratio (SNR) for the detected utterance based on the first and second estimates and can reject the detected utterance if the SNR is below an SNR threshold.
Publication number US9548065(B2) Publication date 2017.01.17
Application number US201414269678 Filing date 2014.05.05
Applicant Sensory, Incorporated Inventors Vermeulen Pieter J.;Hosom John-Paul
Classification G10L15/00;G10L25/15;G10L19/00;G10L15/08;G10L15/22;G10L25/87;G10L25/21 Main classification G10L15/00
Agency Fountainhead Law Group P.C. Agent Fountainhead Law Group P.C.
Main claim 1. A method comprising: detecting, by a phrase spotter running on a computing device, an utterance of a target phrase within an acoustic input signal, the detecting comprising applying one or more phrase spotting algorithms to the acoustic input signal; determining, by the phrase spotter, a first estimate of cumulative signal and noise energy for the detected utterance in the acoustic input signal, the first estimate being determined with respect to a first time period spanning a start time and an end time of the detected utterance; determining, by the phrase spotter, a second estimate of noise energy in the acoustic input signal, the second estimate being determined with respect to a second time period that precedes or follows the first time period; calculating, by the phrase spotter, a signal-to-noise ratio (SNR) for the detected utterance based on the first estimate and the second estimate; if the SNR is below an SNR threshold, rejecting, by the phrase spotter, the detected utterance as being an incorrect spot of the target phrase; and if the SNR is not below the SNR threshold: accepting, by the phrase spotter, the detected utterance as being a correct spot of the target phrase; and causing the phrase spotter or another speech recognizer to identify and process a verbal command spoken after the detected utterance.
Address Santa Clara CA US
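
Illustrative sketch (not part of the patent record) The SNR post-qualification step described in the abstract and main claim could be sketched in Python roughly as follows. The record does not say how the two energy estimates are combined, so this sketch makes several assumptions: energy is taken as mean squared amplitude per sample, the noise window is the one preceding the utterance, the noise estimate is subtracted from the cumulative estimate to approximate speech energy, and the SNR is expressed in decibels. The function names, the `floor` guard, and the 6 dB default threshold are hypothetical and chosen only for illustration.

```python
import math
from typing import Sequence


def mean_energy(samples: Sequence[float]) -> float:
    """Mean squared amplitude per sample (assumed energy measure)."""
    if not samples:
        raise ValueError("segment is empty")
    return sum(s * s for s in samples) / len(samples)


def utterance_snr_db(signal: Sequence[float], start: int, end: int,
                     noise_len: int, floor: float = 1e-12) -> float:
    """Estimate the SNR in dB for a detected utterance at signal[start:end].

    The noise estimate uses up to noise_len samples immediately before
    the utterance (the "second time period" preceding the first).
    """
    if not (0 <= start < end <= len(signal)):
        raise ValueError("invalid utterance boundaries")
    noise_start = max(0, start - noise_len)
    if noise_start == start:
        raise ValueError("no samples available before the utterance")

    # First estimate: cumulative signal-plus-noise energy over the utterance.
    signal_plus_noise = mean_energy(signal[start:end])
    # Second estimate: noise energy over the preceding window.
    noise = max(mean_energy(signal[noise_start:start]), floor)
    # Assumption: speech energy is approximately the difference of the two.
    speech = max(signal_plus_noise - noise, floor)
    return 10.0 * math.log10(speech / noise)


def qualify_spot(signal: Sequence[float], start: int, end: int,
                 noise_len: int, snr_threshold_db: float = 6.0) -> bool:
    """Return True to accept the spot, False to reject it as a false alarm."""
    snr_db = utterance_snr_db(signal, start, end, noise_len)
    return snr_db >= snr_threshold_db


if __name__ == "__main__":
    background = [0.1, -0.1] * 50   # low-level noise before the utterance
    utterance = [1.0, -1.0] * 50    # much louder detected segment
    audio = background + utterance
    print(qualify_spot(audio, start=100, end=200, noise_len=100))
```

In this sketch, an accepted spot would be the point at which a caller hands control to a command recognizer, as the claim's final step describes. A spot whose energy is no higher than the surrounding noise falls far below the threshold and is rejected.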