发明名称 |
LOW LATENCY AND MEMORY EFFICIENT KEYWORK SPOTTING |
摘要 |
Features are disclosed for spotting keywords in utterance audio data without requiring the entire utterance to first be processed. Likelihoods that a portion of the utterance audio data corresponds to the keyword may be compared to likelihoods that the portion corresponds to background audio (e.g., general speech and/or non-speech sounds). The difference in the likelihoods may be determined, and keyword may be triggered when the difference exceeds a threshold, or shortly thereafter. Traceback information and other data may be stored during the process so that a second speech processing pass may be performed. For efficient management of system memory, traceback information may only be stored for those frames that may encompass a keyword; the traceback information for older frames may be overwritten by traceback information for newer frames. |
申请公布号 |
US2017098442(A1) |
申请公布日期 |
2017.04.06 |
申请号 |
US201615207183 |
申请日期 |
2016.07.11 |
申请人 |
Amazon Technologies, Inc. |
发明人 |
Hoffmeister Bjorn |
分类号 |
G10L15/02;G10L15/22;G10L15/14 |
主分类号 |
G10L15/02 |
代理机构 |
|
代理人 |
|
主权项 |
1. A system comprising:
a computer-readable memory storing executable instructions; and one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least:
obtain a sequence of feature vectors, wherein the sequence of feature vectors represents at least a portion of a stream of audio data;generate a keyword score based at least partly on a likelihood that a particular feature vector of the sequence of feature vectors represents audio data corresponding to a keyword;generate a background score based at least partly on a likelihood that the particular feature vector represents audio data corresponding to background audio;determine that a difference between the keyword score and the background score is greater than differences associated with feature vectors preceding the particular feature vector in a subset of the sequence of feature vectors, wherein the particular feature vector is in a center of the subset;determine that the difference is greater than differences associated with feature vectors subsequent to the particular feature vector in the subset; andgenerate data indicating the particular feature vector corresponds to an end of the keyword. |
地址 |
Seattle WA US |