发明名称 HOTWORD DETECTION ON MULTIPLE DEVICES
摘要 Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.
申请公布号 US2016104480(A1) 申请公布日期 2016.04.14
申请号 US201514675932 申请日期 2015.04.01
申请人 Google Inc. 发明人 Sharifi Matthew
分类号 G10L15/08;G10L17/22 主分类号 G10L15/08
代理机构 代理人
主权项 1. A computer-implemented method comprising: receiving, by a first computing device, audio data that corresponds to an utterance; before beginning automated speech recognition processing on the audio data, processing the audio data using a classifier that classifies audio data as including a particular hotword or as not including the particular hotword; determining, based on the processing of the audio data using the classifier that classifies audio data as including a particular hotword or as not including the particular hotword, a first value that reflects a first likelihood that the utterance includes the particular hotword; receiving a second value that reflects a second likelihood that the utterance includes the particular hotword, as determined by a second computing device; comparing the first value that reflects the first likelihood that the utterance includes the particular hotword and the second value that reflects the second likelihood that the utterance includes the particular hotword; and based on comparing the first value that reflects the first likelihood that the utterance includes the particular hotword to the second value that reflects the second likelihood that the utterance includes the particular hotword, determining whether to begin performing automated speech recognition processing on the audio data.
地址 Mountain View CA US