Title APPLICATION FOCUS IN SPEECH-BASED SYSTEMS
Abstract A speech-based system includes an audio device in a user premises and a network-based service that supports use of the audio device by multiple applications. The audio device may be directed to play audio content such as music, audio books, etc. The audio device may also be directed to interact with a user through speech. The network-based service monitors event messages received from the audio device to determine which of the multiple applications currently has speech focus. When receiving speech from a user, the service first offers the corresponding meaning to the application, if any, that currently has primary speech focus. If there is no application that currently has primary speech focus, or if the application having primary speech focus is not able to respond to the meaning, the service then offers the meaning to the application that currently has secondary speech focus.
Publication number US2016180853(A1) Publication date 2016.06.23
Application number US201414578056 Filing date 2014.12.19
Applicant Amazon Technologies, Inc. Inventors VanLund Peter Spalding;Piersol Kurt Wesley;Meyers James David;Simpson Jacob Michael;Gundeti Vikram Kumar;Thomas David Robert;Miles Andrew Christopher
Classification G10L17/22 Main classification G10L17/22
Agency Agent
Independent claim 1. A system, comprising: a command service configured to: communicate with multiple applications, communicate with an audio device, and send a command to the audio device to perform an activity for an audio application that provides audio content to be played by the audio device, wherein the command specifies an application identifier corresponding to the audio application; control logic configured to perform acts comprising: receiving an event message from the audio device regarding sound played by the audio device, wherein the event message specifies the application identifier corresponding to the audio application; if the event message indicates that the sound played by the audio device is part of a speech interaction with a user, designating the audio application as being primarily active; if the event message indicates that the sound played by the audio device is not part of a speech interaction with a user, designating the audio application as being secondarily active; a speech recognition service configured to receive an audio signal from the audio device and to recognize user speech in the audio signal; a language understanding service configured to determine a meaning of the user speech; the control logic being configured to perform further actions comprising: if there is a primarily active application among the multiple applications, requesting that the primarily active application respond to the user speech by (a) performing a first action that is indicated at least in part by the meaning of the user speech or (b) generating a first speech response to the user speech; and if there is no primarily active application among the multiple applications and if there is a secondarily active application among the multiple applications, requesting that the secondarily active application respond to the user speech by (a) performing a second action that is indicated at least in part by the meaning of the user speech or (b) generating a second speech response to the user speech.
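The focus logic recited in the claim and abstract can be illustrated with a minimal Python sketch. It is not the implementation described in the patent: the names `EventMessage`, `FocusController`, `on_event`, and `route`, the boolean `is_speech_interaction` field, and the convention that a handler returns `None` when it cannot respond are all assumptions made for illustration. The sketch also omits the command service, speech recognition, and language understanding, and treats the "meaning" as a plain string.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# Hypothetical handler type: takes a meaning, returns a response,
# or None if the application cannot respond to that meaning.
Handler = Callable[[str], Optional[str]]


@dataclass
class EventMessage:
    """Assumed shape of an event message sent by the audio device."""
    app_id: str                   # application identifier from the command
    is_speech_interaction: bool   # True if the sound is part of a speech interaction


class FocusController:
    """Tracks which application is primarily or secondarily active."""

    def __init__(self) -> None:
        self.apps: Dict[str, Handler] = {}
        self.primary: Optional[str] = None
        self.secondary: Optional[str] = None

    def register(self, app_id: str, handler: Handler) -> None:
        self.apps[app_id] = handler

    def on_event(self, event: EventMessage) -> None:
        # Sound that is part of a speech interaction gives primary focus;
        # any other sound (for example music) gives secondary focus.
        if event.is_speech_interaction:
            self.primary = event.app_id
        else:
            self.secondary = event.app_id

    def route(self, meaning: str) -> Optional[str]:
        # Offer the meaning to the primarily active application first.
        if self.primary is not None:
            response = self.apps[self.primary](meaning)
            if response is not None:
                return response
        # No primary application, or it could not respond:
        # offer the meaning to the secondarily active application.
        if self.secondary is not None and self.secondary != self.primary:
            return self.apps[self.secondary](meaning)
        return None
```

In this sketch, a music application that has only played audio content holds secondary focus, so "pause" still reaches it when no application holds primary focus or when the application holding primary focus declines the meaning.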
Address Seattle WA US