发明名称 Voice application architecture
摘要 A voice-based system may comprise a local speech interface device and a remote control service. A user may interact with the system using speech to obtain services and perform functions. The system may allow a user to install applications to provide enhanced or customized functionality. Such applications may be installed on either the speech interface device or the control service. The control service receives user speech and determines user intent based on the speech. If an application installed on the control service can respond to the intent, that application is called. Otherwise, the intent is provided to the speech interface device which responds by invoking one of its applications to respond to the intent.
申请公布号 US9548066(B2) 申请公布日期 2017.01.17
申请号 US201414456620 申请日期 2014.08.11
申请人 Amazon Technologies, Inc. 发明人 Jain Vikas;Mutagi Rohan;Carbon Peter Paul Henri
分类号 G06F17/27;G10L25/48;G10L15/22;G06F17/28;G10L15/30;G10L15/26;G10L15/18 主分类号 G06F17/27
代理机构 Lee & Hayes, PLLC 代理人 Lee & Hayes, PLLC
主权项 1. A system comprising: one or more server computers; one or more server applications that have been selected by a user for execution on the one or more server computers, wherein the one or more server applications operate in conjunction with a speech interface device located in premises of the user to provide services for the user; a speech processing component configured to receive, from the speech interface device, an audio signal that represents user speech, wherein the user speech expresses a user intent, the speech processing component being further configured to perform automatic speech recognition on the audio signal to identify the user speech and to perform natural language understanding on the user speech to determine the user intent; and an intent router configured to perform acts comprising: identifying a first server application of the one or more server applications corresponding to the user intent;providing a first indication to the first server application to invoke an action corresponding to the user intent;providing a second indication of the user intent to the speech interface device, wherein the speech interface device is responsive to the user intent to perform the action corresponding to the user intent;receiving, at the one or more server computers, a confirmation from the speech interface device that at least one of (i) the speech interface device will perform the action in response to the user intent or (ii) the speech interface device has performed the action in response to the user intent; andproviding a third indication, based at least in part on receiving the confirmation, to the first server application to cancel responding to the user intent.
地址 Seattle WA US