发明名称 ARBITRATION BETWEEN VOICE-ENABLED DEVICES
摘要 Architectures and techniques for selecting a voice-enabled device to handle audio input that is detected by multiple voice-enabled devices are described herein. In some instances, multiple voice-enabled devices may detect audio input from a user at substantially the same time, due to the voice-enabled devices being located within proximity to the user. The architectures and techniques may analyze a variety of audio signal metric values for the voice-enabled devices to designate a voice-enabled device to handle the audio input.
申请公布号 US2017076720(A1) 申请公布日期 2017.03.16
申请号 US201514852022 申请日期 2015.09.11
申请人 Amazon Technologies, Inc. 发明人 Gopalan Ramya;Sundaram Shiva Kumar
分类号 G10L15/22;G06F3/16 主分类号 G10L15/22
代理机构 代理人
主权项 1. A method comprising: determining, by a computing device, that a first voice-enabled device and a second voice-enabled device received audio input at substantially a same time; receiving, by the computing device and from the first voice-enabled device, a first audio signal metric value indicating a signal-to-noise ratio associated with a first beamformed audio signal, the first beamformed audio signal having been determined, at the first voice-enabled device, for the audio input received at the first voice-enabled device, the first beamformed audio signal being determined for a direction relative to the first voice-enabled device; receiving, by the computing device and from the second voice-enabled device, a second audio signal metric value indicating a signal-to-noise ratio associated with a second beamformed audio signal, the second beamformed audio signal having been determined, at the second voice-enabled device, for the audio input received at the second voice-enabled device, the second beamformed audio signal being determined for a direction relative to the second voice-enabled device; determining, by the computing device, that the signal-to-noise ratio associated with the first beamformed audio signal is greater than the signal-to-noise ratio associated with the second beamformed audio signal; processing, by the computing device, the first beamformed audio signal using one or more speech recognition techniques; performing, by the computing device, a task associated with the audio input; and sending, by the computing device, an instruction to the first voice-enabled device, the instruction requesting that the first voice-enabled device output an indication that the task has been completed.
地址 Seattle WA US