In an automatic speech recognition (ASR) processing system, ASR processing may be configured to process speech based on multiple channels of audio received from a beamformer. The ASR processing system may include a microphone array and the beamformer to output multiple channels of audio such that each channel isolates audio in a particular direction. The multichannel audio signals may include spoken utterances/speech from one or more speakers as well as undesired audio, such as noise from a household appliance. The ASR device may simultaneously perform speech recognition on the multi-channel audio to provide more accurate speech recognition results.
申请公布号
EP3050052(A4)
申请公布日期
2017.03.22
申请号
EP20140846906
申请日期
2014.09.17
申请人
Amazon Technologies, Inc.
发明人
BISANI, Michael Maximilian Emanuel;STROM, Nikko;HOFFMEISTER, Bjorn;THOMAS, Ryan Paul