发明名称 Architecture for multi-domain utterance processing
摘要 Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
申请公布号 US9070366(B1) 申请公布日期 2015.06.30
申请号 US201213720909 申请日期 2012.12.19
申请人 Amazon Technologies, Inc. 发明人 Mathias Lambert;Shi Ying;Kiss Imre Attila;Thomas Ryan Paul;Deramat Frederic Johan Georges
分类号 G10L15/00;G10L15/18;G10L15/04 主分类号 G10L15/00
代理机构 Knobbe, Martens, Olson & Bear, LLP 代理人 Knobbe, Martens, Olson & Bear, LLP
主权项 1. A system comprising: a computer-readable memory storing executable instructions; and one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least: receive data regarding an utterance of a user;generate a transcription of the utterance using automatic speech recognition;process the transcription with a first natural language understanding (“NLU”) module to produce a first plurality of interpretations of a requested action in the transcription, wherein the first NLU module is associated with a first domain of actions, and wherein at least a first interpretation of the first plurality of interpretations is associated with a first score indicative of whether the first interpretation corresponds to the requested action in the transcription;process the transcription with a second NLU module to produce a second plurality of interpretations of the requested action in the transcription, wherein the second NLU module is associated with a second domain of actions, and wherein at least a second interpretation of the second plurality of interpretations is associated with a second score indicative of whether the second interpretation corresponds to the requested action in the transcription;select, from the first plurality of interpretations or the second plurality of interpretations, a selected interpretation based at least in part on a score associated with the selected interpretation, wherein the score corresponds to one of the first score or the second score; andgenerate a response based at least partly on the selected interpretation.
地址 Seattle WA US