Title: Natural human-computer interaction for virtual personal assistant systems
Abstract: Technologies for natural language interactions with virtual personal assistant systems include a computing device configured to capture audio input, distort the audio input to produce a number of distorted audio variations, and perform speech recognition on the audio input and the distorted audio variations. The computing device selects a result from a large number of potential speech recognition results based on contextual information. The computing device may measure a user's engagement level by using an eye tracking sensor to determine whether the user is visually focused on an avatar rendered by the virtual personal assistant. The avatar may be rendered in a disengaged state, a ready state, or an engaged state based on the user engagement level. The avatar may be rendered as semitransparent in the disengaged state, and the transparency may be reduced in the ready state or the engaged state. Other embodiments are described and claimed.
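The avatar behaviour described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the three states are told apart, so the dwell-time thresholds, the opacity values, and every name below (`AvatarState`, `avatar_state`, `avatar_alpha`, `READY_DWELL_S`, `ENGAGED_DWELL_S`) are assumptions made for illustration only, not the patented method.

```python
from enum import Enum


class AvatarState(Enum):
    DISENGAGED = "disengaged"
    READY = "ready"
    ENGAGED = "engaged"


# Hypothetical thresholds: seconds the user's gaze has rested on the avatar.
READY_DWELL_S = 0.2
ENGAGED_DWELL_S = 1.0

# Hypothetical opacities (1.0 = fully opaque). The abstract only states that
# the avatar is semitransparent when disengaged and less transparent otherwise.
ALPHA = {
    AvatarState.DISENGAGED: 0.3,
    AvatarState.READY: 0.7,
    AvatarState.ENGAGED: 1.0,
}


def avatar_state(gaze_dwell_s: float) -> AvatarState:
    """Map continuous gaze dwell time on the avatar to a rendering state."""
    if gaze_dwell_s >= ENGAGED_DWELL_S:
        return AvatarState.ENGAGED
    if gaze_dwell_s >= READY_DWELL_S:
        return AvatarState.READY
    return AvatarState.DISENGAGED


def avatar_alpha(gaze_dwell_s: float) -> float:
    """Opacity to render the avatar with for the given dwell time."""
    return ALPHA[avatar_state(gaze_dwell_s)]
```

In this sketch an eye tracker would supply `gaze_dwell_s`; a real system could derive the engagement level from other signals as well.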
Publication No.: US9607612(B2)  Publication date: 2017.03.28
Application No.: US201314129435  Filing date: 2013.05.20
Applicant: Intel Corporation  Inventor: Deleeuw William C.
Classification: G10L15/20;G10L15/02;G06K9/00;G06K9/20;G06F3/01;G06T13/80;G10L15/22;G10L21/003;G10L15/30  Main classification: G10L15/20
Agency: Barnes & Thornburg LLP  Agent: Barnes & Thornburg LLP
Main claim: 1. A computing device for speech recognition, the computing device comprising: a processor; an audio sensor; an audio input module to: capture audio input using the audio sensor; and distort, by the processor, a waveform of the audio input to produce a plurality of distorted audio variations, wherein to distort the waveform comprises to adjust a temporal duration of the waveform; and a speech recognition module to: perform speech recognition on the audio input and each of the distorted audio variations to produce a plurality of speech recognition results; and select, by the processor, a result from the speech recognition results based on contextual information.
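The flow recited in claim 1 (distort the waveform by adjusting its duration, recognize the original and each variation, then pick a result using context) might be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the stretch factors, the linear-interpolation resampling, the word-overlap context score, and the `recognizer` callable are all hypothetical stand-ins, since the record names no specific distortion algorithm, speech engine, or selection rule.

```python
from typing import Callable, Iterable, List, Sequence


def stretch(waveform: Sequence[float], factor: float) -> List[float]:
    """Change the duration by `factor` using linear-interpolation resampling.

    Assumption: plain resampling is used for brevity; it also shifts pitch,
    which a production time-stretch algorithm would typically avoid.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    n = len(waveform)
    if n < 2:
        return list(waveform)
    out_len = max(2, round(n * factor))
    out = []
    for i in range(out_len):
        pos = i * (n - 1) / (out_len - 1)  # fractional index into the input
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        out.append(waveform[lo] * (1 - frac) + waveform[hi] * frac)
    return out


def recognize_all(
    waveform: Sequence[float],
    recognizer: Callable[[Sequence[float]], str],
    factors: Iterable[float] = (0.8, 0.9, 1.1, 1.2),  # hypothetical values
) -> List[str]:
    """Run the recognizer on the original audio and on each variation."""
    candidates = [list(waveform)] + [stretch(waveform, f) for f in factors]
    return [recognizer(c) for c in candidates]


def select_by_context(results: Sequence[str], context_words: Iterable[str]) -> str:
    """Pick the result sharing the most words with the context (toy score)."""
    context = {w.lower() for w in context_words}

    def score(text: str) -> int:
        return sum(1 for w in text.lower().split() if w in context)

    return max(results, key=score)  # ties resolve to the earliest result
```

A caller would pass its own speech engine as `recognizer` and supply context words from, for example, the application currently in focus; both choices are assumptions of this sketch.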
Address: Santa Clara CA US