发明名称 System, device and method for processing interlaced multimodal user input
摘要 A device, method and system are provided for interpreting and executing operations based on multimodal input received at a computing device. The multimodal input can include one or more verbal and non-verbal inputs, such as a combination of speech and gesture inputs received substantially concurrently via suitable user interface means provided on the computing device. One or more target objects is identified from the non-verbal input, and text is recognized from the verbal input. An interaction object is generated using the recognized text and identified target objects, and thus comprises a natural language expression with embedded target objects. The interaction object is then processed to identify one or more operations to be executed.
申请公布号 US9601113(B2) 申请公布日期 2017.03.21
申请号 US201314241399 申请日期 2013.05.15
申请人 XTREME INTERACTIONS INC. 发明人 Anandarajah Joe
分类号 G10L15/22;G10L15/18;G06F3/16;G06F3/038;G06F3/0481;G06F3/0484;G06F3/0488;G10L17/22;G10L15/19 主分类号 G10L15/22
代理机构 Westerman, Hattori, Daniels & Adrian, LLP 代理人 Westerman, Hattori, Daniels & Adrian, LLP
主权项 1. A method implemented at a computing device, the method comprising: receiving verbal input using a verbal input interface of the computing device; receiving, concurrently with at least part of the verbal input, at least one secondary input using a non-verbal input interface of the computing device, the non-verbal input interface being selected from the group of: a kinetic input interface, an inertial input interface, a perceptual input interface, a touch input interface, a graphical user interface, and a sensor input interface; identifying one or more target objects from the at least one secondary input; recognizing text from the received verbal input; generating an interaction object, the interaction object comprising a natural language expression having references to the one or more identified target objects identified from the at least one secondary input, the references being embedded within the recognized text, the generating of the interaction object comprising identifying at least one attribute associated with each of the one or more identified target objects or at least one operation associated with each of the one or more identified target objects; processing the interaction object to identify at least one operation to be executed on at least one of the one or more identified target objects; and executing the operation on the at least one of the one or more identified target objects.
地址 Toronto CA