SYSTEM AND METHOD FOR PROCESSING MULTI-MODAL DEVICE INTERACTIONS IN A NATURAL LANGUAGE VOICE SERVICES ENVIRONMENT,申请号US201414278645-传众专利搜索

发明名称	SYSTEM AND METHOD FOR PROCESSING MULTI-MODAL DEVICE INTERACTIONS IN A NATURAL LANGUAGE VOICE SERVICES ENVIRONMENT
摘要	A system and method for processing multi-modal device interactions in a natural language voice services environment may be provided. In particular, one or more multi-modal device interactions may be received in a natural language voice services environment that includes one or more electronic devices. The multi-modal device interactions may include a non-voice interaction with at least one of the electronic devices or an application associated therewith, and may further include a natural language utterance relating to the non-voice interaction. Context relating to the non-voice interaction and the natural language utterance may be extracted and combined to determine an intent of the multi-modal device interaction, and a request may then be routed to one or more of the electronic devices based on the determined intent of the multi-modal device interaction.
申请公布号	US2014249822(A1)	申请公布日期	2014.09.04
申请号	US201414278645	申请日期	2014.05.15
申请人	VOICEBOX TECHNOLOGIES CORPORATION	发明人	BALDWIN LARRY;WEIDER CHRIS
分类号	G10L15/18;G10L15/22	主分类号	G10L15/18
代理机构		代理人
主权项	1. A method for processing one or more multi-modal device interactions in a natural language voice services environment that includes one or more electronic devices, comprising: detecting at least one multi-modal device interaction, wherein the multi-modal device interaction includes a non-voice interaction with at least one of the electronic devices or an application associated with at least one of the electronic devices, and wherein the multi-modal device interaction further includes at least one natural language utterance relating to the non-voice interaction; extracting context information relating to the multi-modal device interaction, wherein the extracted context information includes context relating to the non-voice interaction, and wherein the extracted context information further include context relating to the natural language utterance; combining the context relating to the non-voice interaction and the context relating to the natural language utterance; determining an intent of the multi-modal device interaction based on the combined context relating to the non-voice interaction and the natural language utterance; and routing at least one request to one or more of the electronic devices based on the determined intent of the multi-modal device interaction.
地址	BELLEVUE WA US