发明名称 Updating language understanding classifier models for a digital personal assistant based on crowd-sourcing
摘要 A method for updating language understanding classifier models includes receiving via one or more microphones of a computing device, a digital voice input from a user of the computing device. Natural language processing using the digital voice input is used to determine a user voice request. Upon determining the user voice request does not match at least one of a plurality of pre-defined voice commands in a schema definition of a digital personal assistant, a GUI of an end-user labeling tool is used to receive a user selection of at least one of the following: at least one intent of a plurality of available intents and/or at least one slot for the at least one intent. A labeled data set is generated by pairing the user voice request and the user selection, and is used to update a language understanding classifier.
申请公布号 US9508339(B2) 申请公布日期 2016.11.29
申请号 US201514611042 申请日期 2015.01.30
申请人 Microsoft Technology Licensing, LLC 发明人 Kannan Vishwac Sena;Uzelac Aleksandar;Hwang Daniel J.
分类号 G06F17/27;G10L15/06;G06F3/16;G10L15/00;G10L15/18;G10L15/22 主分类号 G06F17/27
代理机构 代理人 Choi Dan;Minhas Micky
主权项 1. A server computer, comprising: a processing unit; and memory coupled to the processing unit; the server computer configured to perform operations for updating language understanding classifier models, the operations comprising: receiving from at least one computing device of a plurality of computing devices communicatively coupled to the server computer, a first user selection of at least one of the following: at least one intent of a plurality of available intents and/or at least one slot for the at least one intent, wherein: the at least one intent is associated with at least one action used to perform at least one function of a category of functions for a domain;the at least one slot indicating a value used for performing the at least one action; andthe first user selection associated with a digital voice input received at the at least one computing device; andupon receiving from at least another computing device of the plurality of computing devices, a plurality of subsequent user selections that are identical to the first user selection and a plurality of subsequent digital voice inputs corresponding to the plurality of subsequent user selections, wherein the plurality of subsequent digital voice inputs are substantially similar to the digital voice input: generating a labeled data set by pairing the digital voice input with the first user selection;selecting a language understanding classifier from a plurality of available language understanding classifiers associated with one or more agent definitions, the selecting based at least on the at least one intent; andupdating the selected language understanding classifier based on the generated labeled data set.
地址 Redmond WA US