发明名称 Techniques for evaluation, building and/or retraining of a classification model
摘要 Techniques for evaluation and/or retraining of a classification model built using labeled training data. In some aspects, a classification model having a first set of weights is retrained by using unlabeled input to reweight the labeled training data to have a second set of weights, and by retraining the classification model using the labeled training data weighted according to the second set of weights. In some aspects, a classification model is evaluated by building a similarity model that represents similarities between unlabeled input and the labeled training data and using the similarity model to evaluate the labeled training data to identify a subset of the plurality of items of labeled training data that is more similar to the unlabeled input than a remainder of the labeled training data.
申请公布号 US9031897(B2) 申请公布日期 2015.05.12
申请号 US201213429041 申请日期 2012.03.23
申请人 Nuance Communications, Inc. 发明人 Marcheret Etienne
分类号 G06N99/00;G06N7/00 主分类号 G06N99/00
代理机构 Wolf, Greenfield & Sacks, P.C. 代理人 Wolf, Greenfield & Sacks, P.C.
主权项 1. A method for use with a first classification model that classifies an input into one of a plurality of classes, wherein the first classification model was built using labeled training data, wherein the labeled training data comprises a plurality of items of labeled training data, wherein each of the plurality of items of labeled training data is labeled with one of the plurality of classes, the method comprising acts of: obtaining unlabeled input for the first classification model; building a similarity model that represents similarities between the unlabeled input and the labeled training data; and using a programmed processor and the similarity model to evaluate the labeled training data to identify a subset of the plurality of items of labeled training data that is more similar to the unlabeled input than a remainder of the labeled training data.
地址 Burlington MA US