Title of Invention: System And Method For High Accuracy Product Classification With Limited Supervision
Abstract: Systems and methods are disclosed herein for classifying records, such as product records, using a machine learning algorithm. After a classification model is trained according to a machine learning algorithm using an initial training set, records are classified and high-confidence classifications are identified. The remaining classifications are submitted to a crowdsourcing forum, which validates or invalidates each classification or marks it as too unclear to evaluate. Invalidated classifications are automatically analyzed to identify one or both of classification values and categories having a high proportion of invalidated classifications. Requests are transmitted to analysts to generate training data, which is added to the training set. The process of classifying records and obtaining crowdsourced validation thereof may then repeat.
Publication No.: US2014297570(A1)  Publication Date: 2014.10.02
Application No.: US201313852884  Filing Date: 2013.03.28
Applicant: WAL-MART STORES, INC.  Inventors: Garera Nikesh Lucky; Rampalli Narasimhan; Ravikant Dintyala Venkata Subrahmanya; Subramaniam Srikanth; Sun Chong; Yallin Heather Dawn
Classification: G06N99/00  Main Classification: G06N99/00
Agency:  Agent:
Principal Claim: 1. A method for classification, the method comprising: training, by a computer system, a classification model using a training data set; classifying, by the computer system, using the classification model, a record set to generate a classification outcome set, the classification outcome set including classifier-record pairings for records of the record set; submitting, by the computer system, the record set and the classification outcome set to a crowdsourcing computer network; receiving, by the computer system, a validated portion and a non-validated portion of the classification outcome set from the crowdsourcing computer network; transmitting, by the computer system, the non-validated portion to an analyst computer network; receiving, by the computer system, from the analyst computer network, additional training data relating to classifiers of the classifier-record pairings; and adding, by the computer system, the additional training data to the training data set.
Address: Bentonville AR US
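
Illustrative sketch (not part of the patent record): the abstract describes two routing decisions, namely separating high-confidence classifications from those sent to the crowd, and finding categories with a high proportion of invalidated classifications so that analysts can be asked for more training data. The Python below is a minimal sketch of those two steps only. The names `Outcome`, `split_by_confidence`, and `categories_needing_training`, the 0.9 confidence threshold, the 0.5 invalidation ratio, and the choice to ignore "unclear" verdicts when computing the ratio are all assumptions made for illustration; the record does not specify them.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Outcome:
    """One classifier-record pairing (hypothetical structure)."""
    record_id: str
    category: str       # the classification assigned to the record
    confidence: float   # model confidence in [0, 1]


def split_by_confidence(outcomes, threshold=0.9):
    """Separate high-confidence outcomes from those sent for crowd review.

    The threshold value is an assumption, not taken from the record.
    """
    high = [o for o in outcomes if o.confidence >= threshold]
    rest = [o for o in outcomes if o.confidence < threshold]
    return high, rest


def categories_needing_training(verdicts, min_invalid_ratio=0.5):
    """Return categories whose share of invalidated outcomes is high.

    `verdicts` is a list of (Outcome, verdict) pairs, where verdict is
    "valid", "invalid", or "unclear". Outcomes marked "unclear" are
    skipped here, which is an assumption of this sketch.
    """
    total = Counter()
    invalid = Counter()
    for outcome, verdict in verdicts:
        if verdict == "unclear":
            continue
        total[outcome.category] += 1
        if verdict == "invalid":
            invalid[outcome.category] += 1
    return sorted(
        c for c in total if invalid[c] / total[c] >= min_invalid_ratio
    )


if __name__ == "__main__":
    outcomes = [
        Outcome("r1", "shoes", 0.95),
        Outcome("r2", "shoes", 0.60),
        Outcome("r3", "toys", 0.50),
        Outcome("r4", "toys", 0.40),
    ]
    high, rest = split_by_confidence(outcomes)
    # Stand-in for crowd responses; real verdicts would come from reviewers.
    crowd = [(rest[0], "valid"), (rest[1], "invalid"), (rest[2], "invalid")]
    print(categories_needing_training(crowd))
```

In a full loop, the categories returned by `categories_needing_training` would drive the requests sent to analysts, and the training data they supply would be added to the training set before the model is retrained and the cycle repeats.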