发明名称 Data Mining Unlearnable Data Sets
摘要 This invention concerns data mining, that is the extraction of information, from "unlearnable" data sets. In particular it concerns apparatus and a method for this purpose. The invention involves creating a finite training sample from the data set ( 14 ). Then training ( 50 ) a learning device ( 32 ) using a supervised learning algorithm to predict labels for each item of the training sample. Then processing other data from the data set with the trained learning device to predict labels and determining whether the predicted labels are better (learnable) or worse (anti-learnable) than random guessing ( 52 ). And, using a reverser ( 34 ) to apply negative weighting to the predicted labels if it is worse (anti-learnable) ( 54 ).
申请公布号 US2008027886(A1) 申请公布日期 2008.01.31
申请号 US20050572193 申请日期 2005.07.18
申请人 发明人 KOWALCZYK ADAM;SMOLA ALEX;ONG CHENG S.;CHAPELLE OLIVIER
分类号 G06G7/00 主分类号 G06G7/00
代理机构 代理人
主权项
地址