摘要 |
This invention concerns data mining, that is the extraction of information, from "unlearnable" data sets. In particular it concerns apparatus and a method for this purpose. The invention involves creating a finite training sample from the data set ( 14 ). Then training ( 50 ) a learning device ( 32 ) using a supervised learning algorithm to predict labels for each item of the training sample. Then processing other data from the data set with the trained learning device to predict labels and determining whether the predicted labels are better (learnable) or worse (anti-learnable) than random guessing ( 52 ). And, using a reverser ( 34 ) to apply negative weighting to the predicted labels if it is worse (anti-learnable) ( 54 ).
|