发明名称 Resource-light method and apparatus for outlier detection
摘要 Outlier detection methods and apparatus have light computational resources requirement, especially on the storage requirement, and yet achieve a state-of-the-art predictive performance. The outlier detection problem is first reduced to that of a classification learning problem, and then selective sampling based on uncertainty of prediction is applied to further reduce the amount of data required for data analysis, resulting in enhanced predictive performance. The reduction to classification essentially consists in using the unlabeled normal data as positive examples, and randomly generated synthesized examples as negative examples. Application of selective sampling makes use of an underlying, arbitrary classification learning algorithm, the data labeled by the above procedure, and proceeds iteratively. Each iteration consisting of selection of a smaller sub-sample from the input data, training of the underlying classification algorithm with the selected data, and storing the classifier output by the classification algorithm. The selection is done by essentially choosing examples that are harder to classify with the classifiers obtained in the preceding iterations. The final output hypothesis is a voting function of the classifiers obtained in the iterations of the above procedure.
申请公布号 US8006157(B2) 申请公布日期 2011.08.23
申请号 US20070863704 申请日期 2007.09.28
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 ABE NAOKI;LANGFORD JOHN
分类号 H04L1/00;G06F11/00;G06F11/30;G08C25/00;H03M13/00 主分类号 H04L1/00
代理机构 代理人
主权项
地址