发明名称 PARALLEL OUTLIER DETECTION
摘要 A method, system and computer program product for detecting outliers in a set of data points. In one embodiment, the method comprises partitioning the set of data points into a plurality of bins with each of the data points assigned to a respective one of the bins. A plurality of local lists are formed in parallel identifying points in the bins as outliers, and the local lists are merged into a global list to identify one or more of the points as outliers of the data set. Embodiments of the invention provide an outlier detection system that can parallelize in two levels. The dataset is split into partitions, called bins, and outliers are found in each bin in parallel. The execution of a single bin is also parallelized. Embodiments of the invention can scale to very large datasets by these two modes of parallelism.
申请公布号 US2013024159(A1) 申请公布日期 2013.01.24
申请号 US201113186777 申请日期 2011.07.20
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION;GHOTING AMOL;TURE FERHAN 发明人 GHOTING AMOL;TURE FERHAN
分类号 G06F19/00 主分类号 G06F19/00
代理机构 代理人
主权项
地址