Title of Invention |
IDENTIFYING OUTLIERS IN A LARGE SET OF OBJECTS |
Abstract |
Described herein are various technologies pertaining to identifying global outlier candidates from a relatively large collection of computer-readable objects in a distributed computing environment. The collection of computer-readable objects is partitioned into a plurality of sets of objects, and local outlier candidates are identified from each set of objects in the plurality of sets of objects. The local outlier candidates are updated through a hierarchical pairwise similarity analysis until global outlier candidates are identified. Thereafter, a pairwise similarity analysis is undertaken with respect to the global outlier candidates and the sets of objects in the plurality of sets of objects to identify true global outliers. |
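The pipeline in the abstract (partition, find local candidates, merge candidates hierarchically in pairs, then verify against every partition) can be illustrated with a minimal single-process sketch. The abstract does not define what makes an object an outlier, so this sketch assumes a distance-based rule: an object is an outlier if fewer than `k` other objects lie within `radius` of it. The function names, the `(id, point)` object layout, and the round-robin partitioning are all hypothetical choices for illustration, not details taken from the patent.

```python
import math


def count_neighbors(obj, pool, radius):
    """Count objects in pool (other than obj itself) within radius of obj."""
    obj_id, point = obj
    return sum(
        1
        for other_id, other_point in pool
        if other_id != obj_id and math.dist(point, other_point) <= radius
    )


def prune(candidates, pool, radius, k):
    """Keep only candidates with fewer than k neighbors in pool.

    Dropping a candidate is safe under the assumed rule: it already has
    at least k real neighbors, so it cannot be a global outlier.
    """
    return [c for c in candidates if count_neighbors(c, pool, radius) < k]


def find_global_outliers(objects, n_parts, radius, k):
    """objects: list of (id, point) tuples; returns the verified outliers."""
    # 1. Partition the collection (round-robin split, an assumption).
    parts = [objects[i::n_parts] for i in range(n_parts)]

    # 2. Local outlier candidates: compare each object only within its own set.
    level = [prune(part, part, radius, k) for part in parts]

    # 3. Hierarchical pairwise merge: combine candidate sets two at a time
    #    and re-check the candidates against each other.
    while len(level) > 1:
        merged = []
        for i in range(0, len(level), 2):
            pool = level[i] + (level[i + 1] if i + 1 < len(level) else [])
            merged.append(prune(pool, pool, radius, k))
        level = merged
    global_candidates = level[0] if level else []

    # 4. Verification: compare each global candidate against every partition
    #    and keep those whose total neighbor count is still below k.
    return [
        c
        for c in global_candidates
        if sum(count_neighbors(c, part, radius) for part in parts) < k
    ]
```

In a distributed setting, steps 2 and 4 would run once per partition on separate workers and step 3 would form a reduction tree; here everything runs in one process to keep the control flow visible. The merge step only compares candidates with other candidates, so it can leave false positives behind, which is why the final verification pass against the full partitions is needed.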
Publication Number |
US2013346466(A1) |
Publication Date |
2013.12.26 |
Application Number |
US201213530140 |
Filing Date |
2012.06.22 |
Applicant |
ZHANG XIONG;YANG HUNG-CHIH;LANGE DANNY;MICROSOFT CORPORATION |
Inventor |
ZHANG XIONG;YANG HUNG-CHIH;LANGE DANNY |
Classification |
G06F15/16 |
Main Classification |
G06F15/16 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|