发明名称 |
Method and system for detecting deviations in data tables |
摘要 |
A method and system for automatically detecting deviations in a data table comprising a multitude of records and a multitude of columns. A column of the data table is selected as a classification column and a classification tree is calculated with respect to the classification column. Each edge of the classification tree is associated with a predicate. The leaf nodes of the classification tree are associated with a leaf record set comprising the subset of records for which the class predicate comprising all predicates along a path from a root node of the classification tree to the leaf nodes evaluates to TRUE. Leaf nodes are associated with a leaf label representing an expected value in the classification column for the corresponding leaf record sets. From the leaf record sets all records deviating with respect to the corresponding classification column from the leaf label are determined as deviation sets.
|
申请公布号 |
US2002049740(A1) |
申请公布日期 |
2002.04.25 |
申请号 |
US20010930921 |
申请日期 |
2001.08.16 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
ARNING ANDREAS;BOLLINGER TONI;KEULER REINHOLD;SCHWENKREIS FRIEDEMANN HARALD |
分类号 |
G06F17/30;(IPC1-7):G06F7/00 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|