Title: Method and system for detecting deviations in data tables
Abstract: A method and system for automatically detecting deviations in a data table comprising a multitude of records and a multitude of columns. A column of the data table is selected as the classification column, and a classification tree is calculated with respect to that column. Each edge of the classification tree is associated with a predicate. Each leaf node of the classification tree is associated with a leaf record set: the subset of records for which the class predicate, comprising all predicates along the path from the root node of the classification tree to that leaf node, evaluates to TRUE. Each leaf node is also associated with a leaf label representing the expected value in the classification column for the corresponding leaf record set. Within each leaf record set, all records whose value in the classification column deviates from the leaf label are determined as the deviation set.
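The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the greedy split criterion, the restriction to equality predicates, the `min_leaf` and `depth` parameters, and the sample rows are all assumptions made for illustration. The majority value of a leaf stands in for the leaf label.

```python
from collections import Counter

def majority(records, target):
    """Most common value of the classification column (used as leaf label)."""
    return Counter(r[target] for r in records).most_common(1)[0][0]

def errors(records, target):
    """Number of records that disagree with the majority value."""
    if not records:
        return 0
    counts = Counter(r[target] for r in records)
    return len(records) - counts.most_common(1)[0][1]

def build_tree(records, target, columns, min_leaf=2, depth=2):
    """Greedy tree over equality predicates (assumed split rule, not the patent's)."""
    best = None
    if depth > 0:
        for col in columns:
            for val in sorted({r[col] for r in records}):
                yes = [r for r in records if r[col] == val]
                no = [r for r in records if r[col] != val]
                if len(yes) < min_leaf or len(no) < min_leaf:
                    continue
                cost = errors(yes, target) + errors(no, target)
                if best is None or cost < best[0]:
                    best = (cost, col, val, yes, no)
    # Stop when no admissible split reduces the number of disagreeing records.
    if best is None or best[0] >= errors(records, target):
        return {"label": majority(records, target), "records": records}
    _, col, val, yes, no = best
    return {"predicate": (col, val),
            "yes": build_tree(yes, target, columns, min_leaf, depth - 1),
            "no": build_tree(no, target, columns, min_leaf, depth - 1)}

def deviation_sets(tree, target, path=()):
    """Yield (class predicate, leaf label, deviating records) for each leaf."""
    if "label" in tree:
        devs = [r for r in tree["records"] if r[target] != tree["label"]]
        if devs:
            yield path, tree["label"], devs
        return
    col, val = tree["predicate"]
    yield from deviation_sets(tree["yes"], target, path + (f"{col} == {val!r}",))
    yield from deviation_sets(tree["no"], target, path + (f"{col} != {val!r}",))

# Hypothetical sample table; "country" is chosen as the classification column.
rows = [
    {"city": "Stuttgart", "country": "DE"},
    {"city": "Stuttgart", "country": "DE"},
    {"city": "Stuttgart", "country": "DE"},
    {"city": "Stuttgart", "country": "FR"},  # disagrees with its leaf label
    {"city": "Paris", "country": "FR"},
    {"city": "Paris", "country": "FR"},
    {"city": "Paris", "country": "FR"},
]
tree = build_tree(rows, "country", ["city"])
result = list(deviation_sets(tree, "country"))
for class_predicate, label, deviations in result:
    print(class_predicate, label, deviations)
```

In this sketch the class predicate of a leaf is the tuple of edge predicates collected on the way down from the root, and a deviation set is reported only for leaves that contain at least one record disagreeing with the leaf label.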
Publication number: US2002049740(A1) Publication date: 2002.04.25
Application number: US20010930921 Filing date: 2001.08.16
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors: ARNING ANDREAS;BOLLINGER TONI;KEULER REINHOLD;SCHWENKREIS FRIEDEMANN HARALD
Classification: G06F17/30;(IPC1-7):G06F7/00 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address: