Title of invention: Method for Discovering Undeclared and Fuzzy Rules in Databases
Abstract: A scheme is used to automatically discover algebraic constraints between pairs of columns in relational data. The constraints may be "fuzzy" in that they hold for most, but not all, of the records, and the columns may be in the same table or different tables. The scheme first identifies candidate sets of column value pairs that are likely to satisfy an algebraic constraint. For each candidate, the scheme constructs algebraic constraints by applying statistical histogramming, segmentation, or clustering techniques to samples of column values. In query-optimization mode, the scheme automatically partitions the data into normal and exception records. During subsequent query processing, queries can be modified to incorporate the constraints; the optimizer uses the constraints to identify new, more efficient access paths. The results are then combined with the results of executing the original query against the (small) set of exception records.
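Publication number: US2008027907(A1)  Publication date: 2008.01.31
Application number: US20070842828  Filing date: 2007.08.21
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION  Inventors: BROWN PAUL G.; HAAS PETER J.
Classification: G06F7/00; G06F17/30; G06N5/04  Main classification: G06F7/00
Agency:  Agent:
Principal claim:
Address: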
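The segmentation and partitioning steps described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes the algebraic operator is subtraction, uses a simple gap-based segmentation in place of the statistical histogramming or clustering the abstract mentions, and treats the whole (tiny) table as the sample. The names `find_bands`, `partition`, `max_gap`, and `min_share`, and the (ship_day, order_day) data, are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[float, float]
Row = Tuple[float, float]


def find_bands(diffs: List[float], max_gap: float, min_share: float) -> List[Interval]:
    """Segment sorted differences into intervals ("bands").

    A new band starts wherever two consecutive sorted values are more than
    max_gap apart. Only bands holding at least min_share of the sample are
    kept, so sparse outliers do not become constraints.
    """
    if not diffs:
        return []
    ordered = sorted(diffs)
    total = len(ordered)
    bands: List[Interval] = []
    start = prev = ordered[0]
    count = 1
    for d in ordered[1:]:
        if d - prev > max_gap:
            # Close the current band and keep it only if it is dense enough.
            if count / total >= min_share:
                bands.append((start, prev))
            start, count = d, 0
        prev = d
        count += 1
    if count / total >= min_share:
        bands.append((start, prev))
    return bands


def partition(rows: List[Row], bands: List[Interval]) -> Tuple[List[Row], List[Row]]:
    """Split rows into normal records (a - b falls in some band) and exceptions."""
    normal: List[Row] = []
    exceptions: List[Row] = []
    for a, b in rows:
        d = a - b
        if any(lo <= d <= hi for lo, hi in bands):
            normal.append((a, b))
        else:
            exceptions.append((a, b))
    return normal, exceptions


# Hypothetical (ship_day, order_day) pairs: most orders ship within 2 to 4 days.
rows = [(12, 10), (15, 12), (22, 20), (34, 30), (43, 40),
        (52, 50), (63, 60), (74, 70), (83, 80), (140, 100)]

bands = find_bands([a - b for a, b in rows], max_gap=1.0, min_share=0.5)
normal, exceptions = partition(rows, bands)
```

Under these assumptions, the discovered band plays the role of a fuzzy constraint such as "ship_day minus order_day lies between 2 and 4". A query could be rewritten with that predicate and run against the normal records, while the original query runs against the short exception list.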