发明名称 Database query optimization using clustering data mining
摘要 A method and system for optimizing a database query. A database table populated with data is received and scanned. Statistics and single column histograms associated with single columns of the table are determined. Cardinality based on the statistics and histograms is estimated. All possible correlations among multiple columns are determined by performing clustering data mining that partitions data in the table into clusters. Top ranked columns based on the correlations are determined. The difference between the estimated cardinality and a support count of a cluster is determined to exceed a threshold, and in response, multiple column histograms based on the top ranked columns are determined. An optimal query plan based on the multiple column histograms is generated.
申请公布号 US8229917(B1) 申请公布日期 2012.07.24
申请号 US201113033803 申请日期 2011.02.24
申请人 ANEAS FABIO F. M.;KATAHIRA REINALDO T.;MARIANO ALESSANDRO B. A.;DA ROCHA PEDRO H. V.;INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 ANEAS FABIO F. M.;KATAHIRA REINALDO T.;MARIANO ALESSANDRO B. A.;DA ROCHA PEDRO H. V.
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址