Title of invention |
Automatic selection of blocking column for de-duplication |
Abstract |
A method of blocking column selection can include determining a first parameter for each column set of a plurality of column sets, wherein the first parameter indicates distribution of blocks in the column set, and determining a second parameter for each column set. The second parameter can indicate block size for the column set. For each column set, a measure of blockability that is dependent upon at least the first parameter and the second parameter can be calculated using a processor. The plurality of column sets can be ranked according to the measures of blockability.
|
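The abstract names two per-column-set parameters and a combined measure of blockability but gives no formulas here. The following is a minimal sketch of one way such a ranking could work, not the patented method. Every concrete choice is an assumption made for illustration: normalized Shannon entropy as the block-distribution parameter, closeness of the mean block size to a hypothetical `target` as the block-size parameter, their product as the measure, and all function names.

```python
from collections import Counter
from math import log


def block_sizes(rows, column_set):
    """Group rows by their values on column_set and return the block sizes."""
    keys = Counter(tuple(row[c] for c in column_set) for row in rows)
    return list(keys.values())


def distribution_score(sizes):
    """First parameter (ASSUMED form): normalized entropy of block sizes.

    Approaches 1.0 when rows are spread evenly over the blocks and
    is 0.0 when there are fewer than two blocks.
    """
    if len(sizes) < 2:
        return 0.0
    total = sum(sizes)
    entropy = -sum((s / total) * log(s / total) for s in sizes)
    return entropy / log(len(sizes))


def size_score(sizes, target=2.0):
    """Second parameter (ASSUMED form): closeness of mean block size to target.

    `target` is a hypothetical tuning knob, not a value from the source.
    """
    if not sizes:
        return 0.0
    mean = sum(sizes) / len(sizes)
    return min(mean, target) / max(mean, target)


def blockability(rows, column_set):
    """Combine both parameters (ASSUMED combination: simple product)."""
    sizes = block_sizes(rows, column_set)
    return distribution_score(sizes) * size_score(sizes)


def rank_column_sets(rows, column_sets):
    """Return (score, column_set) pairs sorted from most to least blockable."""
    scored = [(blockability(rows, cs), cs) for cs in column_sets]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Toy data, invented for this example only.
rows = [
    {"zip": "10001", "gender": "F", "id": 1},
    {"zip": "10001", "gender": "M", "id": 2},
    {"zip": "10002", "gender": "F", "id": 3},
    {"zip": "10002", "gender": "M", "id": 4},
    {"zip": "10003", "gender": "F", "id": 5},
    {"zip": "10003", "gender": "F", "id": 6},
]
ranked = rank_column_sets(rows, [("zip",), ("gender",), ("id",)])
for score, column_set in ranked:
    print(column_set, round(score, 3))
```

On this toy data `zip` ranks first because it produces evenly sized blocks of two records each. `gender` yields larger, uneven blocks, and `id` yields singleton blocks in which no duplicate pair could ever be compared.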
Publication number |
US8560506(B2) |
Publication date |
2013.10.15 |
Application number |
US201213447726 |
Filing date |
2012.04.16 |
Applicant |
CHATURVEDI SNIGDHA;FARUQUIE TANVEER A.;KARANAM HIMA P.;MENDELSSOHN MARVIN;MOHANIA MUKESH K.;SUBRAMANIAM L. VENKATA;INTERNATIONAL BUSINESS MACHINES CORPORATION |
Inventor |
CHATURVEDI SNIGDHA;FARUQUIE TANVEER A.;KARANAM HIMA P.;MENDELSSOHN MARVIN;MOHANIA MUKESH K.;SUBRAMANIAM L. VENKATA |
Classification number |
G06F7/00 |
Main classification number |
G06F7/00 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|