发明名称 SYSTEM AND METHOD FOR DATA DEDUPLICATION FOR DISK STORAGE SUBSYSTEMS
摘要 A method for data deduplication includes the following steps. First, segmenting an original data set into a plurality of data segments. Next, transforming the data in each data segment into a transformed data representation that has a band-type structure for each data segment. The band-type structure includes a plurality of bands. Next, selecting a first set of bands, grouping them together and storing them with the original data set. The first set of bands includes non-identical transformed data for each data segment. Next, selecting a second set of bands and grouping them together. The second set of bands includes identical transformed data for each data segment. Next, applying a hash function onto the transformed data of the second set of bands and thereby generating transformed data segments indexed by hash function indices. Finally, storing the hash function indices and the transformed data representation of one representative data segment in a deduplication database.
申请公布号 EP2594036(A4) 申请公布日期 2016.10.26
申请号 EP20110807550 申请日期 2011.07.15
申请人 EMC CORPORATION 发明人 BATES, JOHN, W.
分类号 H04L9/14;G06F3/06;G06F11/14;G06F17/14;G06F21/00;G06K9/46 主分类号 H04L9/14
代理机构 代理人
主权项
地址