发明名称 SCALABLE DEDUPLICATION OF STORED DATA
摘要 In a method and apparatus for scalable deduplication, a data set is partitioned into multiple logical partitions, where each partition can be deduplicated independently. Each data block of the data set is assigned to exactly one partition, so that any two or more data blocks that are duplicates of each are always be assigned to the same logical partition. A hash algorithm generates a fingerprint of each data block in the volume, and the fingerprints are subsequently used to detect possible duplicate data blocks as part of deduplication. In addition, the fingerprints are used to ensure that duplicate data blocks are sent to the same logical partition, prior to deduplication. A portion of the fingerprint of each data block is used as a partition identifier to determine the partition to which the data block should be assigned. Once blocks are assigned to partitions, deduplication can be done on partitions independently.
申请公布号 WO2010019596(A3) 申请公布日期 2010.05.06
申请号 WO2009US53433 申请日期 2009.08.11
申请人 NETAPP, INC.;MONDAL, SHISHIR;KILLAMSHETTI, PRAVEEN 发明人 MONDAL, SHISHIR;KILLAMSHETTI, PRAVEEN
分类号 G06F15/16;G06F11/16;G06F12/16 主分类号 G06F15/16
代理机构 代理人
主权项
地址