发明名称 Multitier deduplication systems and methods
摘要 Multitier deduplication can reduce the amount of bandwidth and storage resources used during deduplication. In certain embodiments, the system can determine if a data block is stored in a first archive data storage. If so, the system can skip the data block. If not, the system can determine if the data block is stored or identified in a second archive data storage. In various implementations, the first archive data storage can be local to the system and the second archive data storage can be a global archive that may be remote from the system. The system can create a map of a plurality of backups stored at the first archive enabling the system to quickly check multiple archives. The multitier data deduplication can filter out inactive data blocks during or before performing the deduplication process.
申请公布号 US8898114(B1) 申请公布日期 2014.11.25
申请号 US201113218362 申请日期 2011.08.25
申请人 Dell Software Inc. 发明人 Feathergill David Allen;Mattox Jason
分类号 G06F7/00;G06F17/00;G06F17/30 主分类号 G06F7/00
代理机构 Winstead PC 代理人 Winstead PC
主权项 1. A system for deduplicating a backup archive, the system comprising: a computer system comprising computer hardware, the computer system programmed to implement: a deduplication module configured to: access one or more block directories associated with one or more backup archives at an archive data store, wherein the one or more block directories include fingerprints of data blocks associated with the one or more backup archives, and wherein at least one of the one or more backup archives is associated with a data store;create a composite block map based at least in part on the one or more block directories, wherein the composite block map includes fingerprints of each data block stored at the archive data store; andaccess one or more data blocks from the data store and for each of the one or more data blocks, the deduplication module is further configured to: create a fingerprint for the data block;determine whether the fingerprint exists in the composite block map of the archive data store;in response to determining that the fingerprint does not exist in the composite block map, determine whether the fingerprint exists in a global deduplication data store, wherein the global deduplication data store is separate from the archive data store and the composite block map; andin response to determining that the fingerprint does not exist in the global deduplication data store, identify the data block for backup storage; anda backup module configured to: backup each of the data blocks identified for backup storage as a target archive at the archive data store; andstore the fingerprint associated with each of the data blocks identified for backup storage at a target block directory associated with the target archive.
地址 Aliso Viejo CA US