Title of invention: Global de-duplication in shared architectures
Abstract: Redundant data is globally de-duplicated across a shared architecture that includes a plurality of storage systems. The storage systems implement copy-on-write or WAFL to generate snapshots of original data. Each storage system includes a de-duplication client to identify and reduce redundant original and/or snapshot data on the storage system. Each de-duplication client can de-duplicate a digital sequence by breaking the sequence into blocks and identifying redundant blocks already stored in the shared architecture. Identifying redundant blocks may include hashing each block and comparing the hash to a local and/or master hash table containing hashes of existing data. Once redundant data is identified, it is either deleted if it was previously stored (e.g., post-process de-duplication) or not stored to begin with (e.g., inline de-duplication). In both cases, pointers to shared data blocks can be used to reassemble the digital sequence where one or more blocks were deleted or not stored on the storage system.
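The inline variant described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the names `MasterIndex`, `DedupClient`, and `BLOCK_SIZE`, the fixed-size blocking, the choice of SHA-256, and the in-memory dictionaries are all assumptions made for illustration. Each client splits a sequence into blocks, hashes each block, checks the hash against its local table and then the master table, stores only blocks not seen before, and keeps a list of pointers (here, the hashes) for reassembly.

```python
import hashlib

BLOCK_SIZE = 4  # hypothetical fixed block size, kept tiny for demonstration


class MasterIndex:
    """Hypothetical master hash table shared by every storage system."""

    def __init__(self):
        self.blocks = {}  # hash -> block bytes (the single shared copy)


class DedupClient:
    """Hypothetical per-storage-system de-duplication client (inline mode)."""

    def __init__(self, master):
        self.master = master
        self.local_hashes = set()  # local table of hashes this client has seen

    def store(self, data):
        """Store a byte sequence; return the pointers needed to rebuild it."""
        pointers = []
        for start in range(0, len(data), BLOCK_SIZE):
            block = data[start:start + BLOCK_SIZE]
            digest = hashlib.sha256(block).hexdigest()
            # Check the local table first, then the master table.
            known = digest in self.local_hashes or digest in self.master.blocks
            if not known:
                # New block: store it once in the shared architecture.
                self.master.blocks[digest] = block
            # Redundant blocks are never written; only a pointer is kept.
            self.local_hashes.add(digest)
            pointers.append(digest)
        return pointers

    def reassemble(self, pointers):
        """Rebuild the original sequence by following pointers to shared blocks."""
        return b"".join(self.master.blocks[p] for p in pointers)
```

As a usage example under the same assumptions, two clients created with the same `MasterIndex` that store `b"AAAABBBBAAAA"` and `b"BBBBCCCC"` respectively leave only three distinct blocks in the master table, and each client can still rebuild its own sequence from its pointer list. A post-process variant would instead store every block first and delete duplicates in a later pass.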
Publication number: US8423726(B2) Publication date: 2013.04.16
Application number: US201213464017 Filing date: 2012.05.04
Applicant: YUEH JEDIDIAH;EMC CORPORATION Inventor: YUEH JEDIDIAH
Classification: G06F12/00;G06F13/00;G06F13/28 Main classification: G06F12/00
Agency: Agent:
Main claim:
Address: