发明名称 OBJECT-LEVEL IDENTIFICATION OF DUPLICATE DATA IN A STORAGE SYSTEM
摘要 The technique introduced here includes a system and method for identification of duplicate data directly at a data-object level. The technique illustratively utilizes a hierarchical tree of fingerprints for each data object to compare data objects and identify duplicate data blocks referenced by the data objects. The hierarchical fingerprint trees are constructed in such a manner that a top-level fingerprint (or object-level fingerprint) of the hierarchical tree is representative of all data blocks referenced by a storage system. In embodiments, inline techniques are utilized to generate hierarchical fingerprints for new data objects as they are created, and an object-level fingerprint of the new data object is compared against preexisting object-level fingerprints in the storage system to identify exact or close matches. While exact matches result in complete deduplication of data blocks referenced by the data object, hierarchical comparison methods are used for identifying and mapping duplicate data blocks referenced by closely related data objects.
申请公布号 EP2721495(A4) 申请公布日期 2015.08.26
申请号 EP20120800807 申请日期 2012.06.07
申请人 NETAPP, INC. 发明人 YASA, GIRIDHAR APPAJI NAG;CHANDRASEKARASASTRY, NAGESH PANYAM
分类号 G06F17/30;G06F11/00;G06F12/00;G06F15/16 主分类号 G06F17/30
代理机构 代理人
主权项
地址