发明名称 System and method for preprocessing a data set to improve deduplication
摘要 The technique introduced here includes a system and method for preprocessing a data set to improve deduplication, and more specifically for reducing latency. The technique illustratively utilizes one or more preprocessing steps, including a skipping step and a folding step, which can be applied to a data set prior to deduplication to reduce the time consumed by deduplication. The folding step is applied to segments of the data set to reduce the length of the segments. The skipping step can be applied to the data set prior to the folding step to remove particular segments of the data set, to further improve deduplication performance in certain circumstances. The overall effect of the skipping and folding steps of this technique is to produce a data set of reduced total length for consideration in identifying duplicate data, which aids in reducing the time required for deduplication.
申请公布号 US8285957(B1) 申请公布日期 2012.10.09
申请号 US20100686297 申请日期 2010.01.12
申请人 NAG, YASA GIRIDHAR APPAJI;STAGER ROGER KEITH;NETAPP, INC. 发明人 NAG, YASA GIRIDHAR APPAJI;STAGER ROGER KEITH
分类号 G06F12/16;G06F7/04 主分类号 G06F12/16
代理机构 代理人
主权项
地址