Title: Sampling based elimination of duplicate data
Abstract: A technique for eliminating duplicate data is provided. Upon receipt of a new data set, one or more anchor points are identified within the data set. A bit-by-bit data comparison is then performed of the region surrounding the anchor point in the received data set with the region surrounding an anchor point stored within a pattern database to identify forward/backward delta values. The duplicate data identified by the anchor point, forward and backward delta values is then replaced in the received data set with a storage indicator.
Publication number: US9344112(B2) Publication date: 2016.05.17
Application number: US201213443650 Filing date: 2012.04.10
Applicant: Zheng Ling;Stager Roger;Johnston Craig;Trimmer Don;Frandzel Yuval Inventor: Zheng Ling;Stager Roger;Johnston Craig;Trimmer Don;Frandzel Yuval
Classification: H04N7/18;H03M7/00;H04N19/20;H04N19/23;H04N19/25 Main classification: H04N7/18
Agency: Gilliam IP PLLC Agent: Gilliam IP PLLC
Principal claim: 1. A method for removing duplicate data stored on a storage system, the method comprising: performing an operation on a first data set to identify an anchor within the first data set, wherein the anchor defines a starting point in a first region of the first data set for potential data de-duplication; determining a number of consecutive bits or bytes of data that match between the first data set and a second data set forwards and backwards from the identified anchor; and replacing the matching data in the first data set with an indication of the second data set, the anchor, and the number of matching bits or bytes forwards from the anchor and the number of matching bits or bytes backwards from the anchor.
Address: Sunnyvale CA US
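
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions made only for illustration: the anchor is located by searching for a fixed marker byte string (the record does not say which operation identifies an anchor), matching is done per byte rather than per bit, the forward count includes the anchor byte itself, and the names `DupRef`, `find_anchor`, `match_extent`, `replace_duplicate`, and `restore` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass(frozen=True)
class DupRef:
    """Hypothetical storage indicator that stands in for duplicate bytes."""
    source_id: str   # which stored (second) data set holds the bytes
    anchor: int      # anchor offset inside that stored data set
    forward: int     # matching bytes from the anchor onward (anchor included)
    backward: int    # matching bytes immediately before the anchor


def find_anchor(data: bytes, marker: bytes) -> int:
    """Assumed anchor operation: offset of the first marker hit, or -1."""
    return data.find(marker)


def match_extent(first: bytes, a1: int, second: bytes, a2: int) -> Tuple[int, int]:
    """Count consecutive matching bytes forwards and backwards from the anchors."""
    forward = 0
    while (a1 + forward < len(first) and a2 + forward < len(second)
           and first[a1 + forward] == second[a2 + forward]):
        forward += 1
    backward = 0
    while (a1 - backward - 1 >= 0 and a2 - backward - 1 >= 0
           and first[a1 - backward - 1] == second[a2 - backward - 1]):
        backward += 1
    return forward, backward


def replace_duplicate(first: bytes, a1: int, second: bytes, a2: int,
                      source_id: str) -> List[Union[bytes, DupRef]]:
    """Replace the matching region of `first` with a DupRef into `second`."""
    forward, backward = match_extent(first, a1, second, a2)
    ref = DupRef(source_id, a2, forward, backward)
    return [first[:a1 - backward], ref, first[a1 + forward:]]


def restore(parts: List[Union[bytes, DupRef]], stored: dict) -> bytes:
    """Rebuild the original bytes by expanding each DupRef from stored data."""
    out = bytearray()
    for part in parts:
        if isinstance(part, DupRef):
            src = stored[part.source_id]
            out += src[part.anchor - part.backward:part.anchor + part.forward]
        else:
            out += part
    return bytes(out)


# Usage: the two data sets share the region b"HELLO-ANCHOR-WORLD".
marker = b"ANCHOR"
second = b"xHELLO-ANCHOR-WORLDyy"    # previously stored data set
first = b"abHELLO-ANCHOR-WORLDcd"    # newly received data set
a1, a2 = find_anchor(first, marker), find_anchor(second, marker)
parts = replace_duplicate(first, a1, second, a2, "stored-1")
roundtrip = restore(parts, {"stored-1": second})
```

In this sketch the deduplicated form of `first` keeps only the two non-matching ends as literal bytes, and the shared region is recoverable from the stored data set through the anchor offset and the two counts.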