发明名称 Sampling based data de-duplication
摘要 Example apparatus, methods, and computers perform sampling based data de-duplication. One example method controls a data de-duplication computer to compute a sampling sequence for a sub-block of data and to use the sampling sequence to locate a stored sub-block known to the data de-duplication computer. Upon finding a stored sub-block to compare to, the method includes controlling the data de-duplication computer to determine a degree of similarity (e.g., duplicate, very similar, somewhat similar, very dissimilar, completely dissimilar, x % similar) between the sub-block and the stored sub-block and to control whether and how the sub-block is stored and/or transmitted based on the degree of similarity. The degree of similarity can also control whether and how the data de-duplication computer updates a dedupe data structure(s) that stores information for finding groups of similarity sampling sequence related sub-blocks.
申请公布号 US8442956(B2) 申请公布日期 2013.05.14
申请号 US201213351192 申请日期 2012.01.16
申请人 TOFANO JEFFREY VINCENT;WELLS FARGO CAPITAL FINANCE, LLC 发明人 TOFANO JEFFREY VINCENT
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址