Title: Data deduplication with adaptive erasure code redundancy
Abstract: Example apparatus and methods combine erasure coding with data deduplication to simultaneously reduce the overall redundancy in data while increasing the redundancy of unique data. In one embodiment, an efficient representation of a data set is produced by deduplication. The efficient representation reduces duplicate data in the data set. Redundancy is then added back into the data set using erasure coding. The redundancy that is added back in adds protection to the unique data associated with the efficient representation. How much redundancy is added back in and what type of redundancy is added back in may be controlled based on an attribute (e.g., value, reference count, symbol size, number of symbols) of the unique data. Decisions concerning how much and what type of redundancy to add back in may be adapted over time based, for example, on observations of the efficiency of the overall system.
Publication number: US9503127(B2) Publication date: 2016.11.22
Application number: US201414326774 Filing date: 2014.07.09
Applicant: Quantum Corporation Inventors: Wideman Roderick B;Arslan Suayb Sefik;Lee Jaewook;Goker Turguy
Classification: H03M13/00;H03M13/37;G06F11/14 Main classification: H03M13/00
Agency: Eschweiler & Associates, LLC Agent: Eschweiler & Associates, LLC
Main claim: 1. A non-transitory computer-readable storage medium storing computer-executable instructions that when executed by a computer cause the computer to perform a method, the method comprising: accessing a message produced by a data deduplication system; identifying a property of the message, and generating W erasure code symbols for the message, where the erasure code symbols are generated according to an X/Y erasure code policy, W, X and Y being integers, W being greater than or equal to X, W being less than or equal to Y, and where W, X or Y depend, at least in part, on a property of the message.
Address: San Jose CA US
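
Illustrative note: the claim constrains the number of generated erasure code symbols W to lie between X and Y and lets W depend on a property of the message. The Python sketch below shows one way such a choice could look, using the reference count named in the abstract as the property. The names `ErasurePolicy` and `choose_symbol_count`, and the "one extra symbol per doubling of the reference count" heuristic, are assumptions made for illustration and are not taken from the patent. The sketch selects W only; it does not perform erasure coding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErasurePolicy:
    """Hypothetical container for the X and Y of an X/Y erasure code policy."""
    x: int  # lower bound on W
    y: int  # upper bound on W


def choose_symbol_count(reference_count: int, policy: ErasurePolicy) -> int:
    """Return W with X <= W <= Y, growing with the message's reference count.

    Assumed heuristic (not from the patent): add one symbol for each
    doubling of the reference count, capped at Y.
    """
    if policy.x < 1 or policy.y < policy.x:
        raise ValueError("policy must satisfy 1 <= X <= Y")
    if reference_count < 1:
        raise ValueError("reference_count must be at least 1")
    # bit_length() - 1 is floor(log2(n)) for positive integers.
    extra = reference_count.bit_length() - 1
    return min(policy.y, policy.x + extra)


if __name__ == "__main__":
    policy = ErasurePolicy(x=10, y=16)
    for refs in (1, 8, 1000):
        print(refs, choose_symbol_count(refs, policy))
```

Under this assumed heuristic, a chunk referenced by many files receives more symbols than a chunk referenced once, which matches the abstract's idea of adding more protection to unique data that more of the data set depends on.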