Title of Invention |
Synchronization of storage using log files and snapshots |
Abstract |
A system for processing data comprises a deduplicating system, an interface, and a processor. The deduplicating system stores a copy of data stored in a data storage system by storing a set of segments that is able to reconstruct the data stored in the data storage system. The data storage system has a stored log file. The stored log file stores a data change and an associated time for the data change. The interface receives an indication to revert data stored in the data storage system to a state at a snapshot time. The processor is configured to determine a first subset of the data stored in the data storage system that has changed since a prior snapshot using the stored log file and to determine a second subset of the data stored in the data storage system that has changed between the prior snapshot and the snapshot time using a first list of fingerprints associated with the prior snapshot and a second list of fingerprints associated with the snapshot time. |
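The two change-detection steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. `LogEntry`, the block-index model, the position-aligned fingerprint lists, and the idea of taking the union of the two subsets are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogEntry:
    """Hypothetical log record: a change time and the block it touched."""
    time: float
    block: int


def changed_since(log, prior_snapshot_time):
    """First subset: blocks the log says changed after the prior snapshot."""
    return {entry.block for entry in log if entry.time > prior_snapshot_time}


def changed_between(prior_fps, target_fps):
    """Second subset: positions whose fingerprints differ between two snapshots.

    Assumes both lists are position-aligned (fixed-size blocks); a missing
    position in either list counts as a change.
    """
    changed = set()
    for i in range(max(len(prior_fps), len(target_fps))):
        a = prior_fps[i] if i < len(prior_fps) else None
        b = target_fps[i] if i < len(target_fps) else None
        if a != b:
            changed.add(i)
    return changed


def blocks_to_restore(log, prior_time, prior_fps, target_fps):
    """Assumed combination step: restore every block in either subset."""
    return changed_since(log, prior_time) | changed_between(prior_fps, target_fps)


log = [LogEntry(5.0, 0), LogEntry(12.0, 3), LogEntry(15.0, 1)]
prior_fps = ["a", "b", "c", "d"]   # fingerprints at the prior snapshot
target_fps = ["a", "x", "c", "d"]  # fingerprints at the snapshot time to revert to
restore = blocks_to_restore(log, 10.0, prior_fps, target_fps)
```

In this sketch, only the blocks in `restore` would need to be rewritten on the data storage system, rather than the full data set.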
Publication Number |
US8782003(B1) |
Publication Date |
2014.07.15 |
Application Number |
US201113107791 |
Filing Date |
2011.05.13 |
Applicant |
EMC Corporation |
Inventor |
Patterson R. Hugo |
Classification Numbers |
G06F7/00;G06F17/30 |
Main Classification Number |
G06F7/00 |
Agency |
Van Pelt, Yi & James LLP |
Agent |
Van Pelt, Yi & James LLP |
Principal Claim |
1. A system for processing data, comprising:
a deduplicating system that stores a copy of at least a portion of data stored in a data storage system at a first snapshot time at least in part by generating a first snapshot of the data, wherein the deduplicating system includes a fingerprint index that comprises a list of fingerprints associated with every unique segment stored in the deduplicating system, wherein the data storage system includes a stored log file that stores one or more data changes and times associated with each data change, and wherein the deduplicating system generates the first snapshot by:
breaking at least a portion of the data at the data storage system into a plurality of segments;
computing a fingerprint for each of at least a subset of the plurality of segments;
storing the fingerprints in the first snapshot, wherein a snapshot includes a list of fingerprints of data comprising at least the portion of the data stored in the data storage system, and wherein a fingerprint that corresponds to identical segments is repeated in a snapshot;
identifying, based at least in part on the fingerprint index, fingerprints in the snapshot that are not in the fingerprint index; and
storing only segments that correspond to the identified fingerprints such that each stored segment is able to be used to reconstruct the data stored in the data storage system;
an interface for receiving an indication to revert data stored in the data storage system to a state at a snapshot time; and
a processor configured to:
determine a first subset of the data stored in the data storage system that has changed since a prior snapshot using the stored log file; and
determine a second subset of the data stored in the data storage system that has changed between the prior snapshot and the snapshot time using a first list of fingerprints associated with the prior snapshot and a second list of fingerprints associated with the snapshot time. |
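The snapshot-generation steps recited in the claim (segment, fingerprint, list, compare against the index, store only new segments) might look roughly like the following Python sketch. It is an illustration under stated assumptions, not the claimed system: fixed-size segmentation, SHA-256 as the fingerprint function, and the names `DedupStore`, `take_snapshot`, and `reconstruct` are all hypothetical choices that the record does not specify.

```python
import hashlib


def split_segments(data: bytes, size: int = 4):
    """Break data into fixed-size segments (assumed; the claim does not say how)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def fingerprint(segment: bytes) -> str:
    """Assumed fingerprint function: SHA-256 hex digest of the segment."""
    return hashlib.sha256(segment).hexdigest()


class DedupStore:
    """Hypothetical deduplicating store keyed by a fingerprint index."""

    def __init__(self):
        # Fingerprint index: one entry per unique stored segment.
        self.index = {}

    def take_snapshot(self, data: bytes, size: int = 4):
        segments = split_segments(data, size)
        # The snapshot lists every fingerprint, repeated for identical segments.
        snapshot = [fingerprint(seg) for seg in segments]
        # Store only segments whose fingerprints are not yet in the index.
        for fp, seg in zip(snapshot, segments):
            if fp not in self.index:
                self.index[fp] = seg
        return snapshot

    def reconstruct(self, snapshot):
        """Rebuild the data from a snapshot's fingerprint list."""
        return b"".join(self.index[fp] for fp in snapshot)


store = DedupStore()
data = b"aaaabbbbaaaacc"
snap = store.take_snapshot(data)
```

Here the snapshot holds four fingerprints while the index holds only three segments, because the repeated `aaaa` segment is stored once but listed twice.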
Address |
Hopkinton MA US |