发明名称 Synchronization of storage using log files and snapshots
摘要 A system for processing data comprises a deduplicating system, an interface, and a processor. The deduplicating system stores a copy of data stored in a data storage system by storing a set of segments that is able to reconstruct the data stored in the data storage system. The data storage system has a stored log file. The stored log file stores a data change and an associated time for the data change. The interface receives an indication to revert data stored in the data storage system to a state at a snapshot time. The processor is configured to determine a first subset of the data stored in the data storage system that has changed since a prior snapshot using the stored log file and to determine a second subset of the data stored in the data storage system that has changed between the prior snapshot and the snapshot time using a first list of fingerprints associated with the prior snapshot and a second list of fingerprints associated with the snapshot time.
申请公布号 US8782003(B1) 申请公布日期 2014.07.15
申请号 US201113107791 申请日期 2011.05.13
申请人 EMC Corporation 发明人 Patterson R. Hugo
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 Van Pelt, Yi & James LLP 代理人 Van Pelt, Yi & James LLP
主权项 1. A system for processing data, comprising: a deduplicating system that stores a copy of at least a portion of data stored in a data storage system at a first snapshot time at least in part by generating a first snapshot of the data, wherein the deduplicating system includes a fingerprint index that comprises a list of fingerprints associated with every unique segment stored in the deduplicating system, wherein the data storage system includes a stored log file that stores one or more data changes and times associated with each data change, and wherein the deduplicating system generates the first snapshot by: breaking at least a portion of the data at the data storage system into a plurality of segments;computing a fingerprint for each of at least a subset of the plurality of segments;storing the fingerprints in the first snapshot, wherein a snapshot includes a list of fingerprints of data comprising at least the portion of the data stored in the data storage system, and wherein a fingerprint that correspond to identical segments is repeated in a snapshot;identifying, based at least in part on the fingerprint index, fingerprints in the snapshot that are not in the fingerprint index; andstoring only segments that correspond to the identified fingerprints such that each stored segment is able to be used to reconstruct the data stored in the data storage system; an interface for receiving an indication to revert data stored in the data storage system to a state at a snapshot time; and a processor configured to: determine a first subset of the data stored in the data storage system that has changed since a prior snapshot using the stored log file; anddetermine a second subset of the data stored in the data storage system that has changed between the prior snapshot and the snapshot time using a first list of fingerprints associated with the prior snapshot and a second list of fingerprints associated with the snapshot time.
地址 Hopkinton MA US