发明名称 Segment deduplication system with compression of segments
摘要 A system for storing compressed data comprises a processor and a memory. The processor is configured to receive a compressed segment. The compressed segment is determined by breaking a data stream, a data block, or a data file into one or more segments and compressing each of the one or more segments. The processor is further configured to determine whether the compressed segment has been previously stored, and in the event that the compressed segment has not been previously stored, store the compressed segment. The memory is coupled to the processor and configured to provide the processor with instructions.
申请公布号 US8762348(B2) 申请公布日期 2014.06.24
申请号 US200912455951 申请日期 2009.06.09
申请人 EMC Corporation 发明人 Lumb Christopher R.
分类号 G06F7/00;G06F17/00;G06F17/30;G06F11/14;H03M7/30 主分类号 G06F7/00
代理机构 Van Pelt, Yi & James LLP 代理人 Van Pelt, Yi & James LLP
主权项 1. A system for storing compressed data, comprising: a processor configured to: break a data stream, a data block, or a data file into one or more segments, wherein one segment of the one or more segments is used to reconstruct more than one data streams, data blocks, or data files;compress each of the one or more segments into one or more compressed segments;receive one of the one or more compressed segments;receive metadata information associated with the compressed segment, wherein the received metadata information includes a segment identifier and compression metadata information, wherein the compression metadata information includes a compression type and a compression state, wherein the compression state comprises a state used for compressing the compressed segment, wherein the compression state comprises a key;determine whether the received compressed segment has been previously stored based at least in part on a comparison of the segment identifier associated with the received compressed segment and at least one segment identifier associated with a previously stored compressed segment;in a first event that the received compressed segment is determined to have been previously stored, store a reference to the previously stored compressed segment;in a second event that the received compressed segment is determined to have not been previously stored, store the received compressed segment, wherein the storing of the received compressed segment further comprises generating an index entry from the received compressed segment, wherein the index entry comprises one or more of the following: a digital fingerprint, SHA-1 hash, or Rabin Hash; and a memory coupled to the processor and configured to provide the processor with instructions.
地址 Hopkinton MA US