Title of invention: METHOD FOR INCREASING DEDUPLICATION SPEED ON DATA STREAMS FRAGMENTED BY SHUFFLING
Abstract: A computer-implemented method for deduplicating an incoming data sequence can include the steps of storing signature values for a plurality of data blocklets of a parent data sequence in a deduplication index, sequentially storing signature values for at least some of the plurality of data blocklets of the parent data sequence in a first storage location outside of the deduplication index, determining that a first data blocklet in the incoming data sequence is absent from the parent data sequence, storing a signature value for the first data blocklet in a second storage location outside of the deduplication index, storing a guarded link linking the first data blocklet to the second data blocklet into the second storage location, determining that a second data blocklet that follows the first data blocklet in the incoming data sequence is present in the parent data sequence, the second data blocklet having a signature value that is stored in the first storage location, and copying at least a portion of the contents of the first storage location and the second storage location into a cache to expedite access during deduplication of the incoming data sequence.
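The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; every name in it (signature, ingest_parent, deduplicate, window) is hypothetical, and so are several modeling choices: SHA-1 as the signature function, a dict as the deduplication index, plain lists as the first and second storage locations, and a "guarded link" modeled as a target position plus the expected signature at that position. The abstract does not define the guard, so that last reading is an assumption.

```python
import hashlib


def signature(blocklet: bytes) -> str:
    """Content signature of one blocklet (SHA-1 is an illustrative choice)."""
    return hashlib.sha1(blocklet).hexdigest()


def ingest_parent(blocklets):
    """Build the deduplication index and the sequential first storage location."""
    # First storage location: parent signatures kept in stream order.
    first_store = [signature(b) for b in blocklets]
    # Deduplication index: signature -> first position in the parent sequence.
    index = {}
    for pos, sig in enumerate(first_store):
        index.setdefault(sig, pos)
    return index, first_store


def deduplicate(incoming, index, first_store, window=4):
    """Deduplicate `incoming` against the parent, using a cache to avoid index lookups."""
    second_store = []   # second storage location: entries for new blocklets
    cache = {}          # signature -> ("parent", position) or ("new", position)
    stats = {"index_lookups": 0, "cache_hits": 0, "new_blocklets": 0}
    pending = None      # most recent new entry that has no link yet

    for blocklet in incoming:
        sig = signature(blocklet)
        hit = cache.get(sig)
        if hit is None:
            stats["index_lookups"] += 1
            pos = index.get(sig)
            if pos is None:
                # Absent from the parent: record its signature in the second store.
                pending = {"sig": sig, "link": None}
                second_store.append(pending)
                cache[sig] = ("new", len(second_store) - 1)
                stats["new_blocklets"] += 1
                continue
            hit = ("parent", pos)
        else:
            stats["cache_hits"] += 1

        kind, pos = hit
        if kind == "parent":
            if pending is not None:
                # Assumed guarded link: target position plus the signature
                # expected there, so a stale link can be detected later.
                pending["link"] = {"target": pos, "guard": sig}
            # Copy a run of the first store into the cache so the following
            # blocklets can be resolved without touching the index.
            for p in range(pos, min(pos + window, len(first_store))):
                cache.setdefault(first_store[p], ("parent", p))
        pending = None

    return second_store, stats
```

For a parent sequence A, B, C, D and an incoming sequence A, X, C, D, the sketch records X in the second store with a link to position 2 (C), and resolves C and D from the cache rather than from the index.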
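Publication number: US2012124011(A1) Publication date: 2012.05.17
Application number: US20100946779 Filing date: 2010.11.15
Applicant: SPACKMAN STEPHEN P.;DOERNER DON;QUANTUM CORPORATION Inventor: SPACKMAN STEPHEN P.;DOERNER DON
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: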