发明名称 SYSTEMS AND METHODS FOR BYTE-LEVEL OR QUASI BYTE-LEVEL SINGLE INSTANCING
摘要 Described in detail herein are systems and methods for deduplicating data using byte-level or quasi byte-level techniques. In some embodiments, a file is divided into multiple blocks. A block includes multiple bytes. Multiple rolling hashes of the file are generated. For each byte in the file, a searchable data structure is accessed to determine if the data structure already includes an entry matching a hash of a minimum sequence length. If so, this indicates that the corresponding bytes are already stored. If one or more bytes in the file are already stored, then the one or more bytes in the file are replaced with a reference to the already stored bytes. The systems and methods described herein may be used for file systems, databases, storing backup data, or any other use case where it may be useful to reduce the amount of data being stored.
申请公布号 US2013226883(A1) 申请公布日期 2013.08.29
申请号 US201313855514 申请日期 2013.04.02
申请人 COMMVAULT SYSTEMS, INC.;COMMVAULT SYSTEMS, INC. 发明人 KLOSE MICHAEL F.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址