发明名称 REDUCING DIGEST STORAGE CONSUMPTION BY TRACKING SIMILARITY ELEMENTS IN A DATA DEDUPLICATION SYSTEM
摘要 For reducing digests storage consumption in a data deduplication system using a processor device in a computing environment, input data is partitioned into chunks, and the chunks are grouped into chunk sets. Digests are calculated for input data and stored in sets corresponding to the chunk sets. Similarity elements are calculated for the input data and the similarity elements are stored in a similarity search structure, and the number of similarity elements associated with a chunk set which are currently contained in the similarity search structure is maintained for each chunk set.
申请公布号 US2015324419(A1) 申请公布日期 2015.11.12
申请号 US201514790804 申请日期 2015.07.02
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 ARONOVICH Lior
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for reducing digests storage consumption by tracking numbers of similarity elements in a similarity search structure in a data deduplication system using a processor device in a computing environment, comprising: partitioning input data into chunks and grouping the chunks into chunk sets; calculating digests for the input data and storing the digests in sets corresponding to the chunk sets; calculating similarity elements for the input data and storing the similarity elements in a similarity search structure; and maintaining for each one of the chunk sets a number of the similarity elements associated with the chunk set which are currently contained in the similarity search structure.
地址 Armonk NY US