发明名称 SCALABLE MECHANISM FOR DETECTION OF COMMONALITY IN A DEDUPLICATED DATA SET
摘要 Mechanisms are provided for efficiently determining commonality in a deduplicated data set in a scalable manner regardless of the number of deduplicated files or the number of stored segments. Information is generated and maintained during deduplication to allow scalable and efficient determination of data segments shared in a particular file, other files sharing data segments included in a particular file, the number of files sharing a data segment, etc. Data need not be expanded or uncompressed. Deduplication processing can be validated and verified during commonality detection.
申请公布号 US2015026139(A1) 申请公布日期 2015.01.22
申请号 US201414507731 申请日期 2014.10.06
申请人 Dell Products L.P. 发明人 Jayaraman Vinod
分类号 G06F17/30;G06F11/14 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method, comprising: generating a filemap corresponding to a deduplicated file, the filemap including a plurality of filemap indices; modifying a datastore suitcase, the datastore suitcase including a plurality of datastore indices corresponding to the filemap indices, a plurality of deduplicated data segments, and a last file entry identifying last files having placed a reference to deduplicated data segments, wherein the datastore suitcase is created when the processor processes a file for deduplication.
地址 Round Rock TX US