发明名称 Indexing a deduplicated cache system by integrating fingerprints of underlying deduplicated storage system
摘要 A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with maintaining a file index having a plurality of extent entries, each extent entry corresponding to one of a plurality of file extents stored in a cache memory device that caches data stored in a persistent storage device of a storage system. The method continues with maintaining a fingerprint index having a plurality of fingerprint entries, each mapping a fingerprint to a data region of a file indexed in the file index, wherein each fingerprint indexed in the fingerprint index is retrieved from metadata stored in the persistent storage device of the storage system when one or more corresponding data chunks were accessed, and deduplicating and accessing the file extents stored in the cache memory device using the file index and the fingerprint index.
申请公布号 US9336143(B1) 申请公布日期 2016.05.10
申请号 US201314038694 申请日期 2013.09.26
申请人 EMC Corporation 发明人 Wallace Grant R.;Douglis Frederick;Shilane Philip N.;Shim Hyong;Smaldone Stephen
分类号 G06F12/00;G06F12/08;G06F12/12 主分类号 G06F12/00
代理机构 Blakely, Sokoloff, Taylor & Zafman LLP 代理人 Blakely, Sokoloff, Taylor & Zafman LLP
主权项 1. A computer-implemented method for indexing content stored in a cache memory device, the method comprising: maintaining a file index having a plurality of extent entries, each extent entry corresponding to one of a plurality of file extents stored in a cache memory device that caches data stored in a persistent storage device of a deduplicated storage system, wherein each extent entry maps a particular data region of a particular file to a storage location of the cache memory device storing a corresponding file extent; maintaining a fingerprint index having a plurality of fingerprint entries, each mapping a fingerprint to a data region of a file indexed in the file index, wherein each fingerprint indexed in the fingerprint index is retrieved from metadata stored in the persistent storage device of the storage system when one or more corresponding data chunks were accessed; and deduplicating and accessing the file extents stored in the cache memory device using the file index and the fingerprint index, wherein the file index is used to determine whether the particular data region of the particular file has been previously stored in the cache memory device, and wherein the fingerprint index is used to determine whether the particular data region is shared by another data region of the particular file or shared by another file and the particular data region has been stored in the cache memory device during access of another data region of the particular file or another file.
地址 Hopkinton MA US