发明名称 Enhancing data processing performance by cache management of fingerprint index
摘要 Various embodiments for improving hash index key lookup caching performance in a computing environment are provided. In one embodiment, for a cached fingerprint map having a plurality of entries corresponding to a plurality of data fingerprints, reference count information is used to determine a length of time to retain the plurality of entries in cache. Those of the plurality of entries having a higher reference counts are retained longer than those having lower reference counts.
申请公布号 US9110815(B2) 申请公布日期 2015.08.18
申请号 US201213465456 申请日期 2012.05.07
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Hyde, II Joseph Smith;Roy Subhojit
分类号 G06F7/00;G06F17/30;G06F12/08;G06F12/12 主分类号 G06F7/00
代理机构 Griffiths & Seaton PLLC 代理人 Griffiths & Seaton PLLC
主权项 1. A system for improving fingerprint storage mapping caching performance in a computing environment, comprising: a deduplication engine; a fingerprint map having a plurality of entries corresponding to a plurality of data fingerprints, the fingerprint map populated by the deduplication engine; a memory including a cache configured for at least temporarily storing the fingerprint map; and a processing device coupled to the memory; a cache management module in operable communication with the cache; wherein the cache management module is configured for: using reference count information to determine a length of time to retain the plurality of entries of the fingerprint map in the cache, by: examining the reference count information of the plurality of entries of the fingerprint map in the cache and a storage policy related to the plurality of entries to establish a retention duration for the plurality of entries, andquerying whether the reference count information for a data segment has been incremented or whether a predetermined time interval has expired,if the reference count information for a data segment has not been incremented or if the predetermined time interval has not expired, reiterating the step of querying,if the reference count information for a data segment has been incremented or if the predetermined time interval has expired in which no physical activity has been observed on a physical block, re-determining a new appropriate duration of retention in the cache, andwhen the cache is full, retaining in the cache the plurality of entries of the fingerprint map having higher reference counts and removing from the cache the plurality of entries of the fingerprint map having lower reference counts.
地址 Armonk NY US