Title INTELLIGENT DEDUPLICATION DATA PREFETCHING
Abstract Deduplication dictionaries are used to maintain data chunk identifier and location pairings in a deduplication system. When access to a particular data chunk is requested, a deduplication dictionary is accessed to determine the location of the data chunk and a datastore is accessed to retrieve the data chunk. However, deduplication dictionaries are large and typically maintained on disk, so dictionary access is expensive. Techniques and mechanisms of the present invention allow prefetches or read aheads of datastore (DS) headers. For example, if a dictionary hit results in datastore DS(X), then headers for DS(X+1), DS(X+2), ..., DS(X+read-ahead-window) are prefetched. These datastore headers are cached in memory and indexed by datastore identifier. Before going to the dictionary, a lookup is first performed in the cached headers to reduce deduplication data access request latency.
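The read-ahead described in the abstract can be sketched roughly as follows. This is a minimal in-memory illustration, not the patented implementation: the class name `DedupReader`, the representation of a datastore header as a set of chunk identifiers, the LRU eviction policy, the `cache_size` parameter, and the choice to cache the header of DS(X) itself alongside the prefetched ones are all assumptions made for the example.

```python
from collections import OrderedDict


class DedupReader:
    """Toy model of header read-ahead (all names here are hypothetical).

    dictionary: chunk id -> datastore id (stands in for the on-disk dictionary)
    headers:    datastore id -> set of chunk ids held by that datastore
    """

    def __init__(self, dictionary, headers, read_ahead_window=2, cache_size=8):
        self.dictionary = dictionary
        self.headers = headers
        self.read_ahead_window = read_ahead_window
        self.cache_size = cache_size          # assumed bound on cached headers
        self.header_cache = OrderedDict()     # datastore id -> header
        self.dictionary_lookups = 0           # counts expensive dictionary accesses

    def _cache_header(self, ds_id):
        # Skip datastore ids that do not exist (e.g. past the last datastore).
        if ds_id not in self.headers:
            return
        self.header_cache[ds_id] = self.headers[ds_id]
        self.header_cache.move_to_end(ds_id)
        # Assumed policy: evict the oldest cached header when over capacity.
        while len(self.header_cache) > self.cache_size:
            self.header_cache.popitem(last=False)

    def locate(self, chunk_id):
        """Return the datastore id holding chunk_id."""
        # Step 1: look in the cached headers before going to the dictionary.
        for ds_id, header in self.header_cache.items():
            if chunk_id in header:
                return ds_id
        # Step 2: cache miss, so pay for a dictionary access.
        self.dictionary_lookups += 1
        ds_id = self.dictionary[chunk_id]
        # Step 3: cache DS(X) and prefetch DS(X+1) .. DS(X+read_ahead_window).
        for offset in range(self.read_ahead_window + 1):
            self._cache_header(ds_id + offset)
        return ds_id
```

With four datastores numbered 0 to 3 and a read-ahead window of 2, a first request that resolves to datastore 0 goes through the dictionary and caches the headers for datastores 0, 1 and 2. A later request for a chunk in datastore 2 is then answered from the cached headers, and `dictionary_lookups` stays at 1.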
Publication No. US2014310251(A1) Publication Date 2014.10.16
Application No. US201414311485 Filing Date 2014.06.23
Applicant Dell Products L.P. Inventors Jayaraman Vinod;Bolla Ratna Manoj
Classification G06F17/30 Main Classification G06F17/30
Agency Agent
Main Claim 1. A method, comprising: receiving a first input/output (I/O) request at a deduplication system node; performing a first lookup in cache before accessing a deduplication dictionary; accessing the deduplication dictionary to determine a first location having data corresponding to the read I/O request; maintaining in cache a plurality of headers for a plurality of datastores related to the first location, wherein cache is configured to maintain the plurality of headers for the plurality of datastores related to the first location upon receiving the read I/O request associated with a single datastore at the first location; receiving a second I/O request at the deduplication system node; performing a second lookup in cache before accessing the deduplication dictionary; responding to the second I/O request without accessing the deduplication dictionary.
Address Round Rock TX US