Title Intelligent caching
Abstract Disclosed are methods, systems, paradigms and structures for managing cache memory in computer systems. Certain caching techniques anticipate queries and cache the data that may be required by the anticipated queries. The queries are predicted based on previously executed queries. The features of the previously executed queries are extracted and correlated to identify a usage pattern of the features. A prediction model predicts queries based on the identified usage pattern of the features. The disclosed method includes purging data from the cache based on predefined eviction policies that are influenced by the predicted queries. The disclosed method supports caching time series data. The disclosed system includes a storage unit that stores previously executed queries and features of the queries.
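The prediction step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the query shape (a dict with `metric`, `start`, `end`), the choice of features, and the frequency-count "model" are all assumptions made for illustration. The sketch extracts features from previously executed queries, counts how often each feature combination recurs, and emits the most common combinations as anticipated queries anchored at the current time.

```python
from collections import Counter


def extract_features(query):
    """Return hashable features of a query (hypothetical feature choice).

    The window length is a value derived from the actual start/end
    timestamps, so queries over different absolute ranges can still
    share a usage pattern.
    """
    return (query["metric"], query["end"] - query["start"])


def predict_queries(history, now, top_n=2):
    """Predict future queries from the most frequent feature combinations."""
    usage = Counter(extract_features(q) for q in history)
    return [
        {"metric": metric, "start": now - window, "end": now}
        for (metric, window), _count in usage.most_common(top_n)
    ]


# Hypothetical history of previously executed queries.
history = [
    {"metric": "cpu", "start": 0, "end": 60},
    {"metric": "cpu", "start": 60, "end": 120},
    {"metric": "mem", "start": 0, "end": 300},
]
predicted = predict_queries(history, now=180, top_n=1)
```

Here the two `cpu` queries share a 60-second window, so the sketch anticipates another 60-second `cpu` query ending at the current time. A real system would presumably use richer features and a learned model rather than a simple count.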
Publication No. US9507718(B2) Publication Date 2016.11.29
Application No. US201313864016 Filing Date 2013.04.16
Applicant Facebook, Inc. Inventors Rash Samuel;Williamson Timothy
Classification G06F12/08;G06F12/12 Main Classification G06F12/08
Agency Perkins Coie LLP Agent Perkins Coie LLP
Main Claim 1. A method comprising: generating, at a computer system and in response to receiving a specified query from a client, a first read request for obtaining a result of the specified query from a storage system; extracting, from a plurality of candidate queries and at the computer system, a plurality of features of the candidate queries, wherein the features are characteristics of a query; correlating the features of each of the candidate queries to identify a usage pattern of the features, the correlating including: identifying a specified feature of the features based on a derived value of the specified feature, the derived value being derived from an actual value of the specified feature, the actual value and the specified feature specified in one or more of the candidate queries; predicting, based on the usage pattern of the features, a set of queries to be received at the computer system in the future; executing a query of the set of queries to obtain data corresponding to the query from the storage system, the data including time series data, wherein the data is stored at a first granularity level in the storage system, wherein executing the query includes: generating a second read request to obtain the data corresponding to the query, combining the first read request and the second read request to generate a combined read request, and executing the combined read request at the storage system to obtain the result of the specified query and the data corresponding to the query; determining, based on the predicting, a second granularity level at which the data is to be cached, the second granularity level being different from the first granularity level; processing the data from the first granularity level to the second granularity level to generate processed data; and updating a cache of the computer system with the processed data, the updating to be performed before any of the set of queries is received at the computer system.
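The combined read and the change of granularity recited in the claim can be sketched as follows. This is an illustrative reading under stated assumptions, not the claimed implementation: the storage system is modelled as an in-memory list of `(metric, timestamp, value)` rows, a read request is a hypothetical `(metric, start, end)` tuple, "combining" is modelled as serving both requests in a single scan, and the roll-up from the first to the second granularity level is done by summing values per bucket.

```python
from collections import defaultdict


def execute_combined(storage, requests):
    """Serve several (metric, start, end) read requests in one pass."""
    results = {req: [] for req in requests}
    for metric, ts, value in storage:  # single scan stands in for one combined read
        for req in requests:
            req_metric, start, end = req
            if metric == req_metric and start <= ts < end:
                results[req].append((ts, value))
    return results


def rollup(points, bucket_seconds):
    """Aggregate (timestamp, value) points into coarser buckets by summing."""
    buckets = defaultdict(float)
    for ts, value in points:
        buckets[ts - ts % bucket_seconds] += value
    return sorted(buckets.items())


# Hypothetical storage holding data at a 30-second granularity.
storage = [("cpu", 0, 1.0), ("cpu", 30, 2.0), ("cpu", 60, 3.0), ("cpu", 90, 4.0)]

client_request = ("cpu", 0, 60)      # first read request (specified query)
predicted_request = ("cpu", 0, 120)  # second read request (predicted query)

results = execute_combined(storage, [client_request, predicted_request])
client_result = results[client_request]

# Cache the predicted data at a coarser, 60-second granularity before
# the predicted query arrives.
cache = {predicted_request: rollup(results[predicted_request], 60)}
```

The client's result is returned at the stored granularity, while the cache entry for the predicted query holds two 60-second buckets instead of four 30-second points. Summation is only one possible aggregation; averages or maxima would fit the same structure.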
Address Menlo Park CA US