Title of invention: Data processing performance enhancement in a distributed file system
Abstract: Systems and methods of data processing performance enhancement are disclosed. One embodiment includes invoking operating system calls to optimize cache management by an I/O component, wherein the operating system calls are invoked to perform one or more of: proactive triggering of readaheads for sequential read requests of a disk; purging data out of the buffer cache after writing to the disk or performing sequential reads from the disk; and/or eliminating a delay between when a write is performed and when written data from the write is flushed to the disk from the buffer cache.
Publication number: US9600492(B2)  Publication date: 2017.03.21
Application number: US201615225533  Filing date: 2016.08.01
Applicant: Cloudera, Inc.  Inventor: Lipcon Todd
Classification: G06F7/00;G06F17/30;G06F12/08;G06F13/20  Main classification: G06F7/00
Agency: Perkins Coie LLP  Agent: Perkins Coie LLP
Main claim: 1. A method for enhancing performance for data processing in a distributed file system, the method comprising: instantiating an input/output (I/O) manager on a machine among a plurality of machines that implement the distributed file system; and utilizing the I/O manager to perform cache management optimization selected from the group consisting of (a) deterministically triggering readaheads for all sequential read requests by overriding a heuristic for the readaheads on the machine; (b) instructing the machine to invalidate cached data in a buffer upon detecting that a specific size of data has been automatically cached; and (c) overriding a time delay configured on the machine so that the machine is to commit data from the buffer to a disk without the time delay.
Address: Palo Alto CA US
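
Illustrative sketch (not part of the patent record): the abstract and main claim describe three cache management actions, (a) explicit readahead for sequential reads, (b) dropping data from the buffer cache once it has been written or read, and (c) flushing written data to disk without waiting. The record does not name the operating system calls involved, so the example below assumes POSIX `posix_fadvise` hints and `fsync`, as exposed by Python's `os` module on Unix-like systems. The function names `advise`, `write_and_drop`, and `sequential_read`, and the chunk and readahead sizes, are hypothetical choices made for illustration only.

```python
import os
import tempfile


def advise(fd, offset, length, advice_name):
    """Issue a posix_fadvise hint if the platform supports it.

    Returns True when the hint was issued, False otherwise. The hint is
    advisory: the kernel is free to ignore it.
    """
    advice = getattr(os, advice_name, None)
    if advice is None or not hasattr(os, "posix_fadvise"):
        return False  # e.g. platforms without posix_fadvise
    try:
        os.posix_fadvise(fd, offset, length, advice)
    except OSError:
        return False
    return True


def write_and_drop(path, data):
    """Write data, flush it right away, then ask the kernel to drop it from cache."""
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        view = memoryview(data)
        written = 0
        while written < len(data):
            written += os.write(fd, view[written:])
        # Assumed analogue of (c): commit now instead of waiting for the
        # kernel's periodic writeback.
        os.fsync(fd)
        # Assumed analogue of (b): the pages are clean after fsync, so they
        # can be released (length 0 means "to end of file").
        advise(fd, 0, 0, "POSIX_FADV_DONTNEED")
    finally:
        os.close(fd)
    return written


def sequential_read(path, chunk_size=1 << 20, readahead=4 << 20):
    """Read a file front to back, requesting readahead and dropping consumed pages."""
    fd = os.open(path, os.O_RDONLY)
    total = 0
    try:
        offset = 0
        while True:
            # Assumed analogue of (a): request the next window explicitly
            # rather than relying on the kernel's readahead heuristic.
            advise(fd, offset, readahead, "POSIX_FADV_WILLNEED")
            chunk = os.read(fd, chunk_size)
            if not chunk:
                break
            offset += len(chunk)
            total += len(chunk)
            # Assumed analogue of (b): pages already consumed will not be reused.
            advise(fd, 0, offset, "POSIX_FADV_DONTNEED")
    finally:
        os.close(fd)
    return total


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        target = os.path.join(tmp, "block.dat")
        payload = b"x" * (3 * 1024 * 1024 + 17)
        print("bytes written:", write_and_drop(target, payload))
        print("bytes read:", sequential_read(target))
```

A fuller implementation might flush and drop in fixed-size windows as the write progresses rather than once at the end, which is closer to the "specific size of data" condition in claim element (b); the sketch keeps a single flush for brevity.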