Title of Invention |
Data processing performance enhancement in a distributed file system |
Abstract |
Systems and methods of data processing performance enhancement are disclosed. One embodiment includes invoking operating system calls to optimize cache management by an I/O component, wherein the operating system calls are invoked to perform one or more of: proactive triggering of readaheads for sequential read requests of a disk; purging data out of the buffer cache after writing to the disk or performing sequential reads from the disk; and/or eliminating a delay between when a write is performed and when written data from the write is flushed to the disk from the buffer cache. |
Publication Number |
US9600492(B2) |
Publication Date |
2017.03.21 |
Application Number |
US201615225533 |
Filing Date |
2016.08.01 |
Applicant |
Cloudera, Inc. |
Inventor |
Lipcon Todd |
Classification Codes |
G06F7/00;G06F17/30;G06F12/08;G06F13/20 |
Main Classification Code |
G06F7/00 |
Agency |
Perkins Coie LLP |
Agent |
Perkins Coie LLP |
Main Claim |
1. A method for enhancing performance for data processing in a distributed file system, the method comprising:
instantiating an input/output (I/O) manager on a machine among a plurality of machines that implement the distributed file system; and utilizing the I/O manager to perform cache management optimization selected from the group consisting of (a) deterministically triggering readaheads for all sequential read requests by overriding a heuristic for the readaheads on the machine; (b) instructing the machine to invalidate cached data in a buffer upon detecting that a specific size of data has been automatically cached; and (c) overriding a time delay configured on the machine so that the machine is to commit data from the buffer to a disk without the time delay. |
Address |
Palo Alto CA US |
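
The three cache-management optimizations listed in the abstract and main claim (explicit readahead for sequential reads, purging cached data after it has been written or read, and committing writes without the usual delay) can be illustrated with ordinary operating system calls. The record does not name the calls involved, so the sketch below is an assumption: it uses `posix_fadvise` and `fsync` as exposed by Python's `os` module on Unix-like systems. The function names, chunk size, and readahead window are hypothetical and chosen only for illustration.

```python
import os
import tempfile

CHUNK = 64 * 1024               # hypothetical read size
READAHEAD = 4 * 1024 * 1024     # hypothetical readahead window


def _advise(fd, offset, length, advice_name):
    """Issue a posix_fadvise hint if the platform supports it.

    Hints are advisory, so any failure is ignored.
    """
    if not hasattr(os, "posix_fadvise"):
        return
    try:
        os.posix_fadvise(fd, offset, length, getattr(os, advice_name))
    except OSError:
        pass


def write_and_flush(path, data, drop_cache=True):
    """Write data, commit it to disk right away, then evict it from the cache."""
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        offset = 0
        while offset < len(data):
            offset += os.write(fd, data[offset:])
        # Assumption: fsync stands in for "commit without the time delay".
        os.fsync(fd)
        if drop_cache:
            # Length 0 means "from offset to end of file".
            _advise(fd, 0, 0, "POSIX_FADV_DONTNEED")
    finally:
        os.close(fd)
    return len(data)


def sequential_read(path, readahead=READAHEAD, chunk=CHUNK):
    """Read a file front to back, requesting readahead explicitly
    and dropping pages that have already been consumed."""
    total = 0
    next_hint = 0
    fd = os.open(path, os.O_RDONLY)
    try:
        while True:
            if total >= next_hint:
                # Ask for the next window instead of relying on the
                # kernel's own sequential-access heuristic.
                _advise(fd, total, readahead, "POSIX_FADV_WILLNEED")
                if total > 0:
                    _advise(fd, 0, total, "POSIX_FADV_DONTNEED")
                next_hint = total + readahead
            block = os.read(fd, chunk)
            if not block:
                break
            total += len(block)
    finally:
        os.close(fd)
    return total


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        target = os.path.join(tmp, "block.dat")
        write_and_flush(target, b"\0" * (1024 * 1024))
        print(sequential_read(target))
```

On platforms without `posix_fadvise` the hints are skipped and the functions fall back to plain reads and writes, so the sketch shows the call pattern rather than any guaranteed performance effect.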