发明名称 Outputting map-reduce jobs to an archive file
摘要 Disclosed is a method of outputting map-reduce jobs to an archive file. The method includes, providing an archive manager and exposing an interface to be called from map-reduce jobs to output to an archive file in a map-reduce distributed file system. Using a buffering database as a temporary cache to buffer updates to the archive file. The handling by the archive manager of the calls from map-reduce jobs allows, reading directly from an archive file or from a job index at the buffering database and writing to a job index at the buffering database used as a temporary cache to buffer updates, and outputting updates from a job index to the archive file. The call handling may includes receiving a read call for a task of a map-reduce job, connecting to the buffering database, looking up a unique token for the job in a pending index and a committed index provided by the database, and depending on the status of the job either reading from the archive file or reading from a job index.
申请公布号 GB2530052(A) 申请公布日期 2016.03.16
申请号 GB20140016018 申请日期 2014.09.10
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 NIALL MCCARROLL;CURTIS NORMAN BROWNING
分类号 G06F9/50;G06F17/30 主分类号 G06F9/50
代理机构 代理人
主权项
地址