发明名称 Integrating map-reduce into a distributed relational database
摘要 A computer readable storage medium includes executable instructions to define a map-reduce document that coordinates processing of data in a distributed database. The map-reduce document complies with a map-reduce specification that integrates map-reduce functions with queries in a query language. The operations specified by the map-reduce document are executed in the distributed database.
申请公布号 US9514188(B2) 申请公布日期 2016.12.06
申请号 US201414263641 申请日期 2014.04.28
申请人 Pivotal Software, Inc. 发明人 Cohen Jeffrey Ira;Lonergan Luke;Welton Caleb E.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A distributed database comprising: a plurality of segment hosts each comprising one or more processors; and a master host comprising one or more processors, wherein: the master host is programmed to perform operations comprising: submitting a map-reduce document as an input to a map-reduce program executing on the master host, the map-reducing program configured to cause operations specified in the map-reduce document to be executed in the distributed database system in parallel, wherein the map-reduce document includes comprises an input source and a map-reduce function definition, wherein: the input source includes a query in Structured Query Language (SQL), andthe map-reduce function definition defines, in a scripting language that is different from SQL, a map function to be performed on the input source and a reduce function to be performed on results of the map function; and distributing, using the map-reduce program, the map function and reduce function to the segment hosts as tasks; and each of the segment hosts is programmed to perform the tasks, including executing, as SQL queries, both the map function and reduce function defined in the map-reduce function definition and the query of the input source.
地址 Palo Alto CA US