发明名称 Method and system of mapreduce implementations on indexed datasets in a distributed database environment
摘要 In one exemplary embodiment, a method of a distributed database system includes the step receiving a query in a query language from a client with a distributed database system. An index that matches the query is located. The index is pre-generated from a database table in the distributed database system. A map function of a MapReduce programming model is implemented using the index. A reduce function of the MapReduce programming model is implemented using the output of the map function. Optionally, a finalize function can be implemented using the output of the reduce function. The distributed database system can be a scalable NoSQL database. The reduce function can be optional when the value of the output of the map function is guaranteed to be unique.
申请公布号 US8775464(B2) 申请公布日期 2014.07.08
申请号 US201213653411 申请日期 2012.10.17
申请人 发明人 Bulkowski Brian J.;Srinivasan Srini V.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method in a distributed database system comprising: receiving a query in a query language from a client with a distributed database system, wherein the distributed database system comprises a scalable Not Only Structured Query Language (NoSQL) database implemented with a shared-nothing architecture; locating an index that matches the query, wherein the index is pre-generated from a database table in the distributed database system, wherein the query comprises a WHERE clause, and wherein the index comprises a data structure created using one or more columns of the database table based on the WHERE clause; implementing a map function of a MapReduce programming model using the index; implementing a reduce function of the MapReduce programming model using the output of the map function, wherein the reduce function is not performed when the value of the output of the map function is unique; and implementing a finalize function using the output of the reduce function, wherein the finalize function obtains a list of key/value pairs of a reduce function output and returns a finalized value and returns another list of key/value pairs as an output.
地址