Title of invention: Apparatus and method for securing preliminary information about database fragments for utilization in mapreduce processing
Abstract: A method of processing a query in a distributed database implemented across a set of nodes includes receiving a query. The query is divided into split characterization queries. The split characterization queries are distributed to worker nodes. Each worker node stores a partition of the distributed database with encoded textual objects and pre-defined indices characterizing encoded textual object fragments. The split characterization queries are executed at the worker nodes to obtain preliminary information about query results. Executing the split characterization queries includes matching query fragments associated with the split characterization queries with encoded textual object fragments of the pre-defined indices to produce fragment matches representative of the size of the query results. For each split characterization query the preliminary information about query results includes a fragment count, a database partition identification, and a database host name.
Publication number: US8892599(B2)  Publication date: 2014.11.18
Application number: US201213659775  Filing date: 2012.10.24
Applicant: MarkLogic Corporation  Inventors: Lindblad Christopher; Chen Jane X.
Classification: G06F17/30  Main classification: G06F17/30
Agency: Cooley LLP  Agent: Cooley LLP
Main claim: 1. A method of processing a query in a distributed database implemented across a plurality of nodes, comprising: receiving a query; dividing the query into split characterization queries; distributing the split characterization queries to a plurality of worker nodes, wherein each worker node stores a partition of the distributed database as a database with encoded textual objects and pre-defined indices characterizing encoded textual object fragments; and executing the split characterization queries at the plurality of worker nodes to obtain preliminary information about query results, wherein executing the split characterization queries includes matching query fragments associated with the split characterization queries with encoded textual object fragments to produce fragment matches representative of the size of the query results, wherein for each split characterization query the preliminary information about query results includes a fragment count, a database partition identification, and a database host name.
Address: San Carlos CA US
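
Illustrative sketch (not part of the patent record): the flow described in the abstract and main claim can be approximated with a small in-memory model. This is only a sketch under stated assumptions, not the patented implementation. Every name here (SplitInfo, WorkerNode, split_query, characterize_query) is hypothetical, each worker's pre-defined index is assumed to be a simple mapping from a term to the set of fragment ids containing it, and a "match" is assumed to mean that a fragment contains all query terms.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence, Set, Tuple


@dataclass(frozen=True)
class SplitInfo:
    """Preliminary information returned for one split characterization query."""
    fragment_count: int
    partition_id: str
    host_name: str


@dataclass
class WorkerNode:
    """Hypothetical worker holding one partition and its pre-built index."""
    host_name: str
    partition_id: str
    index: Dict[str, Set[int]]  # assumed layout: term -> ids of fragments containing it

    def characterize(self, terms: Sequence[str]) -> SplitInfo:
        # Consult only the index; no fragment content is read.
        postings = [self.index.get(term, set()) for term in terms]
        matches = set.intersection(*postings) if postings else set()
        return SplitInfo(len(matches), self.partition_id, self.host_name)


def split_query(terms: Sequence[str],
                workers: Sequence[WorkerNode]) -> List[Tuple[WorkerNode, Tuple[str, ...]]]:
    # Assumption: one split characterization query per partition, same terms for each.
    return [(worker, tuple(terms)) for worker in workers]


def characterize_query(terms: Sequence[str],
                       workers: Sequence[WorkerNode]) -> List[SplitInfo]:
    # "Distribute" and execute each split characterization query at its worker.
    return [worker.characterize(q) for worker, q in split_query(terms, workers)]


if __name__ == "__main__":
    workers = [
        WorkerNode("host-a", "p1", {"map": {1, 2, 3}, "reduce": {2, 3}}),
        WorkerNode("host-b", "p2", {"map": {7}}),
    ]
    for info in characterize_query(["map", "reduce"], workers):
        print(info)
```

In this model, a coordinator could use the returned fragment counts, partition identifiers, and host names to decide how to size and place input splits before a MapReduce job runs; that scheduling step is outside the sketch.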