发明名称 MEMORY-AWARE JOINS BASED IN A DATABASE CLUSTER
摘要 Techniques are described herein for distributing data from one or more partitioned tables across the volatile memories of a cluster. In memory copies of data from partitioned tables are grouped based on the data falling within the same partition criteria. These groups are used for assigning data from corresponding partitions to the same node when distributing data from partitioned tables across the volatile memories of a multi-node cluster. When a query requires a join between rows of partitioned tables, the work for the join query is divided into work granules that correspond to partition-wise join operations. Those partition-wise join operations are assigned to nodes by a query coordinator based on the partition-to-node mapping located in the node of the query coordinator.
申请公布号 US2016026667(A1) 申请公布日期 2016.01.28
申请号 US201514806411 申请日期 2015.07.22
申请人 Oracle International Corporation 发明人 Mukherjee Niloy;Zait Mohamed;Loaiza Juan;Marwah Vineet;Lahiri Tirthankar;Yan Jiaqi;Kulkarni Kartik
分类号 G06F17/30;G06F3/06 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method comprising: prior to receiving a join query that requires work to be performed on a first set of data that belongs to a first partitioned object that resides on-disk, performing the steps of: pre-loading into volatile memory data from the first partitioned object;wherein the first partitioned object includes a first plurality of partitions;wherein pre-loading the first partitioned object includes: mapping the first plurality of partitions to a plurality of partition groups;wherein each of the plurality of partition groups corresponds to corresponding partition criteria; andassigning each partition group of the plurality of partition groups to a corresponding host node of a plurality of host nodes;pre-loading each given partition of the first plurality of partitions into volatile memory of a host node that corresponds to the partition group to which the given partition is mapped; in response to receiving the join query, distributing work required by the join query to the plurality of host nodes based on which partition groups have been assigned to each of the plurality of host nodes.
地址 Redwood Shores CA US