发明名称 GEO-SCALE ANALYTICS WITH BANDWIDTH AND REGULATORY CONSTRAINTS
摘要 Various technologies described herein pertain to controlling geo-scale analytics with bandwidth and regulatory constraints. An analytical query (e.g., a recurrent analytical query, a non-recurrent analytical query, etc.) to be executed over distributed data in data partitions stored in a plurality of data centers can be received. Moreover, a query execution plan for the analytical query can be generated, where the query execution plan includes tasks. Further, replication strategies for the data partitions can be determined. A replication strategy for a particular data partition can specify one or more data centers to which the particular data partition is to be replicated if the particular data partition is to be replicated. The tasks of the query execution plan for the analytical query can further be scheduled to the data centers based on the replication strategies for the data partitions. The analytical query can be part of a workload of analytical queries.
申请公布号 US2016306849(A1) 申请公布日期 2016.10.20
申请号 US201514687450 申请日期 2015.04.15
申请人 Microsoft Technology Licensing, LLC 发明人 Curino Carlo Aldo;Padhye Jitendra Dattatraya;Varghese George;Vulimiri Ashish
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computing system, comprising: at least one processor; and computer-readable storage comprising components, the components being executable by the at least one processor, the components comprising: a query planner component configured to generate a query execution plan for an analytical query to be executed over distributed data in data partitions stored in a plurality of data centers, the query execution plan comprising tasks; anda workload optimization component configured to: determine replication strategies for the data partitions, a replication strategy for a particular data partition specifies one or more data centers to which the particular data partition is to be replicated if the particular data partition is to be replicated; andschedule the tasks of the query execution plan for the analytical query to the data centers based on the replication strategies for the data partitions.
地址 Redmond WA US