Title of invention Data growth balancing
Abstract A distributed database management system maintains logical collections of data. The collections may grow at varying rates, and may place storage and workload processing demands on the system that change over time. A value representative of the current maximum of capacity utilization for computing nodes in the system may be calculated. An estimate of the number of days for each computing node to reach the current maximum may be calculated, and compared to an average number of days. Computing nodes whose number of days is above the average may be selected to receive new logical collections of data.
Publication number US9235611(B1) Publication date 2016.01.12
Application number US201313912042 Filing date 2013.06.06
Applicant Amazon Technologies, Inc. Inventors Murray Edward Paul; Gupta Shireesh Prakash
Classification G06F17/30 Main classification G06F17/30
Agency Baker & Hostetler LLP Agent Baker & Hostetler LLP
Main claim 1. A database management system comprising: a plurality of computing nodes, each computing node configured to store logical collections of data on one or more storage devices; and one or more memories having stored thereon computer-readable instructions that, upon execution, cause the system at least to: determine a maximum of current capacity utilization values for the plurality of computing nodes, wherein a capacity utilization value is based at least in part on storage capacity utilized by a computing node storing logical collections of data; determine an average length of time for the capacity utilization values of computing nodes in the plurality of computing nodes to reach the maximum of the current capacity utilization values; assign one or more computing nodes in the plurality of computing nodes to a subset of the plurality of computing nodes based at least in part on a length of time for the capacity utilization value of the one or more computing nodes to reach the maximum of current capacity utilization values exceeding the average length of time; and store a logical collection on one of the subset of computing nodes.
Address Seattle, WA, US