发明名称 Managing operational throughput for shared resources
摘要 Usage of shared resources can be managed by enabling users to obtain different types of guarantees at different times for various types and/or levels of resource capacity. A user can select to have an amount or rate of capacity dedicated to that user. A user can also select reserved capacity for at least a portion of the requests, tasks, or program execution for that user, where the user has priority to that capacity but other users can utilize the excess capacity during other periods. Users can alternatively specify to use the excess capacity or other variable, non-guaranteed capacity. The capacity can be for any appropriate functional aspect of a resource, such as computational capacity, throughput, latency, bandwidth, and storage. Users can submit bids for various types and combinations of excess capacity, and winning bids can receive dedicated use of the excess capacity for at least a period of time.
申请公布号 US9374243(B1) 申请公布日期 2016.06.21
申请号 US201213620251 申请日期 2012.09.14
申请人 Amazon Technologies, Inc. 发明人 Certain Tate Andrew;Jain Sachin;Marshall Bradley E.;Maniscalco Nicholas J.;Sivasubramanian Swaminathan;Garman Matthew S.
分类号 G06F15/16;G06T11/20;H04L15/16;G06T15/20 主分类号 G06F15/16
代理机构 Hogan Lovells US LLP 代理人 Hogan Lovells US LLP
主权项 1. A computer-implemented method for managing shared resources, comprising: receiving an instance request that includes a request parameter, the instance request associated with a user of a multi-tenant computing environment, the multi-tenant computing environment including a plurality of client devices in communication via a network with one or more servers and storage devices, the instance request indicating a data set and a performance specification for responding to data requests for the data set, the performance specification indicating a latency target to be met for responding to at least one of the data requests for the data set; determining an amount of latency of processing at least one of the data requests; determining that the amount of latency is greater than the latency target as indicated by the performance specification; identifying another storage device having additional capacity, the another storage device being associated with a resource usage request for the additional capacity; determining whether the request parameter satisfies the resource usage request; in response to determining that the request parameter satisfies the resource usage request, moving at least a subset of the data set from a current storage device to the another storage device; and in response to determining that the request parameter does not satisfy the resource usage request, denying the instance request.
地址 Reno unknown