Title of invention: Runtime profile determinations of virtual machines for graphics processing unit (GPU) allocation
Abstract: Methods, techniques, and systems for dynamically allocating graphics processing units among virtual machines are provided. Example embodiments provide a dynamic GPU allocation system (“DGAS”), which enables the efficient allocation of physical GPU resources to one or more virtual machines. In one embodiment, the DGAS determines runtime profiles of a plurality of virtual machines. The runtime profiles are determined by determining a runtime assessment of workload indicative of a potential benefit that the virtual machine can receive from using a GPU resource. The DGAS then causes the processing bandwidth of the physical GPU resources to be allocated to some of the plurality of virtual machines based upon the dynamically determined runtime assessments of workload.
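The allocation step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the proportional split, the `min_assessment` cutoff, and the names `allocate_gpu_bandwidth` and `assessments` are all assumptions introduced here for illustration. The record only states that bandwidth is allocated to some of the virtual machines based on their runtime assessments.

```python
from typing import Dict


def allocate_gpu_bandwidth(
    assessments: Dict[str, float],
    total_bandwidth: float = 1.0,
    min_assessment: float = 0.2,
) -> Dict[str, float]:
    """Split GPU processing bandwidth among VMs by runtime assessment.

    Hypothetical policy (not taken from the patent record): VMs whose
    assessment falls below `min_assessment` get no GPU bandwidth, and the
    remaining VMs share `total_bandwidth` in proportion to their assessments.
    """
    # Every VM starts with a zero share so the result covers all VMs.
    shares = {vm: 0.0 for vm in assessments}

    # Only "some of the plurality" of VMs qualify for GPU bandwidth.
    eligible = {vm: a for vm, a in assessments.items() if a >= min_assessment}
    total = sum(eligible.values())
    if total <= 0:
        return shares

    for vm, assessment in eligible.items():
        shares[vm] = total_bandwidth * assessment / total
    return shares
```

For example, calling `allocate_gpu_bandwidth({"vm-a": 0.75, "vm-b": 0.25, "vm-c": 0.05})` gives the first two virtual machines the whole bandwidth in a 3:1 ratio and leaves `vm-c` with none. Because the assessments are determined at run time, a real system would presumably re-run such a step as workloads change.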
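Publication number: US9417929(B2); Publication date: 2016.08.16
Application number: US201213723035; Filing date: 2012.12.20
Applicant: VMware, Inc.; Inventors: Fonseca Jose; Abiezzi Salim
Classification: G06F9/46; G06F9/455; G06F9/50; Main classification: G06F9/46
Agency: (not listed); Agent: (not listed)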
Principal claim: 1. A computer-implemented method in a server computing system for providing runtime assessments of a plurality of virtual machines for use in allocation of physical graphics processing unit (GPU) resources, including processing bandwidth, to one or more of the plurality of virtual machines comprising:
dynamically determining a runtime assessment of workload for each virtual machine of the plurality of virtual machines, the workload runtime assessment indicative of a potential benefit that the virtual machine can receive from using a GPU resource of the physical GPU resources; and
causing processing bandwidth of the physical GPU resources to be allocated to some of the plurality of virtual machines based upon the dynamically determined runtime assessments of workload,
wherein the dynamically determined runtime assessment of workload for each virtual machine determines a combined value of processing parameters that are each indicative of benefit to using a GPU resource, and wherein:
each virtual machine has a video RAM (VRAM) and wherein the processing parameters include a measure of VRAM usage, and wherein the measure of VRAM usage comprises an average ratio of the amount of the VRAM being used by the virtual machine to the total size of the VRAM allocated for the virtual machine;
each virtual machine has a command queue and wherein the processing parameters include a measure of load of the command queue, and wherein the measure of load of the command queue comprises an amount indicative of time the command queue is active;
at least some of the plurality of virtual machines have a processing unit for handling graphics and wherein the processing parameters include a measure of activity on the processing unit for handling graphics, and wherein the measure of activity comprises an amount of time the processing unit for handling graphics is idle over a particular time interval; or
each virtual machine having VRAM, a command queue, and at least one data transfer mechanism, wherein the combined value of processing parameters comprises a combination of a value indicative of an amount of VRAM in use, a value indicative of an amount the command queue is utilized, and a value indicative of an amount of command bandwidth not using the data transfer mechanism.
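The combined value named in the last alternative of the claim can be sketched in Python as follows. This is only one possible reading: the claim does not say how the three values are combined, so the weighted sum and equal default weights are assumptions, as are all names (`VmSample`, `runtime_assessment`, and the field names) and the choice to measure command bandwidth in bytes.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass
class VmSample:
    """Hypothetical measurements for one VM over one sampling interval."""
    vram_used_bytes: Sequence[int]      # periodic samples of VRAM in use
    vram_total_bytes: int               # VRAM allocated to the VM
    queue_active_seconds: float         # time the command queue was active
    interval_seconds: float             # length of the sampling interval
    non_transfer_command_bytes: float   # command traffic not using the data transfer mechanism
    total_command_bytes: float          # all command traffic in the interval


def vram_usage_ratio(s: VmSample) -> float:
    """Average ratio of VRAM in use to total VRAM allocated."""
    if s.vram_total_bytes <= 0 or not s.vram_used_bytes:
        return 0.0
    ratios = [used / s.vram_total_bytes for used in s.vram_used_bytes]
    return sum(ratios) / len(ratios)


def queue_load(s: VmSample) -> float:
    """Fraction of the interval during which the command queue was active."""
    if s.interval_seconds <= 0:
        return 0.0
    return s.queue_active_seconds / s.interval_seconds


def non_transfer_fraction(s: VmSample) -> float:
    """Fraction of command bandwidth not using the data transfer mechanism."""
    if s.total_command_bytes <= 0:
        return 0.0
    return s.non_transfer_command_bytes / s.total_command_bytes


def runtime_assessment(
    s: VmSample,
    weights: Tuple[float, float, float] = (1 / 3, 1 / 3, 1 / 3),
) -> float:
    """Combine the three parameters with a weighted sum (assumed form)."""
    w_vram, w_queue, w_cmd = weights
    return (
        w_vram * vram_usage_ratio(s)
        + w_queue * queue_load(s)
        + w_cmd * non_transfer_fraction(s)
    )
```

With the equal default weights, each parameter lies between 0 and 1 for consistent inputs, so the assessment does too, and a higher value suggests a virtual machine that would benefit more from a GPU resource. The idle-time measure from the third alternative of the claim is left out of this sketch.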
Address: Palo Alto CA US