Abstract |
A method (600) includes identifying high-availability jobs (122, 122a) and low-availability jobs (122, 122b) that demand usage of resources (110, 112, 114, 116, 422, 424, 426, 432, 434, 436) of a distributed system (100). The method includes determining a first quota (Q1) of the resources available to low-availability jobs as a quantity of the resources available during normal operations, and determining a second quota (Q2) of the resources available to high-availability jobs as a quantity of the resources available during normal operations minus a quantity of the resources lost due to a tolerated event. The method includes executing the jobs on the distributed system and constraining a total usage of the resources by both the high-availability jobs and the low-availability jobs to the quantity of the resources available during normal operations. |
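
The quota rules stated in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the names `Quotas`, `compute_quotas`, and `can_admit`, the use of a single scalar resource quantity, and the admission-check structure are all assumptions made for illustration. Only the three relationships come from the text: Q1 equals the normal-operation quantity, Q2 equals that quantity minus the quantity lost due to a tolerated event, and combined usage is capped at the normal-operation quantity.

```python
from dataclasses import dataclass


@dataclass
class Quotas:
    """Hypothetical container for the two quotas and the shared cap."""
    low_availability: float   # Q1: resources available during normal operations
    high_availability: float  # Q2: normal resources minus the tolerated loss
    total_cap: float          # cap on combined usage by both job classes


def compute_quotas(normal_capacity: float, tolerated_loss: float) -> Quotas:
    """Derive Q1 and Q2 from the normal capacity and the tolerated loss."""
    if not 0 <= tolerated_loss <= normal_capacity:
        raise ValueError("tolerated_loss must lie between 0 and normal_capacity")
    return Quotas(
        low_availability=normal_capacity,
        high_availability=normal_capacity - tolerated_loss,
        total_cap=normal_capacity,
    )


def can_admit(quotas: Quotas, high_usage: float, low_usage: float,
              demand: float, high_availability: bool) -> bool:
    """Check whether a new job's demand fits its class quota and the total cap.

    This admission check is an assumed policy, not one described in the source.
    """
    # Combined usage of both job classes may not exceed the normal capacity.
    if high_usage + low_usage + demand > quotas.total_cap:
        return False
    # The job must also fit within the quota of its own class.
    if high_availability:
        return high_usage + demand <= quotas.high_availability
    return low_usage + demand <= quotas.low_availability
```

For example, with a normal capacity of 100 units and a tolerated loss of 20 units, `compute_quotas(100, 20)` gives Q1 = 100 and Q2 = 80. High-availability jobs can then never claim more than the 80 units that would survive the tolerated event, while low-availability jobs may use whatever remains under the 100-unit cap.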