Abstract |
Various exemplary embodiments relate to a method and related network node including one or more of the following: determining first server dynamics associated with a first server instance, wherein the first server dynamics are indicative of a current performance of the first server instance; determining second server dynamics associated with a second server instance, wherein the second server dynamics are indicative of a current performance of the second server instance; determining, based on the first server dynamics, a current operating mode of the first server instance; determining, based on the second server dynamics, a current operating mode of the second server instance; scaling up with respect to the first server instance based on the first current operating mode indicating that the server instance is oversaturated; and scaling down with respect to the second server instance based on the second current operating mode indicating that the server instance is undersaturated. |
Main claim |
1. A method performed by a cloud application scaling controller for providing dynamic scaling, the method comprising:
determining, by the cloud application scaling controller, first server dynamics associated with a first server instance, wherein the first server dynamics are indicative of a current performance of the first server instance, wherein the first server dynamics comprise a first arriving requests metric and a first processed requests metric;
determining, by the cloud application scaling controller, second server dynamics associated with a second server instance, wherein the second server dynamics are indicative of a current performance of the second server instance, wherein the second server dynamics comprise a second arriving requests metric and a second processed requests metric;
determining, based on the first arriving requests metric and the first processed requests metric, a current operating mode of the first server instance;
determining, based on the second arriving requests metric and the second processed requests metric, a current operating mode of the second server instance;
scaling up with respect to the first server instance based on the first current operating mode indicating that the server instance is oversaturated; and
scaling down with respect to the second server instance based on the second current operating mode indicating that the server instance is undersaturated. |
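
The steps recited in claim 1 can be illustrated with a minimal Python sketch. The claim does not state how an operating mode is derived from the arriving requests metric and the processed requests metric, so the rule below is only an assumption for illustration: an instance is treated as oversaturated when arrivals exceed processed requests by more than a tolerance, and as undersaturated when arrivals fall below a low-water mark. The names `ServerDynamics`, `Mode`, `operating_mode`, `scaling_actions`, and the parameters `tolerance` and `low_water` are hypothetical and do not come from the source.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    UNDERSATURATED = "undersaturated"
    NORMAL = "normal"
    OVERSATURATED = "oversaturated"


@dataclass
class ServerDynamics:
    """Per-instance metrics named in the claim (units are an assumption)."""
    arriving_requests: float   # requests arriving per measurement interval
    processed_requests: float  # requests processed per measurement interval


def operating_mode(d: ServerDynamics,
                   tolerance: float = 0.05,
                   low_water: float = 100.0) -> Mode:
    """Classify an instance from its two metrics.

    Assumed rule (not from the source): arrivals outpacing processing by
    more than `tolerance` means a backlog is building (oversaturated);
    arrivals below `low_water` mean the instance is mostly idle.
    """
    if d.arriving_requests > d.processed_requests * (1.0 + tolerance):
        return Mode.OVERSATURATED
    if d.arriving_requests < low_water:
        return Mode.UNDERSATURATED
    return Mode.NORMAL


def scaling_actions(dynamics: dict[str, ServerDynamics]) -> dict[str, str]:
    """Map each instance to a scaling action based on its operating mode."""
    actions = {}
    for name, d in dynamics.items():
        mode = operating_mode(d)
        if mode is Mode.OVERSATURATED:
            actions[name] = "scale_up"
        elif mode is Mode.UNDERSATURATED:
            actions[name] = "scale_down"
        else:
            actions[name] = "none"
    return actions


if __name__ == "__main__":
    observed = {
        "first": ServerDynamics(arriving_requests=1200, processed_requests=900),
        "second": ServerDynamics(arriving_requests=40, processed_requests=40),
    }
    print(scaling_actions(observed))
```

In this sketch the first instance receives more requests than it processes and is scaled up, while the second instance keeps pace with a very light load and is scaled down. A real controller would likely smooth the metrics over several intervals before acting; that detail is outside what the claim recites.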