TY - GEN
T1 - AutoBurst
T2 - 15th Annual ACM Symposium on Cloud Computing, SoCC 2024
AU - Hasan, Rubaba
AU - Zhu, Timothy
AU - Urgaonkar, Bhuvan
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/11/20
Y1 - 2024/11/20
N2 - Burstable instances provide a low-cost option for consumers using the public cloud, but they come with significant resource limitations. They can be viewed as "fractional instances"where one receives a fraction of the compute and memory capacity at a fraction of the cost of regular instances. The fractional compute is achieved via rate limiting, where a unique characteristic of the rate limiting is that it allows for the CPU to burst to 100% utilization for limited periods of time. Prior research has shown how this ability to burst can be used to serve specific roles such as a cache backup and handling flash crowds. Our work provides a general-purpose approach to meeting latency SLOs via this burst capability while optimizing for cost. AutoBurst is able to achieve this by controlling both the number of burstable and regular instances along with how/when they are used. Evaluations show that our system is able to reduce cost by up to 25% over the state-of-the-art while maintaining latency SLOs.
AB - Burstable instances provide a low-cost option for consumers using the public cloud, but they come with significant resource limitations. They can be viewed as "fractional instances"where one receives a fraction of the compute and memory capacity at a fraction of the cost of regular instances. The fractional compute is achieved via rate limiting, where a unique characteristic of the rate limiting is that it allows for the CPU to burst to 100% utilization for limited periods of time. Prior research has shown how this ability to burst can be used to serve specific roles such as a cache backup and handling flash crowds. Our work provides a general-purpose approach to meeting latency SLOs via this burst capability while optimizing for cost. AutoBurst is able to achieve this by controlling both the number of burstable and regular instances along with how/when they are used. Evaluations show that our system is able to reduce cost by up to 25% over the state-of-the-art while maintaining latency SLOs.
UR - http://www.scopus.com/inward/record.url?scp=85215505328&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85215505328&partnerID=8YFLogxK
U2 - 10.1145/3698038.3698530
DO - 10.1145/3698038.3698530
M3 - Conference contribution
AN - SCOPUS:85215505328
T3 - SoCC 2024 - Proceedings of the 2024 ACM Symposium on Cloud Computing
SP - 243
EP - 258
BT - SoCC 2024 - Proceedings of the 2024 ACM Symposium on Cloud Computing
PB - Association for Computing Machinery, Inc
Y2 - 20 November 2024 through 22 November 2024
ER -