TY - GEN
T1 - Kerveros
T2 - 17th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2023
AU - Sajal, Sultan Mahmud
AU - Marshall, Luke
AU - Li, Beibin
AU - Zhou, Shandan
AU - Pan, Abhisek
AU - Mellou, Konstantina
AU - Narayanan, Deepak
AU - Zhu, Timothy
AU - Dion, David
AU - Moscibroda, Thomas
AU - Menache, Ishai
N1 - Publisher Copyright:
© OSDI 2023.All rights reserved.
PY - 2023
Y1 - 2023
N2 - The infinite capacity of cloud computing is an illusion: in reality, cloud providers cannot always have enough capacity of the right type, in the right place, at the right time to meet all demand. Consequently, cloud providers need to implement admission-control policies to ensure accepted capacity requests experience high availability. However, admission control in the public cloud is hard due to dynamic changes in both supply and demand: hardware might become unavailable, and actual VM consumption could vary for a variety of reasons including tenant scale-outs and fulfillment of VM reservations made by customers ahead of time. In this paper, we design and implement Kerveros, a flexible admission-control system that has three desired properties: i) high computational scalability to handle a large inventory, ii) accurate capacity provisioning for high VM availability, and iii) good packing efficiency to optimize resource usage. To achieve this, Kerveros uses novel bookkeeping techniques to quickly estimate the capacity available for incoming VM requests. Our system has been deployed in Microsoft Azure. Results from both simulations and production confirm that Kerveros achieves more than four nines of availability while sustaining request processing latencies of a few milliseconds.
AB - The infinite capacity of cloud computing is an illusion: in reality, cloud providers cannot always have enough capacity of the right type, in the right place, at the right time to meet all demand. Consequently, cloud providers need to implement admission-control policies to ensure accepted capacity requests experience high availability. However, admission control in the public cloud is hard due to dynamic changes in both supply and demand: hardware might become unavailable, and actual VM consumption could vary for a variety of reasons including tenant scale-outs and fulfillment of VM reservations made by customers ahead of time. In this paper, we design and implement Kerveros, a flexible admission-control system that has three desired properties: i) high computational scalability to handle a large inventory, ii) accurate capacity provisioning for high VM availability, and iii) good packing efficiency to optimize resource usage. To achieve this, Kerveros uses novel bookkeeping techniques to quickly estimate the capacity available for incoming VM requests. Our system has been deployed in Microsoft Azure. Results from both simulations and production confirm that Kerveros achieves more than four nines of availability while sustaining request processing latencies of a few milliseconds.
UR - http://www.scopus.com/inward/record.url?scp=85180368597&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85180368597&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85180368597
T3 - Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2023
SP - 209
EP - 225
BT - Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2023
PB - USENIX Association
Y2 - 10 July 2023 through 12 July 2023
ER -