TY - GEN
T1 - SOFTScale
T2 - 13th ACM/IFIP/USENIX International Middleware Conference, Middleware 2012
AU - Gandhi, Anshul
AU - Zhu, Timothy
AU - Harchol-Balter, Mor
AU - Kozuch, Michael A.
PY - 2012
Y1 - 2012
N2 - Dynamic capacity provisioning is a well studied approach to handling gradual changes in data center load. However, abrupt spikes in load are still problematic in that the work in the system rises very quickly during the setup time needed to turn on additional capacity. Performance can be severely affected even if it takes only 5 seconds to bring additional capacity online. In this paper, we propose SOFTScale, an approach to handling load spikes in multi-tier data centers without having to over-provision resources. SOFTScale works by opportunistically stealing resources from other tiers to alleviate the bottleneck tier, even when the tiers are carefully provisioned at capacity. SOFTScale is especially useful during the transient overload periods when additional capacity is being brought online. Via implementation on a 28-server multi-tier testbed, we investigate a range of possible load spikes, including an artificial doubling or tripling of load, as well as large spikes in real traces. We find that SOFTScale can meet our stringent 95th percentile response time Service Level Agreement goal of 500ms without using any additional resources even under some extreme load spikes that would normally cause the system (without SOFTScale) to exhibit response times as high as 96 seconds.
AB - Dynamic capacity provisioning is a well studied approach to handling gradual changes in data center load. However, abrupt spikes in load are still problematic in that the work in the system rises very quickly during the setup time needed to turn on additional capacity. Performance can be severely affected even if it takes only 5 seconds to bring additional capacity online. In this paper, we propose SOFTScale, an approach to handling load spikes in multi-tier data centers without having to over-provision resources. SOFTScale works by opportunistically stealing resources from other tiers to alleviate the bottleneck tier, even when the tiers are carefully provisioned at capacity. SOFTScale is especially useful during the transient overload periods when additional capacity is being brought online. Via implementation on a 28-server multi-tier testbed, we investigate a range of possible load spikes, including an artificial doubling or tripling of load, as well as large spikes in real traces. We find that SOFTScale can meet our stringent 95th percentile response time Service Level Agreement goal of 500ms without using any additional resources even under some extreme load spikes that would normally cause the system (without SOFTScale) to exhibit response times as high as 96 seconds.
UR - http://www.scopus.com/inward/record.url?scp=84869838780&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84869838780&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-35170-9_8
DO - 10.1007/978-3-642-35170-9_8
M3 - Conference contribution
AN - SCOPUS:84869838780
SN - 9783642351693
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 142
EP - 163
BT - Middleware 2012 - ACM/IFIP/USENIX 13th International Middleware Conference, Proceedings
PB - Springer Verlag
Y2 - 3 December 2012 through 7 December 2012
ER -