TY - JOUR
T1 - Agile dynamic provisioning of multi-tier Internet applications
AU - Urgaonkar, Bhuvan
AU - Shenoy, Prashant
AU - Chandra, Abhishek
AU - Goyal, Pawan
AU - Wood, Timothy
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2008/3/1
Y1 - 2008/3/1
N2 - Dynamic capacity provisioning is a useful technique for handling the multi-time-scale variations seen in Internet workloads. In this article, we propose a novel dynamic provisioning technique for multi-tier Internet applications that employs (1) a flexible queuing model to determine how much of the resources to allocate to each tier of the application, and (2) a combination of predictive and reactive methods that determine when to provision these resources, both at large and small time scales. We propose a novel data center architecture based on virtual machine monitors to reduce provisioning overheads. Our experiments on a forty-machine Xen/Linux-based hosting platform demonstrate the responsiveness of our technique in handling dynamic workloads. In one scenario where a flash crowd caused the workload of a three-tier application to double, our technique was able to double the application capacity within five minutes, thus maintaining response-time targets. Our technique also reduced the overhead of switching servers across applications from several minutes to less than a second, while meeting the performance targets of residual sessions.
AB - Dynamic capacity provisioning is a useful technique for handling the multi-time-scale variations seen in Internet workloads. In this article, we propose a novel dynamic provisioning technique for multi-tier Internet applications that employs (1) a flexible queuing model to determine how much of the resources to allocate to each tier of the application, and (2) a combination of predictive and reactive methods that determine when to provision these resources, both at large and small time scales. We propose a novel data center architecture based on virtual machine monitors to reduce provisioning overheads. Our experiments on a forty-machine Xen/Linux-based hosting platform demonstrate the responsiveness of our technique in handling dynamic workloads. In one scenario where a flash crowd caused the workload of a three-tier application to double, our technique was able to double the application capacity within five minutes, thus maintaining response-time targets. Our technique also reduced the overhead of switching servers across applications from several minutes to less than a second, while meeting the performance targets of residual sessions.
UR - http://www.scopus.com/inward/record.url?scp=42149100037&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=42149100037&partnerID=8YFLogxK
U2 - 10.1145/1342171.1342172
DO - 10.1145/1342171.1342172
M3 - Article
AN - SCOPUS:42149100037
SN - 1556-4665
VL - 3
JO - ACM Transactions on Autonomous and Adaptive Systems
JF - ACM Transactions on Autonomous and Adaptive Systems
IS - 1
M1 - 1
ER -