Date: March 01, 2013
In this talk, I will present AutoScale, our capacity management policy, which greatly reduces the number of servers needed by applications driven by unpredictable, time-varying demand while still meeting response time SLAs. AutoScale matches incoming demand by dynamically scaling capacity up or down as needed.
We evaluate AutoScale by implementing it on a 38-server multi-tier data center serving a web site of the type seen at Amazon or Facebook, with a key-value store workload. We find that AutoScale improves upon the static capacity management policy currently used in data centers by up to 50% with respect to power and resources. Furthermore, it vastly improves upon existing dynamic capacity management policies with respect to meeting SLAs and robustness.