210 likes | 225 Views
Explore dynamic resource allocation for data centers using online measurements to enhance performance, meet QoS goals, and optimize resource utilization. The study focuses on system metrics, monitoring, prediction, and adaptation.
E N D
Dynamic Resource Allocation for Shared Data Centers Using Online Measurements Abhishek Chandra Weibo Gong Prashant Shenoy UMASS Amherst http://lass.cs.umass.edu/projects/shop
Motivation • Data Centers • Server farms • Rent computing and storage resources to applications • Revenue for meeting QoS guarantees • Goals: • Satisfy application QoS guarantees • Maximize resource utilization of platform • Robustness against “Slashdot” effects
Dynamic Resource Allocation • Periodically re-allocate resources among applications • Estimate resource requirements for near future • Challenges: • Reallocation at short time-scales • No prior workload profiling/knowledge • Low overhead • Approach: Online Measurement-based Allocation
Talk Outline • Motivation • System Model • Dynamic Allocation Techniques • Experimental Results • Conclusions
Resource Model • Queuing System • Generalized Processor Sharing (GPS) scheduler • Request classes • Different arrival processes, service time distributions • QoS Goal: Mean Response Time Resource GPS
Expected Load APPLICATION MODELS PREDICTOR Measured Usage Rsrc Reqmts MONITOR ALLOCATOR System Metrics Resource Shares RESOURCE Dynamic Resource Allocation
Expected Load PREDICTOR Measured Usage MONITOR System Metrics RESOURCE Dynamic Resource Allocation APPLICATION MODELS ALLOCATOR
Adaptation Window History Measurement Interval Monitoring • Measure system and application metrics • Queue lengths • Request response times • Monitoring windows Time
Mean AR(1) Last value History Adaptation Window Prediction • Short-term prediction of workload characteristics • Request arrival rate • Average service time • Use history of measured system metrics
Workload Prediction Prediction Error Time (min) Prediction Accuracy
Expected Load APPLICATION MODELS Rsrc Reqmts Resource Shares RESOURCE Dynamic Resource Allocation PREDICTOR MONITOR ALLOCATOR
Measurement-based Model • Goal: Relate QoS metric to resource requirement • Idea: Model parameterized by online measurements • Advantages: • Parameters do not need to be computed • Allow adaptation to dynamic workload • Proposed: Transient Queuing System Description
Transient Queuing Model • Transient queuing behavior over adaptation window • Relation between mean response time T¯ and application share w • Little’s Law: • Relation is parameterized by the measured workload • Arrival rateλand mean service time s¯
Resource Allocation: Utility Model • Discontent function: Measures the QoS violations of an application • Constrained Optimization problem u1 Optimization u2
Discontent Di Goal Response Time subject to Constrained Optimization Formulation • Non-linear Optimization Problem: • Solved using Lagrange multiplier method
Talk Outline • Motivation • System Model • Dynamic Allocation Techniques • Experimental Results • Conclusions
Experimental Setup • Simulation experiments • Soccer World Cup’98 Traces • Results based on a 24-hour portion of the trace • 755,000 requests • Mean req rate: 8.7 req/sec • Mean req size: 8.47 KB
Workloads Share Allocation Adaptation to Transient Overloads Shares adapt to changing workload characteristics
Adaptation: System Discontent GPS without reallocation GPS with reallocation System Discontent is lowered substantially
Conclusions • Dynamic Resource Allocation needed for data centers • Measurement-based allocation: • Monitoring and Prediction gather online state • Use this state for application modeling and allocation • Future Work: • Prediction policies • Utility functions http://lass.cs.umass.edu/projects/shop
Related Work • Prediction • Statistical Prediction Models [Zhang00] • Application Models • Queuing-Theoretic Models [Carlstrom02,Liu01] • Control-Theoretic Models [Abdelzaher02,Lu01] • Data Centers • MUSE [Chase01] • COD [Moore02] • Oceano [Appleby01]