Intelligent Placement of Datacenters for Internet Services
Íñigo Goiri, Kien Le, Jordi Guitart, Jordi Torres, and Ricardo Bianchini
Motivation
• Internet services require thousands of servers
• Use multiple “mirror” datacenters
  • High availability and fault tolerance
  • Low response time
• Spend millions building and operating datacenters
• Consume enormous amounts of brown energy
Datacenter construction costs • Each datacenter costs >$100M to construct • The smaller datacenters are rated at ~25MW • Examples: • Microsoft DCs in Virginia & Chicago: $500M each
Energy costs and carbon emissions Sources: [Qureshi’09], EPA
Intelligent Placement of Datacenters Goal: Manage the monetary and environmental costs • Define framework • Model costs and datacenter characteristics • Define optimization problem • Create solution approaches • Collect cost and location-related data • Create placement tool
Outline • Motivation • Placing datacenters • Evaluation • Conclusion
Selecting datacenter locations
• Model datacenter placement
  • Network latencies
  • Availability
  • CAPEX costs
    • Distance to electricity and networking infrastructure
    • Land and construction (maximum PUE)
    • Power delivery, cooling, backup equipment
    • Servers and networking equipment
  • OPEX costs
    • Maintenance and administration
    • Electricity and water prices (average PUE)
  • Incentives (taxes)
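The cost components listed above can be combined into a single monthly figure per location. The following is a minimal sketch of one way to do that, not the model from the talk: the `Site` fields, the amortization periods, the per-server power and price, and the way incentives are applied are all illustrative assumptions.

```python
from dataclasses import dataclass

HOURS_PER_MONTH = 730  # average hours in a month (8760 / 12)


@dataclass
class Site:
    """Hypothetical per-location inputs; every field is an illustrative assumption."""
    build_cost_per_kw: float      # land, construction, power/cooling gear ($ per kW)
    connect_cost: float           # bringing power and network lines to the site ($)
    max_pue: float                # worst-case PUE, used to size the infrastructure
    avg_pue: float                # average PUE, used for the energy bill
    electricity_price: float      # $ per kWh
    staff_cost_per_month: float   # maintenance and administration ($ per month)
    tax_incentive: float = 0.0    # assumed fraction of building CAPEX offset by incentives


def monthly_cost(site, servers, server_kw=0.3, server_price=2000.0,
                 building_months=144, server_months=48):
    """Amortized CAPEX plus OPEX, in dollars per month (all defaults are assumptions)."""
    it_kw = servers * server_kw
    # Infrastructure is provisioned for the worst case, hence maximum PUE.
    building_capex = site.connect_cost + site.build_cost_per_kw * it_kw * site.max_pue
    building_capex *= 1.0 - site.tax_incentive
    server_capex = servers * server_price
    capex = building_capex / building_months + server_capex / server_months
    # The energy bill follows typical conditions, hence average PUE.
    energy = it_kw * site.avg_pue * HOURS_PER_MONTH * site.electricity_price
    opex = energy + site.staff_cost_per_month
    return capex + opex
```

Water prices and the distance-dependent part of the connection cost are folded into single numbers here for brevity; a fuller model would break them out.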
Formulating the problem
• Goal
  • Minimize CAPEX and OPEX
• Constraints
  • Response times < MAX LATENCY for all users
  • Min consistency delay between 2 DCs < MAX DELAY
  • Min system availability > MIN AVAILABILITY
• Output
  • Number of servers at each location
  • Minimum cost
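Written out symbolically, the problem above might look as follows. This is a literal reading of the slide rather than the authors' exact formulation; the symbols are assumptions introduced here: D is the set of candidate locations, U the set of user populations, x_d the number of servers placed at location d, and S a total server requirement (the slide does not state how the total is fixed).

```latex
\begin{aligned}
\min_{x}\quad & \sum_{d \in D} \bigl[\mathrm{CAPEX}_d(x_d) + \mathrm{OPEX}_d(x_d)\bigr] \\
\text{s.t.}\quad & \mathrm{latency}(u, x) < \mathrm{MAX\ LATENCY} \quad \forall u \in U \\
& \min_{d \neq d',\; x_d > 0,\; x_{d'} > 0} \mathrm{delay}(d, d') < \mathrm{MAX\ DELAY} \\
& \mathrm{availability}(x) > \mathrm{MIN\ AVAILABILITY} \\
& \sum_{d \in D} x_d = S, \qquad x_d \in \mathbb{Z}_{\ge 0}
\end{aligned}
```

The cost terms are non-linear in x_d (for example, opening a location at all incurs construction costs), which is why a plain linear program is not enough.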
Solving the (non-linear) problem
• Linear Programming
  • Does not support non-linear costs
• Brute force
  • Too slow
• Simple heuristics
  • May not produce accurate results efficiently
Our approach for solving the problem
• Evaluate each potential solution
  • Quickly via Linear Programming (LP)
• Consider neighboring configurations
  • Simulated annealing (SA)
• Cost optimization process
  • Combine SA and LP
[Diagram: SA, with one LP evaluating the current solution and one LP evaluating a near neighbor]
Our approach for solving the problem
[Diagram: successive SA steps, each comparing LP-evaluated solutions. Costs shown, in order: $10.3M/month, $13.8M/month, $9.2M/month, $10.7M/month]
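The SA-plus-LP combination can be sketched roughly as below. This is an illustration under stated assumptions, not the authors' tool: the outer loop is a textbook simulated annealing over which sites are open, and the inner evaluation handles only a simplified LP (linear per-server cost, a capacity per site, and a fixed cost per open site). For that special case a cheapest-first greedy fill is optimal, so it stands in for a general LP solver. The dictionary keys `unit_cost`, `capacity`, and `fixed_cost` are hypothetical names.

```python
import math
import random


def evaluate(open_sites, sites, total_servers):
    """Inner step: cheapest server allocation for a fixed set of open sites.

    With the open set fixed, the problem is the LP
        min sum(unit_cost[i] * x[i])
        s.t. sum(x) = total_servers, 0 <= x[i] <= capacity[i].
    Filling the cheapest sites first solves this special case exactly.
    """
    if not open_sites:
        return math.inf, {}
    remaining = total_servers
    alloc = {}
    cost = 0.0
    for name in sorted(open_sites, key=lambda n: sites[n]["unit_cost"]):
        take = min(remaining, sites[name]["capacity"])
        alloc[name] = take
        # The fixed cost is paid for every open site, which makes the
        # overall problem non-linear in the number of servers.
        cost += sites[name]["fixed_cost"] + take * sites[name]["unit_cost"]
        remaining -= take
    if remaining > 0:
        return math.inf, {}  # the open sites cannot host all servers
    return cost, alloc


def anneal(sites, total_servers, steps=2000, start_temp=1e6, cooling=0.995, seed=0):
    """Outer step: simulated annealing over the set of open sites."""
    rng = random.Random(seed)
    names = sorted(sites)
    current = set(names)  # start with every candidate site open
    current_cost, _ = evaluate(current, sites, total_servers)
    best, best_cost = set(current), current_cost
    temp = start_temp
    for _ in range(steps):
        # A near neighbor differs by opening or closing one site.
        neighbor = current ^ {rng.choice(names)}
        cost, _ = evaluate(neighbor, sites, total_servers)
        # Always accept improvements; accept worse neighbors with a
        # probability that shrinks as the temperature falls.
        if cost < current_cost or rng.random() < math.exp((current_cost - cost) / temp):
            current, current_cost = neighbor, cost
            if cost < best_cost:
                best, best_cost = set(neighbor), cost
        temp *= cooling
    return best, best_cost, evaluate(best, sites, total_servers)[1]
```

Calling `anneal(sites, total_servers)` with a dictionary of candidate sites returns the best open set found, its cost, and the per-site server counts. The latency, consistency, and availability constraints are omitted here; they would enter as extra constraints in the inner evaluation.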
Summary of our approach • Generate a grid of tentative locations • Collect data about each location • Define datacenter characteristics • Instantiate optimization problem • Solve optimization problem
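The first step above, generating a grid of tentative locations, could look like the following sketch. The function name, the regular latitude/longitude spacing, and the bounding box in the usage note are assumptions for illustration; the slides do not say how the grid was built.

```python
import math


def make_grid(lat_min, lat_max, lon_min, lon_max, step_deg):
    """Return (lat, lon) points spaced step_deg apart, covering the bounding box."""
    # The small epsilon guards against floating-point error in the division.
    n_lat = math.floor((lat_max - lat_min) / step_deg + 1e-9) + 1
    n_lon = math.floor((lon_max - lon_min) / step_deg + 1e-9) + 1
    points = []
    for i in range(n_lat):
        for j in range(n_lon):
            points.append((lat_min + i * step_deg, lon_min + j * step_deg))
    return points
```

For example, `make_grid(25, 49, -125, -67, 2)` covers an approximate bounding box of the contiguous US at 2-degree spacing. Points that fall in the ocean or outside the country would still need to be filtered out before collecting per-location data.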
Tool demo
• We built a tool that
  • Embodies the problem
  • Includes input data for the US
  • Offers multiple solution approaches
Short video at: http://www.darklab.rutgers.edu/DCL/dcl.html
Outline • Motivation • Placing datacenters • Evaluation • Conclusion
Interesting questions
• How much does…
  … lower latency cost?
  … higher availability cost?
  … faster consistency cost?
  … a green DC network cost?
  … a chiller-less DC network cost?
Cost of a 60k-server green DC network
Green DC network costs $100k/month more, except when latency < 70 ms
Cost of a 60k-server chiller-less DC network
Chiller-less DC network is cheaper, but it cannot achieve low latencies
Conclusions • First scientific work on smart datacenter placement • Proposed framework and optimization problem • Proposed solution approach • Characterized many locations across the US • Built a tool to automate the process • Answered many interesting questions • Results show that smart placement can save millions • Work enables smaller companies to reap the benefits