Pricing Cloud Bandwidth Reservations under Demand Uncertainty

Pricing Cloud Bandwidth Reservations under Demand Uncertainty Di Niu, Chen Feng, Baochun Li Department of Electrical and Computer Engineering University of Toronto

Roadmap • Part 1A cloud bandwidth reservation model • Part 2 Price such reservations • Large-scale distributed optimization • Part 3 Trace-driven simulations • Part 1A cloud bandwidth reservation model

WWW Cloud Tenants Problem: No bandwidth guarantee Not good for Video-on-Demand, transaction processing web applications, etc.

Amazon Cluster Compute 10 Gbps Dedicated Network Bandwidth Demand 0 1 2 Days Over-provision

Good News: Bandwidth reservations are becoming feasible between a VM and the Internet H. Ballani, et al. Towards Predictable Datacenter Networks ACM SIGCOMM ‘11 C. Guo, et al. SecondNet: a Data Center Network Virtualization Architecture with Bandwidth Guarantees ACM CoNEXT ‘10

Dynamic Bandwidth Reservation reduces cost due to better utilization Reservation Bandwidth Demand 0 1 2 Days Difficulty: tenants don’t really know their demand!

QoS Level (e.g., 95%) Workload history of the tenant Cloud Provider Tenant Guaranteed Portion Demand Prediction A New Bandwidth Reservation Service A tenant specifies a percentage of its bandwidth demand to be served with guaranteed performance; The remaining demand will be served with best effort repeated periodically Bandwidth Reservation

Tenant Demand Model • Each tenant i has a random demand Di • Assume Di is Gaussian, with • meanμi = E[Di] • variance σi2 = var[Di] • covariance matrix Σ = [σij] • Service Level Agreement: Outage w.p.

Roadmap • Part 1A cloud bandwidth reservation model • Part 2 Price such reservations • Large-scale distributed optimization • Part 3 Trace-driven simulations

Objectives • Objective 1: Pricing the reservations • A reservation fee on top of the usage fee • Objective 2: Resource Allocation • Price affects demand, which affects price in turn • Social Welfare Maximization

Tenant i’s expected utility (revenue) Tenant Utility (e.g., Netflix) Tenant i can specify a guaranteed portion wi Concave, twice differentiable, increasing Utility depends not only on demand, but also on the guaranteed portion!

Non-multiplexing: Multiplexing: e.g. Service cost Bandwidth Reservation • Given submitted guaranteed portions the cloud will guarantee the demands • It needs to reserve a total bandwidth capacity

Surplus of tenanti Profit of the Cloud Provider Price Cloud Objective: Social Welfare Maximization Social Welfare Impossible: the cloud does not know Ui

Pricing function Under Pi(⋅), tenant i will choose Example: Linear pricing Surplus (Profit) Pricing Function Price guaranteed portion, not absolute bandwidth!

where Determine pricing policy to Social Welfare Surplus Pricing as a Distributed Solution Challenge: Cost not decomposable for multiplexing

Since , for Gaussian Mean Std A Simple Case: Non-Multiplexing • Determine pricing policy to where

Lagrange dual Dual problem Original problem The General Case:Lagrange Dual Decomposition M. Chiang, S. Low, A. Calderbank, J. Doyle.Layering as optimization decomposition: A mathematical theory of network architectures.Proc. of IEEE 2007

Subgradient Algorithm: For dual minimization, update price: Lagrange dual a subgradient of Dual problem decompose Lagrange multiplier ki as price: Pi (wi):= ki wi

2 4 Social Welfare (SW) Cloud Provider 1 3 Guaranteed Portion Price Surplus Update to increase . . . . . . Tenant i Tenant 1 Tenant N Weakness of the Subgradient Method Step size is a issue! Convergence is slow.

Solve KKT Conditions of Cloud Provider 2 Set 1 3 . . . . . . Tenant i Our Algorithm: Equation Updates 4 Linear pricingPi (wi)= ki wi suffices!

Theorem 1 (Convergence)Equation updates converge if for all i for all between and

Convergence: A Single Tenant (1-D) Subgradient method Equation Updates Not converging

Covariance matrix: is a cone centered at 0 if is not zero and is small symmetric, positive semi-definite The Case of Multiplexing Satisfies Theorem 1, algorithm converges.

Roadmap • Part 1A cloud bandwidth reservation model • Part 2 Price such reservations • Large-scale distributed optimization • Part 3 Trace-driven simulations

Data Mining: VoD Demand Traces • 200+ GB traces (binary) from UUSee Inc. • reports from online users every 10 minutes • Aggregate into video channels

Predict Expected Demand via Seasonal ARIMA Bandwidth (Mbps) Time periods (1 period = 10 minutes)

Predict Demand Variation via GARCH Mbps Time periods (1 period = 10 minutes)

Prediction Results • Each tenant i has a random demand Di in each “10 minutes” • Di is Gaussian, with • meanμi = E[Di] • variance σi2 = var[Di] • covariance matrix Σ = [σij]

Dimension Reduction via PCA • A channel’s demand = weighted sum of factors • Find factors using Principal Component Analysis (PCA) • Predict factors first, then each channel

3 Biggest Channels of 452 Channels Bandwidth (Mbps) Time periods (1 period = 10 minutes)

The First 3 Principal Components Mbps Time periods (1 period = 10 minutes)

Complexity Reduction: 98% 8 components 452 channels 8 components Data Variance Explained Number of principal components

Utility of tenant i(conservative estimate) Usage of tenant i: w.h.p. Reputation loss for demand not guaranteed Linear revenue Pricing: Parameter Settings

CDF Mean = 6 rounds Mean = 158 rounds Convergence Iteration of the Last Tenant 100 tenants (channels), 81 time periods (81 x 10 Minutes)

Related Work • Primal/Dual Decomposition [Chiang et al. 07] • Contraction Mapping x := T(x) • D. P. Bertsekas, J. Tsitsiklis, "Parallel and distributed computation: numerical methods" • Game Theory [Kelly 97] • Each user submits a price (bid), expects a payoff • Equilibrium may or may not be social optimal • Time Series Prediction • HMM [Silva 12], PCA [Gürsun 11], ARIMA [Niu 11]

Conclusions • A cloud bandwidth reservation model based on guaranteed portions • Pricing for social welfare maximization • Future work: • new decomposition and iterative methods for very large-scale distributed optimization • more general convergence conditions

Thank you Di Niu Department of Electrical and Computer Engineering University of Toronto http://iqua.ece.toronto.edu/~dniu

Root mean squared errors (RMSEs) over 1.25 days RMSE (Mbps) in Log Scale Channel Index

Without multiplexing, Correlation to the market, in [-1, 1] Expected Demand Demand Standard Deviation With multiplexing, Optimal Pricing when each tenant requires wi ≡ 1

Risk neutralizers Majority Histogram of Price Discounts due to Multiplexing mean discount 44% total cost saving 35% Counts Discounts of All Tenants in All Test Periods

Video Channel: F190E Aggregate bandwidth (Mbps) Time periods (one period = 10 minutes)

Pricing Cloud Bandwidth Reservations under Demand Uncertainty

Pricing Cloud Bandwidth Reservations under Demand Uncertainty

Presentation Transcript

Decisions under uncertainty

Decisions under Uncertainty

Reasoning under Uncertainty

Reasoning Under Uncertainty

Scheduling under Uncertainty

Reasoning Under Uncertainty

Planning under Uncertainty

Decisions Under Uncertainty

Reasoning Under Uncertainty

Choice Under Uncertainty

PLANNING UNDER UNCERTAINTY

Reasoning Under Uncertainty

Network Design under Demand Uncertainty

Bandwidth on Demand

Bandwidth on demand

Planning Under Uncertainty

Reasoning Under Uncertainty

Reasoning under Uncertainty

Decisions under Uncertainty

Scheduling under Uncertainty

Choice Under Uncertainty

Pricing Cloud Bandwidth Reservations under Demand Uncertainty