420 likes | 541 Vues
This paper presents a comprehensive model for cloud bandwidth reservations that addresses the challenges of demand uncertainty. It details a framework for pricing these reservations and includes large-scale distributed optimization and trace-driven simulations. With a focus on guaranteed performance for varying tenant demands, the study explores the interplay between resource allocation and social welfare maximization. Future sections will delve into practical applications and simulations to validate the proposed reservation strategies.
E N D
Pricing Cloud Bandwidth Reservations under Demand Uncertainty Di Niu, Chen Feng, Baochun Li Department of Electrical and Computer Engineering University of Toronto
Roadmap • Part 1A cloud bandwidth reservation model • Part 2 Price such reservations • Large-scale distributed optimization • Part 3 Trace-driven simulations • Part 1A cloud bandwidth reservation model
WWW Cloud Tenants Problem: No bandwidth guarantee Not good for Video-on-Demand, transaction processing web applications, etc.
Amazon Cluster Compute 10 Gbps Dedicated Network Bandwidth Demand 0 1 2 Days Over-provision
Good News: Bandwidth reservations are becoming feasible between a VM and the Internet H. Ballani, et al. Towards Predictable Datacenter Networks ACM SIGCOMM ‘11 C. Guo, et al. SecondNet: a Data Center Network Virtualization Architecture with Bandwidth Guarantees ACM CoNEXT ‘10
Dynamic Bandwidth Reservation reduces cost due to better utilization Reservation Bandwidth Demand 0 1 2 Days Difficulty: tenants don’t really know their demand!
QoS Level (e.g., 95%) Workload history of the tenant Cloud Provider Tenant Guaranteed Portion Demand Prediction A New Bandwidth Reservation Service A tenant specifies a percentage of its bandwidth demand to be served with guaranteed performance; The remaining demand will be served with best effort repeated periodically Bandwidth Reservation
Tenant Demand Model • Each tenant i has a random demand Di • Assume Di is Gaussian, with • meanμi = E[Di] • variance σi2 = var[Di] • covariance matrix Σ = [σij] • Service Level Agreement: Outage w.p.
Roadmap • Part 1A cloud bandwidth reservation model • Part 2 Price such reservations • Large-scale distributed optimization • Part 3 Trace-driven simulations
Objectives • Objective 1: Pricing the reservations • A reservation fee on top of the usage fee • Objective 2: Resource Allocation • Price affects demand, which affects price in turn • Social Welfare Maximization
Tenant i’s expected utility (revenue) Tenant Utility (e.g., Netflix) Tenant i can specify a guaranteed portion wi Concave, twice differentiable, increasing Utility depends not only on demand, but also on the guaranteed portion!
Non-multiplexing: Multiplexing: e.g. Service cost Bandwidth Reservation • Given submitted guaranteed portions the cloud will guarantee the demands • It needs to reserve a total bandwidth capacity
Surplus of tenanti Profit of the Cloud Provider Price Cloud Objective: Social Welfare Maximization Social Welfare Impossible: the cloud does not know Ui
Pricing function Under Pi(⋅), tenant i will choose Example: Linear pricing Surplus (Profit) Pricing Function Price guaranteed portion, not absolute bandwidth!
where Determine pricing policy to Social Welfare Surplus Pricing as a Distributed Solution Challenge: Cost not decomposable for multiplexing
Since , for Gaussian Mean Std A Simple Case: Non-Multiplexing • Determine pricing policy to where
Lagrange dual Dual problem Original problem The General Case:Lagrange Dual Decomposition M. Chiang, S. Low, A. Calderbank, J. Doyle.Layering as optimization decomposition: A mathematical theory of network architectures.Proc. of IEEE 2007
Subgradient Algorithm: For dual minimization, update price: Lagrange dual a subgradient of Dual problem decompose Lagrange multiplier ki as price: Pi (wi):= ki wi
2 4 Social Welfare (SW) Cloud Provider 1 3 Guaranteed Portion Price Surplus Update to increase . . . . . . Tenant i Tenant 1 Tenant N Weakness of the Subgradient Method Step size is a issue! Convergence is slow.
Solve KKT Conditions of Cloud Provider 2 Set 1 3 . . . . . . Tenant i Our Algorithm: Equation Updates 4 Linear pricingPi (wi)= ki wi suffices!
Theorem 1 (Convergence)Equation updates converge if for all i for all between and
Convergence: A Single Tenant (1-D) Subgradient method Equation Updates Not converging
Covariance matrix: is a cone centered at 0 if is not zero and is small symmetric, positive semi-definite The Case of Multiplexing Satisfies Theorem 1, algorithm converges.
Roadmap • Part 1A cloud bandwidth reservation model • Part 2 Price such reservations • Large-scale distributed optimization • Part 3 Trace-driven simulations
Data Mining: VoD Demand Traces • 200+ GB traces (binary) from UUSee Inc. • reports from online users every 10 minutes • Aggregate into video channels
Predict Expected Demand via Seasonal ARIMA Bandwidth (Mbps) Time periods (1 period = 10 minutes)
Predict Demand Variation via GARCH Mbps Time periods (1 period = 10 minutes)
Prediction Results • Each tenant i has a random demand Di in each “10 minutes” • Di is Gaussian, with • meanμi = E[Di] • variance σi2 = var[Di] • covariance matrix Σ = [σij]
Dimension Reduction via PCA • A channel’s demand = weighted sum of factors • Find factors using Principal Component Analysis (PCA) • Predict factors first, then each channel
3 Biggest Channels of 452 Channels Bandwidth (Mbps) Time periods (1 period = 10 minutes)
The First 3 Principal Components Mbps Time periods (1 period = 10 minutes)
Complexity Reduction: 98% 8 components 452 channels 8 components Data Variance Explained Number of principal components
Utility of tenant i(conservative estimate) Usage of tenant i: w.h.p. Reputation loss for demand not guaranteed Linear revenue Pricing: Parameter Settings
CDF Mean = 6 rounds Mean = 158 rounds Convergence Iteration of the Last Tenant 100 tenants (channels), 81 time periods (81 x 10 Minutes)
Related Work • Primal/Dual Decomposition [Chiang et al. 07] • Contraction Mapping x := T(x) • D. P. Bertsekas, J. Tsitsiklis, "Parallel and distributed computation: numerical methods" • Game Theory [Kelly 97] • Each user submits a price (bid), expects a payoff • Equilibrium may or may not be social optimal • Time Series Prediction • HMM [Silva 12], PCA [Gürsun 11], ARIMA [Niu 11]
Conclusions • A cloud bandwidth reservation model based on guaranteed portions • Pricing for social welfare maximization • Future work: • new decomposition and iterative methods for very large-scale distributed optimization • more general convergence conditions
Thank you Di Niu Department of Electrical and Computer Engineering University of Toronto http://iqua.ece.toronto.edu/~dniu
Root mean squared errors (RMSEs) over 1.25 days RMSE (Mbps) in Log Scale Channel Index
Without multiplexing, Correlation to the market, in [-1, 1] Expected Demand Demand Standard Deviation With multiplexing, Optimal Pricing when each tenant requires wi ≡ 1
Risk neutralizers Majority Histogram of Price Discounts due to Multiplexing mean discount 44% total cost saving 35% Counts Discounts of All Tenants in All Test Periods
Video Channel: F190E Aggregate bandwidth (Mbps) Time periods (one period = 10 minutes)