Optimizing for Cost in the Cloud

Optimizing for Cost in the Cloud Jinesh Varia @jinman Technology Evangelist

Multiple dimensions of optimizations Cost Performance Response time Time to market High-availability Scalability Security Manageability …….

Optimizing for Cost

When you turn off your cloud resources, you actually stop paying for them

Continuous optimization in your architecture results in recurring savings in your next month’s bill

Elasticityis one of the fundamental properties of the cloud that drives many of its economic benefits

Optimizing for Cost… #1 Use only what you need (use Auto Scaling Service, modify–db)

Turn off what you don’t need (automatically)

25% Savings Optimize by the time of day

www.MyWebSite.com (dynamic data) Amazon Route 53 (DNS) media.MyWebSite.com (static data) Elastic Load Balancer Availability Zone #1 Auto Scaling group : Web Tier Availability Zone #2 Amazon CloudFront Amazon EC2 Auto Scaling group : App Tier Amazon RDS Amazon S3 Amazon RDS

50% Savings Optimize during a year

Auto scaling : Types of Scaling • Scaling by Schedule • Use Scheduled Actions in Auto Scaling Service • Date • Time • Min and Max of Auto Scaling Group Size • You can create up to 125 actions, scheduled up to 31 days into the future, for each of your auto scaling groups. This gives you the ability to scale up to four times a day for a month. • Scaling by Policy • Scaling up Policy - Double the group size • Scaling down Policy - Decrement by 1

Auto scaling Best Practices • Use Auto Scaling Tags • Use Auto scaling Alarms and Email Notifications • Scale up and down symmetrically • Scale up quickly and scaling down slowly • Auto Scaling across Availability Zones • Leverage Suspend and Resume Processes

Example: Scale up by 10% if CPU utilization is greater than 60% for 5 minutes, Scale down by 10% if CPU utilization is less than 30% for 20 minutes.

Agg. CPU Instances

75% Savings Optimize during a month

End of the month processing • Expand the cluster at the end of the month • Expand/Shrink feature in Amazon Elastic MapReduce • Vertically Scale up at the end of the month • Modify-DB-Instance (in Amazon RDS) (or a New RDS DB Instance ) • CloudFormation Script (in Amazon EC2)

Tip: Use “Reminder scripts” • Disassociate your unused EIPs • Delete unassociated EBS volumes • Delete older EBS snapshots • Leverage S3 Object Expiration

Pick the Right Instance Type

Basic recommendations on Instance Type • Choose the EC2 instance type that best matches the resources required by the application • Start with memory requirements and architecture type (32bit or 64-bit) • Then choose the closest number of virtual cores required • Scaling across AZs • Smaller sizes give more granularity for deploying to multiple AZs

AWS Support – Trusted Advisor – Your personal cloud assistant

Tip – Instance Optimizer Free Memory Free CPU Free HDD At 1-min intervals PUT 2 weeks Alarm Amazon CloudWatch Instance Custom Metrics “You could save a bunch of money by switching to a small instance, Click on CloudFormation Script to Save”

Optimizing for Cost… #1 Use only what you need (use Auto Scaling Service, modify–db) #2 Invest time in Reserved Pricing analysis (EC2, RDS)

Your Best Option: Reserved + On-Demand

Save more when you reserve

m2.xlarge running Linux in US-East Region over 3 Year period Break-even point

Recommendations • Steady State Usage Pattern • For 100% utilization • 3-Year Heavy RI (for maximum savings over on-demand) • Spiky Predictable Usage Pattern • Baseline • 3-Year Heavy RI (for maximum savings over on-demand) • 1-Year Light RI (for lowest upfront commitment) + savings over on-demand • Peak: On-Demand • Uncertain and unpredictable Usage Pattern • Start out small with On-Demand Instances (risk-free and commitment-free) • Switch to some combination of Reserved and On-Demand, if application is successful • If not successful, you walk away having spent a fraction of what you would pay to buy your own technology infrastructure

Example: Simple 3-Tier Web Application

Optimizing for Cost… #1 Use only what you need (use Auto Scaling Service, modify–db) #2 Invest time in Reserved Pricing analysis (EC2, RDS) #3 Architect for Spot Instances (bidding strategies)

Optimize by using Spot Instances

What are Spot Instances? Sold at 50% Discount! Sold at 54% Discount! Unused Unused Sold at 56% Discount! Sold at 59% Discount! Unused Unused Sold at 63% Discount! Sold at 66% Discount! Unused Unused Availability Zone Availability Zone Region

What is the tradeoff? Unused Unused Unused Reclaimed Unused Unused Unused Reclaimed Availability Zone Availability Zone Region

Spot Use cases

Save more money by using Spot Instances Reserved Hourly Price >Spot Price < On-Demand Price

Spot: Example Customers 57% 50% 63% 50% 56% 50% 66% 50%

Typical Spot Bidding Strategies Bid near the Reserved Hourly Price Bid above the Spot Price History Bid near On-Demand Price Bid above the On-Demand Price

1. Bid Near the Reserved Hourly Price $ $ $$ $ $ $ $$$$$$ $ $ $ $ $ $ $ $$$ $ $ $$$$$$$$$ $ $$$$$ $ $ $ $ 66% Savings over On-Demand

2. Bid above the Spot Price History 50% Savings over On-Demand

3. Bid near the On-Demand Price 50% Savings over On-Demand

4. Bid above the On-Demand Price 57% Savings over On-Demand

Managing Interruption

Amazon EMR (Hadoop): Run Task Nodes on Spot Amazon S3 Upload large datasets or log files directly Input Data Amazon S3 Data Source OutputData Amazon Elastic MapReduce Hadoop Cluster • TaskNode Amazon Elastic MapReduce Amazon SimpleDB Metadata Mapper Reducer Service • NameNode Code/ Scripts • TaskNode HiveQL Pig Latin Cascading Runs multiple JobFlow Steps • Core Node HiveQL Pig Latin Query • CoreNode HDFS BI Apps JDBC/ODBC

Amazon EMR: Reducing Cost with Spot #1: Cost without Spot 4 instances *14 hrs * $0.45 = $25.20 Job Flow Add 5 Spot Instances Scenario #2 Scenario #1 Allocate 4 instances Job Flow #2: Cost with Spot 4 instances *7 hrs * $0.45 = $12.60 + 5 instances * 7 hrs * $0.225 = $7.875 Total = $20.475 Duration: Duration: 14 Hours Time Savings: 50% Cost Savings: ~19% 7 Hours

Made for each other: MapReduce + Spot • Use Case: Web crawling/Search using Hadoop type clusters. Use Reserved Instances for their DB workloads and Spot instances for their indexing clusters. Launch 100’s of instances. • Bidding Strategy: Bid a little above the On-Demand price to prevent interruption. • Interruption Strategy: Restart the cluster if interrupted 66% Savings over On-Demand

Video Transcoding Application Example Amazon S3 Amazon S3 On-demand + Spot Amazon Elastic Compute Cloud Processed Data Output Bucket Input Bucket Raw Data Amazon EC2 Amazon SQS Amazon SQS Reports Website Job Completed Job Output Queue Input Queue Amazon EC2 Website (Job Manager) Metadata Status Logs Exceptions Amazon CloudWatch Amazon SimpleDB Amazon SimpleDB Amazon EC2 Intranet

Use of Amazon SQS in Spot Architectures VisibilityTimeOut Amazon EC2 Spot Instance

Optimizing Video Transcoding Workloads • Premium Offering • Optimized for Faster response times • No Delays Implementation • Invest in RIs • Use on-demand for Elasticity Maximum Bid Price >= On-demand Rate Get Instant Capacity for higher price • Free Offering • Optimize for reducing cost • Acceptable Delay Limits Implementation • Set Persistent Requests • Use on-demand Instances, if delay Maximum Bid Price < On-demand Rate Get your set reduced price for your workload

Persistent Requests

Architecting for Spot Instances : Best Practices • Manage interruption • Split up your work into small increments • Checkpointing: Save your work frequently and periodically • Test Your Application • Track when Spot Instances Start and Stop • Spot Requests • Use Persistent Requests for continuous tasks • Choose maximum price for your requests

Optimizing for Cost in the Cloud

Optimizing for Cost in the Cloud

Presentation Transcript

The Cloud Cost Model

Optimizing Cost and Performance for Multihoming

Optimizing Cost and Performance for Multihoming

Optimizing MapReduce Provisioning in the Cloud

Optimizing the Cloud in Government

Optimizing the Cost of Excavation

Optimizing Cost and Performance in Online Service Provider

Optimizing Cloud MapReduce for Processing Stream Data using Pipelining

Optimizing Cost and Performance for Content Multihoming

Optimizing for Cost in the Cloud

Optimizing Cost-Efficiency in Telephone Counseling for Smoking Cessation

Optimizing Cost and Performance for Multihoming

Optimizing Cost and Performance for Content Multihoming

Akasia cloud cost planing

Cloud Server Cost in India

The Cloud Cost Model

Managing Cost for Your Cloud Workloads - Loves Cloud

Cloud Cost Optimization Services in Pune

Cloud Cost Optimization

Advanced Cloud Cost Savings

Optimizing Cost And Performance In Cloud Application Development

Mastering Cloud Cost Optimization