Elastic Load Balancing & Auto Scaling Groups doc. added

2022-08-08 23:03:06 +09:00
parent 2ba860d28f
commit 0f25646919
2 changed files with 116 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -9,6 +9,7 @@
  - [IAM: Identity Access & Management](/iam.md)
  - [EC2: Virtual Machines](/ec2.md)
  - [EC2 Instance Storage](/ec2_storage.md)
+  - [Elastic Load Balancing & Auto Scaling Groups Section](/elb_asg.md)

 ### Contributors

--- a/elb_asg.md
+++ b/elb_asg.md
@@ -0,0 +1,115 @@
+# Elastic Load Balancing & Auto Scaling Groups
+
+## Scalability & High Availability
+
+* Scalability means that an application / system can handle greater loads by adapting.
+* There are two kinds of scalability:
+  * Vertical Scalability
+  * Horizontal Scalability (= elasticity)
+* Scalability is linked to, but different from, High Availability
+* Let’s dive into the distinction, using a call center as an example
+
+## Vertical Scalability
+
+* Vertical Scalability means increasing the size of the instance
+* For example, your application runs on a t2.micro
+* Scaling that application vertically means running it on a t2.large
+* Vertical scalability is very common for non-distributed systems, such as a database.
+* There’s usually a limit to how much you can vertically scale (hardware limit)
+
+## Horizontal Scalability
+
+* Horizontal Scalability means increasing the number of instances / systems for your application
+* Horizontal scaling implies distributed systems.
+* This is very common for web applications / modern applications
+* It’s easy to scale horizontally thanks to cloud offerings such as Amazon EC2
+
+## High Availability
+
+* High Availability usually goes hand in hand with horizontal scaling
+* High availability means running your application / system in at least 2 Availability Zones
+* The goal of high availability is to survive a data center loss (disaster)
+
+## High Availability & Scalability For EC2
+
+* Vertical Scaling: Increase instance size (= scale up / down)
+  * From: t2.nano – 0.5 GiB of RAM, 1 vCPU
+  * To: u-12tb1.metal – 12.3 TB of RAM, 448 vCPUs
+* Horizontal Scaling: Increase number of instances (= scale out / in)
+  * Auto Scaling Group
+  * Load Balancer
+* High Availability: Run instances for the same application across multiple AZs
+  * Auto Scaling Group multi AZ
+  * Load Balancer multi AZ
+
+## Scalability vs Elasticity (vs Agility)
+
+* Scalability: ability to accommodate a larger load by making the hardware stronger (scale up), or by adding nodes (scale out)
+* Elasticity: once a system is scalable, elasticity means that there will be some “auto-scaling” so that the system can scale based on the load. This is “cloud-friendly”: pay-per-use, match demand, optimize costs
+* Agility: (not related to scalability - distractor) new IT resources are only a click away, which means that you reduce the time to make those resources available to your developers from weeks to just minutes.
+
+## What is load balancing?
+
+* Load balancers are servers that forward internet traffic to multiple servers (EC2 Instances) downstream.
+
+## Why use a load balancer?
+
+* Spread load across multiple downstream instances
+* Expose a single point of access (DNS) to your application
+* Seamlessly handle failures of downstream instances
+* Do regular health checks on your instances
+* Provide SSL termination (HTTPS) for your websites
+* High availability across zones
+
+## Why use an Elastic Load Balancer?
+
+* An ELB (Elastic Load Balancer) is a managed load balancer
+  * AWS guarantees that it will be working
+  * AWS takes care of upgrades, maintenance, high availability
+  * AWS provides only a few configuration knobs
+* It costs less to set up your own load balancer, but it will be a lot more effort on your end (maintenance, integrations)
+* 3 kinds of load balancers offered by AWS:
+  * Application Load Balancer (HTTP / HTTPS only) – Layer 7
+  * Network Load Balancer (ultra-high performance, allows for TCP) – Layer 4
+  * Classic Load Balancer (slowly retiring) – Layer 4 & 7
+
+## What’s an Auto Scaling Group?
+
+* In real life, the load on your websites and applications can change
+* In the cloud, you can create and get rid of servers very quickly
+* The goal of an Auto Scaling Group (ASG) is to:
+  * Scale out (add EC2 instances) to match an increased load
+  * Scale in (remove EC2 instances) to match a decreased load
+  * Ensure we have a minimum and a maximum number of machines running
+  * Automatically register new instances to a load balancer
+  * Replace unhealthy instances
+* Cost Savings: only run at an optimal capacity (principle of the cloud)
+
+## Auto Scaling Groups – Scaling Strategies
+
+* Manual Scaling: Update the size of an ASG manually
+* Dynamic Scaling: Respond to changing demand
+  * Simple / Step Scaling
+    * When a CloudWatch alarm is triggered (for example, CPU > 70%), then add 2 units
+    * When a CloudWatch alarm is triggered (for example, CPU < 30%), then remove 1 unit
+  * Target Tracking Scaling
+    * Example: I want the average ASG CPU to stay at around 40%
+  * Scheduled Scaling
+    * Anticipate a scaling based on known usage patterns
+    * Example: increase the min. capacity to 10 at 5 pm on Fridays
+* Predictive Scaling
+  * Uses Machine Learning to predict future traffic ahead of time
+  * Automatically provisions the right number of EC2 instances in advance
+  * Useful when your load has predictable time-based patterns
+
+## ELB & ASG – Summary
+
+* High Availability vs Scalability (vertical and horizontal) vs Elasticity vs Agility in the Cloud
+* Elastic Load Balancers (ELB)
+  * Distribute traffic across backend EC2 instances, can be Multi-AZ
+  * Supports health checks
+  * 3 types: Application LB (HTTP – L7), Network LB (TCP – L4), Classic LB (old)
+* Auto Scaling Groups (ASG)
+  * Implement Elasticity for your application, across multiple AZs
+  * Scale EC2 instances based on the demand on your system, replace unhealthy instances
+  * Integrated with the ELB
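
The step scaling rules and the minimum / maximum capacity described in the notes above can be sketched as a small simulation. This is only an illustration, not the AWS API: `AutoScalingGroup`, `set_desired` and `step_scale` are hypothetical names, the 70% / 30% thresholds and the +2 / -1 adjustments come from the notes' example, and the min / max values in the usage part are assumed.

```python
from dataclasses import dataclass


@dataclass
class AutoScalingGroup:
    """Toy model of an ASG: a desired capacity kept between min and max."""
    min_size: int
    max_size: int
    desired: int

    def set_desired(self, value: int) -> int:
        # Clamp so the group never leaves the [min_size, max_size] range.
        self.desired = max(self.min_size, min(self.max_size, value))
        return self.desired


def step_scale(asg: AutoScalingGroup, avg_cpu: float) -> int:
    """Apply the two example alarms from the notes to one CPU reading."""
    if avg_cpu > 70:      # high-CPU alarm: add 2 units (scale out)
        return asg.set_desired(asg.desired + 2)
    if avg_cpu < 30:      # low-CPU alarm: remove 1 unit (scale in)
        return asg.set_desired(asg.desired - 1)
    return asg.desired    # no alarm triggered: capacity unchanged


if __name__ == "__main__":
    # Assumed limits for the illustration: between 1 and 4 instances.
    asg = AutoScalingGroup(min_size=1, max_size=4, desired=2)
    for cpu in (85, 90, 50, 20, 10, 5):
        print(f"avg CPU {cpu}% -> desired capacity {step_scale(asg, cpu)}")
```

In this sketch the second high-CPU reading cannot push the group past its maximum, and repeated low-CPU readings stop at the minimum, which is the "ensure we have a minimum and a maximum number of machines running" behaviour from the notes.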