Scale an application running on AWS EC2 for a big day event

Wondering how I did it? Let's walk through the experience.

Task: As an Architect and a member of the Cloud Center of Excellence (CCOE), I was approached by the application team to review the architecture and confirm it could support the event.

While reviewing the architecture, I found that the current infrastructure was not capable of supporting the bursty workload expected on the big day.

How did I know it could not support bursty traffic?

We ran extensive load tests against the current infrastructure and, as expected, the system buckled and crashed under the volume of requests.

Here is a diagram of the existing architecture.

[Diagram: existing three-tier architecture]

Let’s break it down.

We followed a 3 Tier Architecture model.

Web Tier — Accepting all user requests.

Logic Tier — Applying the business logic and interacting with third party systems and AWS Services.

Database Tier — Storing information related to products, user info, orders, billing, etc.

Challenges with existing Infrastructure

1. Not capable of handling the traffic spikes of a bursty workload.

2. The load balancer was not optimized for heavy traffic.

3. Auto Scaling used a Target Tracking strategy, which adjusts capacity to maintain a target metric (CPU utilization at 50%): if CPU usage exceeds 50%, Auto Scaling adds EC2 instances. Because it only reacts after the metric moves, it is not ideal for a big day traffic event.

4. The AMI used to provision new instances took more than 60 seconds due to the large number of packages and libraries baked into it.

5. The logic tier hit the database directly with requests, overwhelming the database.
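For context, the original target-tracking setup looked roughly like the sketch below. This is a minimal illustration with hypothetical names (`web-asg`, `cpu-50-target`); the actual boto3 call is left commented out so the sketch stays self-contained.

```python
# Sketch of the original target-tracking policy (average CPU held at 50%).
# Names like "web-asg" are hypothetical. In a real setup this dict would be
# passed to boto3's autoscaling put_scaling_policy call.

def target_tracking_policy(asg_name: str, target_cpu: float) -> dict:
    """Build parameters for a CPU target-tracking scaling policy."""
    return {
        "AutoScalingGroupName": asg_name,
        "PolicyName": f"cpu-{int(target_cpu)}-target",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingConfiguration": {
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "ASGAverageCPUUtilization",
            },
            # Auto Scaling adds instances when average CPU drifts above this value
            # and removes them when it drifts below -- i.e., it reacts after the fact.
            "TargetValue": target_cpu,
        },
    }

params = target_tracking_policy("web-asg", 50.0)
print(params["PolicyType"])  # → TargetTrackingScaling
# In a live environment:
# import boto3
# boto3.client("autoscaling").put_scaling_policy(**params)
```

The reactive nature of this policy is exactly the limitation described above: capacity grows only after CPU has already crossed the threshold.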

Architecture Diagram to support Big day event

[Diagram: updated architecture for the big day event]

Let’s break it down and see how we used the above architecture to overcome the existing challenges.

1. The load balancer was pre-warmed to support the bursty traffic.

2. The Auto Scaling strategy was changed from Target Tracking to Scheduled Scaling, which allowed us to pre-warm EC2 instances before the event.

3. We worked with the app team to remove unneeded packages and libraries, making the AMI lightweight so EC2 instances could provision quickly.

4. RDS Proxy was introduced as a layer between the logic tier and the database to keep the database from being overwhelmed.

I would like to highlight a couple of AWS services that were game changers for this event and deserve a mention here.

Auto Scaling Strategies: Scheduled Scaling

Scheduled Scaling allows you to proactively adjust capacity by setting up scaling actions at specific times, based on predictable demand patterns. Instead of reacting to real-time changes, AWS Auto Scaling increases or decreases resources at predefined times.

How Scheduled Scaling Works

  1. Define a Schedule: Specify a start time and, optionally, an end time for scaling.
  2. Set Desired Capacity: Define the minimum and maximum number of instances or resources.
  3. AWS Auto Scaling Adjusts Resources: Instances or resources are automatically adjusted at the scheduled time.
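The three steps above can be sketched as a single scheduled action. The group name, action name, times, and sizes below are hypothetical; in production the dict would be passed to boto3's `put_scheduled_update_group_action`, shown commented out so the sketch runs stand-alone.

```python
# Sketch of a scheduled scaling action that pre-warms capacity before the
# event. All names and numbers are illustrative.
from datetime import datetime, timezone

def scheduled_action(asg_name, action_name, start, end,
                     min_size, max_size, desired):
    """Build parameters for a one-off scheduled scaling action."""
    return {
        "AutoScalingGroupName": asg_name,
        "ScheduledActionName": action_name,
        "StartTime": start,          # capacity is raised at this time
        "EndTime": end,              # and can be lowered again afterwards
        "MinSize": min_size,
        "MaxSize": max_size,
        "DesiredCapacity": desired,  # instances running before traffic arrives
    }

action = scheduled_action(
    "web-asg", "big-day-prewarm",
    start=datetime(2025, 11, 28, 6, 0, tzinfo=timezone.utc),  # before the event
    end=datetime(2025, 11, 29, 6, 0, tzinfo=timezone.utc),    # after the event
    min_size=10, max_size=50, desired=20,
)
print(action["DesiredCapacity"])  # → 20
# In a live environment:
# import boto3
# boto3.client("autoscaling").put_scheduled_update_group_action(**action)
```

The key difference from target tracking: capacity is already in place when the first burst of traffic hits, instead of being added minutes later.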

Amazon RDS Proxy

Amazon RDS Proxy is a fully managed database proxy service for Amazon RDS (Relational Database Service) and Amazon Aurora. It helps improve the scalability, security, and performance of applications that use MySQL, PostgreSQL, or SQL Server databases by pooling and sharing database connections efficiently.

Key Benefits of RDS Proxy

  1. Connection Pooling

  • Reduces the overhead of frequently opening and closing database connections.
  • Helps handle large numbers of client connections efficiently.

2. Improved Availability & Failover

  • Speeds up failover times in Amazon Aurora and Amazon RDS (reducing from minutes to seconds).
  • Ensures that applications remain highly available during database failovers.

3. Better Scalability

  • Protects databases from being overwhelmed by too many connections.
  • Efficiently reuses existing connections instead of constantly creating new ones.

4. Enhanced Security

  • Uses AWS IAM authentication instead of storing database credentials in the application.
  • Supports AWS Secrets Manager to securely store and manage credentials.

5. Cost Efficiency

  • Reduces database instance costs by optimizing connection usage.
  • Minimizes the need to scale up database instances for handling connection limits.

How RDS Proxy Works

  • It sits between your application and the database.
  • The proxy maintains a pool of established connections to the database.
  • When the application requests a connection, it is assigned an existing pooled connection instead of creating a new one.
  • It automatically detects and reroutes traffic in case of database failover.
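The pooling idea at the heart of RDS Proxy can be illustrated with a tiny in-process pool. This is a conceptual sketch only, not how you talk to RDS Proxy in practice (the proxy does this for you behind its endpoint); `fake_connect` stands in for an expensive database handshake.

```python
# Conceptual sketch of connection pooling, the core idea behind RDS Proxy.
import queue

class ConnectionPool:
    def __init__(self, connect, size):
        self._idle = queue.Queue()
        for _ in range(size):        # establish all connections up front
            self._idle.put(connect())

    def acquire(self):
        """Hand out an existing connection instead of opening a new one."""
        return self._idle.get()

    def release(self, conn):
        """Return the connection so another client can reuse it."""
        self._idle.put(conn)

counter = {"opens": 0}

def fake_connect():
    """Stand-in for a real (expensive) database handshake."""
    counter["opens"] += 1
    return object()

pool = ConnectionPool(fake_connect, size=3)
for _ in range(100):                 # 100 requests reuse only 3 real connections
    conn = pool.acquire()
    pool.release(conn)
print(counter["opens"])  # → 3
```

One hundred client requests are served by just three database connections, which is exactly how the proxy shields the database from connection storms.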

Use Cases

  • High-traffic web applications that need efficient database connection management.
  • Serverless applications using AWS Lambda that require persistent database connections.

By implementing auto scaling, load balancing, database optimizations, and real-time monitoring, we prepared our infrastructure and application to handle sudden traffic spikes, and the big day event was a success.

Have you ever been asked to support a big day event? How did you do it? Feel free to share your experience.

