Ayush Poddar's Wiki

Scaling a web app

Dec 16, 20242 min read

Simple setup

How does a web request happen in a simple single-server setup? Note the order of the requests made

Step 1: Separating the data and web tier

Separating web traffic (web tier) and database (data tier) servers allows them to be scaled independently Data is now saved in a separate DB tier

Which database to choose?

There are, primarily, two kinds of databases:

Relational database
Non-relational database

Step 2: Multiple web servers - Using load balancer

Users will send requests to the public IP of the load balancer, which in turn will decide which server to forward the request to
If the traffic increases, the number of servers can be increased and the load balancer can start sending the increased requests to the new servers too.

Step 3: Replicating data

Replicating data is important in order to ensure that all data is safe in the event that our main DB server goes down
A common way is to perform database replication using master-slave relationship

Step 4: Inserting a cache tier

Used in order to improve the data fetch speed of frequently read data
In this scenario, we will go with the Read-through cache

Step 5: Using a CDN

Used to deliver static content

Step 6: Scaling horizontally - Create stateless web tier

We need to move state (like user session data) out of the web tier
Good practice is to store session data in the persistent storage such as relational database or NoSQL
Each web server in the cluster can access state data from the persistent storage created above
This gives us the Stateless web server

Step 7: GeoDNS routing users

Data centers are replicated across regions
Users are geo-routed to the closest data center

Demonstrating GeoDNS routing

Step 8: Decoupling systems to support independent scaling of systems

Message queues help with decoupling of systems reducing inter-service dependencies

Step 9: Logging

Monitoring logs is important in order to identify errors and problems in the system

Step 10: Collecting metrics

Collecting application system metrics

Step 11: Automating application development process

Automating application maintenance cycle

A full blown horizontally scaled application

Step 12: Scaling the database

Can be done using:
- Can also vertically scale the database
- Database sharding

Horizontal scaling vs Vertical Scaling

Sources

(Done) Scale From Zero To Millions Of Users by ByteByteGo

Related Notes

Horizontal scaling

Graph View

Simple setup
Step 1: Separating the data and web tier
Which database to choose?
Step 2: Multiple web servers - Using load balancer
Step 3: Replicating data
Step 4: Inserting a cache tier
Step 5: Using a CDN
Step 6: Scaling horizontally - Create stateless web tier
Step 7: GeoDNS routing users
Step 8: Decoupling systems to support independent scaling of systems
Step 9: Logging
Step 10: Collecting metrics
Step 11: Automating application development process
Step 12: Scaling the database
Sources
Related Notes

Backlinks

High level designs - Software Systems
Design a key-value store

Created with Quartz v4.4.0 © 2024

GitHub
Discord Community