Definition
A load balancer is a system that distributes incoming network traffic across multiple servers so that no single server bears too much demand. It sits between your visitors and your servers, directing each request to whichever server is best positioned to handle it — typically the one with the lightest current load. If one server fails, the load balancer automatically redirects traffic to the remaining healthy servers, keeping your website or application online.
Why It Matters
A single server has a finite capacity. During traffic spikes — a product launch, a marketing campaign, or seasonal demand — that capacity can be exceeded, resulting in slow responses or complete outages. A load balancer solves this by spreading the work across multiple servers, ensuring consistent performance even under heavy load. It also provides redundancy: if a server crashes or needs maintenance, the load balancer routes around it without visitors noticing any disruption. For any business where downtime directly costs money or damages reputation, load balancing is an essential part of the infrastructure.
Example
A ticket-selling platform experiences massive traffic spikes whenever popular events go on sale. With a single server, the site would crash within minutes of a major release. By placing a load balancer in front of four identical servers, incoming requests are spread evenly. When one event generates ten times the normal traffic, all four servers share the load. If one server develops a problem during the sale, the load balancer stops sending traffic to it and the remaining three continue serving customers without interruption.