A load balancer distributes incoming requests across multiple backend servers to prevent any single server from becoming a bottleneck. It improves availability (if one server fails, others handle traffic) and horizontal scalability (add servers to handle more load).