HAProxy

The Reliable, High Performance TCP/HTTP Load Balancer

Benchmarks using Myricom's PCI-Express 10 Gig NICs (Myri-10G PCI-Express)

...or how to achieve 10-Gig load-balancing with HAProxy !

Lab setup

ASUS M3A32MVP Deluxe + AMD Athlon64-X2/3.2 GHz for traffic generation (*2)
ASUS P5E + intel C2D E8200/2.66 GHz to host HAProxy

Well, I was right to select two different boards. The AMD-based mobos cannot push more than 8-9 Gbps to the wire. They can receive it though. For this reason, some of the tests are made with my desktop PC as the HTTP server (x38 too), so that the whole chain is not limited to 10 Gbps.

I could also verify that this wonderful intel X38 chipset has no trouble pushing 20 Gbps to the wires when attacked by both AMD in parallel.

Anyway, whatever the test, the client is always directly connected to the HAProxy, which itself is directly connected to the server. Those are only point-to-point connections, as I have no 10-Gig switch.

Tests methodology

amd1

c2d

amd2

The collected values are then passed to another script which produces a GNUPLOT script, which when run, produces a PNG graph. The graph shows in green the number of hits per second, which also happens to be the connection rate since haproxy does only one hit per connection. In red, we have the data rate (HTTP headers+data only) reached for each object size. In general, the larger the object, the smaller the connection overhead and the higher the bandwidth.

Tests in single-process mode, 16kB buffers

amd1

C2D