perf-bsd44.html revision 7b3e47feb2f0d0a120f80b2de66656a55d0e63ef
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>
<TITLE>Running a High-Performance Web Server for BSD</TITLE>
</HEAD>
<!-- Background white, links blue (unvisited), navy (visited), red (active) -->
<BODY
BGCOLOR="#FFFFFF"
TEXT="#000000"
LINK="#0000FF"
VLINK="#000080"
ALINK="#FF0000"
>
<A NAME="initial">
<!--#include virtual="header.html" -->
</A>
<blockquote><strong>Warning:</strong>
This document has not been updated to take into account changes
made in the 2.0 version of the Apache HTTP Server. Some of the
information may still be relevant, but please use it
with care.
</blockquote>
<H1 ALIGN="CENTER">Running a High-Performance Web Server for BSD</H1>
Like other OS's, the listen queue is often the <STRONG>first limit
hit</STRONG>. The
following are comments from "Aaron Gifford &lt;agifford@InfoWest.COM&gt;"
on how to fix this on BSDI 1.x, 2.x, and FreeBSD 2.0 (and earlier):
<P>
Edit the following two files:
<BLOCKQUOTE><CODE> /usr/include/sys/socket.h <BR>
/usr/src/sys/sys/socket.h </CODE></BLOCKQUOTE>
In each file, look for the following:
<PRE>
/*
* Maximum queue length specifiable by listen.
*/
#define SOMAXCONN 5
</PRE>
Just change the "5" to whatever appears to work. I bumped the two
machines I was having problems with up to 32 and haven't noticed the
problem since.
<P>
After the edit, recompile the kernel and recompile the Apache server
then reboot.
<P>
FreeBSD 2.1 seems to be perfectly happy, with SOMAXCONN
set to 32 already.
<P>
<A NAME="detail">
<STRONG>Addendum for <EM>very</EM> heavily loaded BSD servers</STRONG><BR>
</A>
from Chuck Murcko &lt;chuck@telebase.com&gt;
<P>
If you're running a really busy BSD Apache server, the following are useful
things to do if the system is acting sluggish:<P>
<UL>
<LI> Run vmstat to check memory usage, page/swap rates, <EM>etc.</EM>
<LI> Run netstat -m to check mbuf usage
<LI> Run fstat to check file descriptor usage
</UL>
These utilities give you an idea what you'll need to tune in your kernel,
and whether it'll help to buy more RAM.
Here are some BSD kernel config parameters (actually BSDI, but pertinent to
FreeBSD and other 4.4-lite derivatives) from a system getting heavy usage.
The tools mentioned above were used, and the system memory was increased to
48 MB before these tuneups. Other system parameters remained unchanged.
<P>
<PRE>
maxusers 256
</PRE>
Maxusers drives a <EM>lot</EM> of other kernel parameters:
<UL>
<LI> Maximum # of processes
<LI> Maximum # of processes per user
<LI> System wide open files limit
<LI> Per-process open files limit
<LI> Maximum # of mbuf clusters
<LI> Proc/pgrp hash table size
</UL>
The actual formulae for these derived parameters are in
<EM>/usr/src/sys/conf/param.c</EM>.
These calculated parameters can also be overridden (in part) by specifying
your own values in the kernel configuration file:
<PRE>
# Network options. NMBCLUSTERS defines the number of mbuf clusters and
# defaults to 256. This machine is a server that handles lots of traffic,
# so we crank that value.
options NMBCLUSTERS=4096 # mbuf clusters at 4096
#
# Misc. options
#
options CHILD_MAX=512 # maximum number of child processes
options OPEN_MAX=512 # maximum fds (breaks RPC svcs)
</PRE>
<P>
In many cases, NMBCLUSTERS must be set much larger than would appear
necessary at first glance. The reason for this is that if the browser
disconnects in mid-transfer, the socket fd associated with that particular
connection ends up in the TIME_WAIT state for several minutes, during
which time its mbufs are not yet freed. Another reason is that, on server
timeouts, some connections end up in FIN_WAIT_2 state forever, because
this state doesn't time out on the server, and the browser never sent
a final FIN. For more details see the
<A HREF="fin_wait_2.html">FIN_WAIT_2</A> page.
<P>
Some more info on mbuf clusters (from sys/mbuf.h):
<PRE>
/*
* Mbufs are of a single size, MSIZE (machine/machparam.h), which
* includes overhead. An mbuf may add a single "mbuf cluster" of size
* MCLBYTES (also in machine/machparam.h), which has no additional overhead
* and is used instead of the internal data area; this is done when
* at least MINCLSIZE of data must be stored.
*/
</PRE>
<P>
CHILD_MAX and OPEN_MAX are set to allow up to 512 child processes (different
than the maximum value for processes per user ID) and file descriptors.
These values may change for your particular configuration (a higher OPEN_MAX
value if you've got modules or CGI scripts opening lots of connections or
files). If you've got a lot of other activity besides httpd on the same
machine, you'll have to set NPROC higher still. In this example, the NPROC
value derived from maxusers proved sufficient for our load.
<P>
To increase the size of the <CODE>listen()</CODE> queue, you need to
adjust the value of SOMAXCONN. SOMAXCONN is not derived from maxusers,
so you'll always need to increase that yourself. We use a value guaranteed
to be larger than Apache's default for the listen() of 128, currently.
The actual value for SOMAXCONN is set in <CODE>sys/socket.h</CODE>.
The best way to adjust this parameter is run-time, rather than changing
it in this header file and thus hardcoding a value in the kernel and
elsewhere. To do this, edit <CODE>/etc/rc.local</CODE> and add the
following line:
<PRE>
/usr/sbin/sysctl -w kern.somaxconn=256
</PRE>
<P>
We used <CODE>256</CODE> but you can tune it for your own setup. In
many cases, however, even the default value of <CODE>128</CODE> (for
later versions of FreeBSD) is OK.
<P>
<STRONG>Caveats</STRONG>
<P>
Be aware that your system may not boot with a kernel that is configured
to use more resources than you have available system RAM.
<STRONG>ALWAYS</STRONG>
have a known bootable kernel available when tuning your system this way,
and use the system tools beforehand to learn if you need to buy more
memory before tuning.
<P>
RPC services will fail when the value of OPEN_MAX is larger than 256.
This is a function of the original implementations of the RPC library,
which used a byte value for holding file descriptors. BSDI has partially
addressed this limit in its 2.1 release, but a real fix may well await
the redesign of RPC itself.
<P>
Finally, there's the hard limit of child processes configured in Apache.
<P>
For versions of Apache later than 1.0.5 you'll need to change the
definition for <STRONG>HARD_SERVER_LIMIT</STRONG> in <EM>httpd.h</EM> and
recompile if you need to run more than the default 150 instances of httpd.
<P>
From conf/httpd.conf-dist:
<PRE>
# Limit on total number of servers running, <EM>i.e.</EM>, limit on the number
# of clients who can simultaneously connect --- if this limit is ever
# reached, clients will be LOCKED OUT, so it should NOT BE SET TOO LOW.
# It is intended mainly as a brake to keep a runaway server from taking
# Unix with it as it spirals down...
MaxClients 150
</PRE>
Know what you're doing if you bump this value up, and make sure you've
done your system monitoring, RAM expansion, and kernel tuning beforehand.
Then you're ready to service some serious hits!
<P>
Thanks to <EM>Tony Sanders</EM> and <EM>Chris Torek</EM> at BSDI for their
helpful suggestions and information.
<P>
"M. Teterin" &lt;mi@ALDAN.ziplink.net&gt; writes:<P>
<BLOCKQUOTE>It really does help if your kernel and frequently used utilities
are fully optimized. Rebuilding the FreeBSD kernel on an AMD-133
(486-class CPU) web-server with<BR>
<CODE> -m486 -fexpensive-optimizations -fomit-frame-pointer -O2</CODE><BR>
helped reduce the number of "unable" errors, because the CPU was
often maxed out.</BLOCKQUOTE>
<P>
<HR>
<H3>More welcome!</H3>
If you have tips to contribute, send mail to
<A HREF="mailto:apache@apache.org">apache@apache.org</A>
<!--#include virtual="footer.html" -->
</BODY></HTML>