perf-scaling.xml revision 9ddfddf1f7de27b716e12a49af7bc8c02e9f5651
842ae4bd224140319ae7feec1872b93dfd491143fielding<?xml version="1.0" encoding="utf-8"?>
842ae4bd224140319ae7feec1872b93dfd491143fielding<!DOCTYPE manualpage SYSTEM "/style/manualpage.dtd">
842ae4bd224140319ae7feec1872b93dfd491143fielding<?xml-stylesheet type="text/xsl" href="/style/manual.en.xsl"?>
842ae4bd224140319ae7feec1872b93dfd491143fielding<!-- $LastChangedRevision: 1296735 $ -->
842ae4bd224140319ae7feec1872b93dfd491143fielding
842ae4bd224140319ae7feec1872b93dfd491143fielding<!--
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse Licensed to the Apache Software Foundation (ASF) under one or more
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd contributor license agreements. See the NOTICE file distributed with
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse this work for additional information regarding copyright ownership.
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd The ASF licenses this file to You under the Apache License, Version 2.0
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd (the "License"); you may not use this file except in compliance with
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd the License. You may obtain a copy of the License at
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd http://www.apache.org/licenses/LICENSE-2.0
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd Unless required by applicable law or agreed to in writing, software
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd distributed under the License is distributed on an "AS IS" BASIS,
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd See the License for the specific language governing permissions and
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd limitations under the License.
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd-->
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd<manualpage metafile="perf-scaling.xml.meta">
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd
ce9621257ef9e54c1bbe5ad8a5f445a1f211c2dcnd<title>Performance Scaling</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<summary>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The Performance Tuning page in the Apache 1.3 documentation says: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<ul><li>“Apache is a general webserver, which is designed to be correct first, and fast
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse second. Even so, its performance is quite satisfactory. Most sites have less than 10Mbits of outgoing bandwidth, which Apache can fill using only a low end Pentium-based webserver.” </li>
e18e68b42830409bf48de0df9eed3fe363664aa7aaron</ul>
70535d6421eb979ac79d8f49d31cd94d75dd8b2fjorton<p>However, this sentence was written a few years ago, and in the meantime several things have happened. On one hand, web server hardware has become much faster. On the other hand, many sites now are allowed much more than ten megabits per second of outgoing bandwidth. In addition, web applications have become more complex. The classic brochureware site is alive and well, but the web has grown up substantially as a computing application platform and webmasters may find themselves running dynamic content in Perl, PHP or Java, all of which take a toll on performance. </p>
8464a9c46b967001e38fe3c8afff51a649e9de51dougm<p>Therefore, in spite of strides forward in machine speed and bandwidth allowances, web server performance and web application performance remain areas of concern. In this documentation several aspects of web server performance will be discussed. </p>
579fd9e90990eee18b5e504eb4c0d2ce18f76208aaron
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</summary>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<section id="What Will and Will Not Be Discussed">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>What Will and Will Not Be Discussed</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The session will focus on easily accessible configuration and tuning options for Apache httpd 2.2 and 2.3 as well as monitoring tools. Monitoring tools will allow you to observe your web server to gather information about its performance, or lack thereof. We&apos;ll assume that you don&apos;t have an unlimited budget for server hardware, so the existing infrastructure will have to do the job. You have no desire to compile your own Apache, or to recompile the operating system kernel. We do assume, though, that you have some familiarity with the Apache httpd configuration file. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</section>
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<section id="Monitoring Your Server">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>Monitoring Your Server</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The first task when sizing or performance-tuning your server is to find out how your system is currently performing. By monitoring your server under real-world load, or artificially generated load, you can extrapolate its behavior under stress, such as when your site is mentioned on Slashdot. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
05413593151a238718198cc04ca849b2426be106rse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<section id="Monitoring Tools">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>Monitoring Tools</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<section id="top">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>top</title>
434ad3e8e769a6a7a78c15f3ae2f7ae3adbfbb49wrowe<p>The top tool ships with Linux and FreeBSD. Solaris offers `prstat&apos;. It collects a number of statistics for the system and for each running process, then displays them interactively on your terminal. The data displayed is refreshed every second and varies by platform, but typically includes system load average, number of processes and their current states, the percent CPU(s) time spent executing user and system code, and the state of the virtual memory system. The data displayed for each process is typically configurable and includes its process name and ID, priority and nice values, memory footprint, and percentage CPU usage. The following example shows multiple httpd processes (with MPM worker and event) running on an Linux (Xen) system: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<example>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrsetop - 23:10:58 up 71 days, 6:14, 4 users, load average: 0.25, 0.53, 0.47<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseTasks: 163 total, 1 running, 162 sleeping, 0 stopped, 0 zombie<br />
05413593151a238718198cc04ca849b2426be106rseCpu(s): 11.6%us, 0.7%sy, 0.0%ni, 87.3%id, 0.4%wa, 0.0%hi, 0.0%si, 0.0%st<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseMem: 2621656k total, 2178684k used, 442972k free, 100500k buffers<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseSwap: 4194296k total, 860584k used, 3333712k free, 1157552k cached<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<br />
87a1c79b7b37702a254920ca5214fb282a4fb085dougm PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND<br />
87a1c79b7b37702a254920ca5214fb282a4fb085dougm16687 example_ 20 0 1200m 547m 179m S 45 21.4 1:09.59 httpd-worker<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse15195 www 20 0 441m 33m 2468 S 0 1.3 0:41.41 httpd-worker<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 1 root 20 0 10312 328 308 S 0 0.0 0:33.17 init<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 2 root 15 -5 0 0 0 S 0 0.0 0:00.00 kthreadd<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 3 root RT -5 0 0 0 S 0 0.0 0:00.14 migration/0<br />
e8f95a682820a599fe41b22977010636be5c2717jim 4 root 15 -5 0 0 0 S 0 0.0 0:04.58 ksoftirqd/0<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 5 root RT -5 0 0 0 S 0 0.0 4:45.89 watchdog/0<br />
e8f95a682820a599fe41b22977010636be5c2717jim 6 root 15 -5 0 0 0 S 0 0.0 1:42.52 events/0<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 7 root 15 -5 0 0 0 S 0 0.0 0:00.00 khelper<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 19 root 15 -5 0 0 0 S 0 0.0 0:00.00 xenwatch<br />
e8f95a682820a599fe41b22977010636be5c2717jim 20 root 15 -5 0 0 0 S 0 0.0 0:00.00 xenbus<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 28 root RT -5 0 0 0 S 0 0.0 0:00.14 migration/1<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 29 root 15 -5 0 0 0 S 0 0.0 0:00.20 ksoftirqd/1<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 30 root RT -5 0 0 0 S 0 0.0 0:05.96 watchdog/1<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 31 root 15 -5 0 0 0 S 0 0.0 1:18.35 events/1<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 32 root RT -5 0 0 0 S 0 0.0 0:00.08 migration/2<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 33 root 15 -5 0 0 0 S 0 0.0 0:00.18 ksoftirqd/2<br />
87a1c79b7b37702a254920ca5214fb282a4fb085dougm 34 root RT -5 0 0 0 S 0 0.0 0:06.00 watchdog/2<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 35 root 15 -5 0 0 0 S 0 0.0 1:08.39 events/2<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 36 root RT -5 0 0 0 S 0 0.0 0:00.10 migration/3<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 37 root 15 -5 0 0 0 S 0 0.0 0:00.16 ksoftirqd/3<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 38 root RT -5 0 0 0 S 0 0.0 0:06.08 watchdog/3<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 39 root 15 -5 0 0 0 S 0 0.0 1:22.81 events/3<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 68 root 15 -5 0 0 0 S 0 0.0 0:06.28 kblockd/0<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 69 root 15 -5 0 0 0 S 0 0.0 0:00.04 kblockd/1<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 70 root 15 -5 0 0 0 S 0 0.0 0:00.04 kblockd/2
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</example>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>Top is a wonderful tool even though it’s slightly resource intensive (when running, its own process is usually in the top ten CPU gluttons). It is indispensable in determining the size of a running process, which comes in handy when determining how many server processes you can run on your machine. How to do this is described in &apos;<a href="/httpd/PerformanceScalingUp#S">sizing MaxClients</a>&apos;. Top is, however, an interactive tool and running it continuously has few if any advantages. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</section>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<section id="free">
87a1c79b7b37702a254920ca5214fb282a4fb085dougm<title>free</title>
87a1c79b7b37702a254920ca5214fb282a4fb085dougm<p>This command is only available on Linux. It shows how much memory and swap space is in use. Linux allocates unused memory as file system cache. The free command shows usage both with and without this cache. The free command can be used to find out how much memory the operating system is using, as described in the paragraph &apos;<a href="/httpd/PerformanceScalingUp#S">Sizing MaxClients</a>&apos;. The output of free looks like this: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<example>
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rsesctemme@brutus:~$ free<br />
87a1c79b7b37702a254920ca5214fb282a4fb085dougm total used free shared buffers cached<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseMem: 4026028 3901892 124136 0 253144 841044<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse-/+ buffers/cache: 2807704 1218324<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseSwap: 3903784 12540 3891244
87a1c79b7b37702a254920ca5214fb282a4fb085dougm</example>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</section>
87a1c79b7b37702a254920ca5214fb282a4fb085dougm
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse<section id="vmstat">
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse<title>vmstat</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>This command is available on many unix platforms. It displays a large number of operating system metrics. Run without argument, it displays a status line for that moment. When a numeric argument is added, the status is redisplayed at designated intervals. For example, <code>vmstat 5</code> causes the information to reappear every five seconds. Vmstat displays the amount of virtual memory in use, how much memory is swapped in and out each second, the number of processes currently running and sleeping, the number of interrupts and context switches per second and the usage percentages of the CPU. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The following is <code>vmstat</code> output of an idle server: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<example>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse[sctemme@GayDeceiver sctemme]$ vmstat 5 3<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseprocs memory swap io system cpu<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrser b w swpd free buff cache si so bi bo in cs us sy i<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse0 0 0 0 186252 6688 37516 0 0 12 5 47 311 0 1 9<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse0 0 0 0 186244 6696 37516 0 0 0 16 41 314 0 0 10<br />
03181bdde77be8e10ed297a02db5d8f98ecb703ewrowe0 0 0 0 186236 6704 37516 0 0 0 9 44 314 0 0 100
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</example>
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>And this is output of a server that is under a load of one hundred simultaneous connections fetching static content: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<example>
e8f95a682820a599fe41b22977010636be5c2717jimsctemme@GayDeceiver sctemme]$ vmstat 5 3<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse procs memory swap io system cpu<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse r b w swpd free buff cache si so bi bo in cs us sy id<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse 1 0 1 0 162580 6848 40056 0 0 11 5 150 324 1 1 98<br />
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse 6 0 1 0 163280 6856 40248 0 0 0 66 6384 1117 42 25 32<br />
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse11 0 0 0 162780 6864 40436 0 0 0 61 6309 1165 33 28 40
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</example>
bb0b94431dc9a1591a0a38a6c48925c6d9213c83rse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The first line gives averages since the last reboot. The subsequent lines give information for five second intervals. The second argument tells vmstat to generate three reports and then exit. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</section>
e8f95a682820a599fe41b22977010636be5c2717jim<section id="SE Toolkit">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>SE Toolkit</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The SE Toolkit is a system monitoring toolkit for Solaris. Its programming language is based on the C preprocessor and comes with a number of sample scripts. It can use both the command line and the GUI to display information. It can also be programmed to apply rules to the system data. The example script shown in Figure 2, Zoom.se, shows green, orange or red indicators when utilization of various parts of the system rises above certain thresholds. Another included script, Virtual Adrian, applies performance tuning metrics according to. </p>
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<p>The SE Toolkit has drifted around for a while and has had several owners since its inception. It seems that it has now found a final home at Sunfreeware.com, where it can be downloaded at no charge. There is a single package for Solaris 8, 9 and 10 on SPARC and x86, and includes source code. SE Toolkit author Richard Pettit has started a new company, Captive Metrics4 that plans to bring to market a multiplatform monitoring tool built on the same principles as SE Toolkit, written in Java. </p>
e8f95a682820a599fe41b22977010636be5c2717jim
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</section>
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<section id="DTrace">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>DTrace</title>
7f683bb300df767164724ebc664f339ac396b434dougm<p>Given that DTrace is available for Solaris, FreeBSD and OS X, it might be worth exploring it. There&apos;s also mod_dtrace available for httpd. </p>
e8f95a682820a599fe41b22977010636be5c2717jim
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</section>
e8f95a682820a599fe41b22977010636be5c2717jim<section id="mod_status">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>mod_status</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The mod_status module gives an overview of the server performance at a given moment. It generates an HTML page with, among others, the number of Apache processes running and how many bytes each has served, and the CPU load caused by httpd and the rest of the system. The Apache Software Foundation uses mod_status on its own <a href="http://apache.org/server-status">web site</a>. If you put the <code>ExtendedStatus On</code> directive in your <code>httpd.conf</code>, the <code>mod_status</code> page will give you more information at the cost of a little extra work per request. </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</section>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</section>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme<section id="Web Server Log Files">
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<title>Web Server Log Files</title>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>Monitoring and analyzing the log files httpd writes is one of the most effective ways to keep track of your server health and performance. Monitoring the error log allows you to detect error conditions, discover attacks and find performance issues. Analyzing the access logs tells you how busy your server is, which resources are the most popular and where your users come from. Historical log file data can give you invaluable insight into trends in access to your server, which allows you to predict when your performance needs will overtake your server capacity. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<section id="Error Log">
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<title>Error Log</title>
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<p>The error log will contain messages if the server has reached the maximum number of active processes or the maximum number of concurrently open files. The error log also reflects when processes are being spawned at a higher-than-usual rate in response to a sudden increase in load. When the server starts, the stderr file descriptor is redirected to the error logfile, so any error encountered by httpd after it opens its logfiles will appear in this log. This makes it good practice to review the error log frequently. </p>
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<p>Before Apache httpd opens its logfiles, any errors will be written to the stderr stream. If you start httpd manually, this error information will appear on your terminal and you can use it directly to troubleshoot your server. If your httpd is started by a startup script, the destination of early error messages depends on their design. The <code>/var/log/messages</code> file is usually a good bet. On Windows, early error messages are written to the Applications Event Log, which can be viewed through the Event Viewer in Administrative Tools. </p>
14099c5540ce39114b5501a71ff96e40f48efc4bmartin<p>The Error Log is configured through the <code>ErrorLog</code> and <code>LogLevel</code> configuration directives. The error log of httpd’s main server configuration receives the log messages that pertain to the entire server: startup, shutdown, crashes, excessive process spawns, etc. The <code>ErrorLog</code> directive can also be used in virtual host containers. The error log of a virtual host receives only log messages specific to that virtual host, such as authentication failures and &apos;File not Found&apos; errors. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>On a server that is visible to the Internet, expect to see a lot of exploit attempt and worm attacks in the error log. A lot of these will be targeted at other server platforms instead of Apache, but the current state of affairs is that attack scripts just throw everything they have at any open port, regardless of which server is actually running or what applications might be installed. You could block these attempts using a firewall or <a href="http://www.modsecurity.org/">mod_security</a>, but this falls outside the scope of this discussion. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<p>The <code>LogLevel</code> directive determines the level of detail included in the logs. There are eight log levels as described here: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<table><tr><td><p> <strong>Level</strong> </p>
e8f95a682820a599fe41b22977010636be5c2717jim</td><td><p> <strong>Description</strong> </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td></tr><tr><td><p> emerg </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td><td><p> Emergencies - system is unusable. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td></tr><tr><td><p> alert </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td><td><p> Action must be taken immediately. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td></tr><tr><td><p> crit </p>
e8f95a682820a599fe41b22977010636be5c2717jim</td><td><p> Critical Conditions. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td></tr><tr><td><p> error </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td><td><p> Error conditions. </p>
e8f95a682820a599fe41b22977010636be5c2717jim</td></tr><tr><td><p> warn </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td><td><p> Warning conditions. </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td></tr><tr><td><p> notice </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</td><td><p> Normal but significant condition. </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</td></tr><tr><td><p> info </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</td><td><p> Informational. </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</td></tr><tr><td><p> debug </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme</td><td><p> Debug-level messages </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</td></tr></table><p>The default log level is warn. A production server should not be run on debug, but increasing the level of detail in the error log can be useful during troubleshooting. Starting with 2.3.8 <code>LogLevel</code> can be specified on a per module basis: </p>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse<example>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrseLogLevel debug mod_ssl:warn
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse</example>
cc003103e52ff9d5fe9bed567ef9438613ab4fbfrse
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>This puts all of the server in debug mode, except for <code>mod_ssl</code>, which tends to be very noisy. </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</section>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<section id="Access Log">
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<title>Access Log</title>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>Apache httpd keeps track of every request it services in its access log file. In addition to the time and nature of a request, httpd can log the client IP address, date and time of the request, the result and a host of other information. The various logging format features are documented in the <a href="http://httpd.apache.org/docs/current/mod/core.html#loglevel">manual</a>. This file exists by default for the main server and can be configured per virtual host by using the <code>TransferLog</code> or <code>CustomLog</code> configuration directive. </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>The access logs can be analyzed with any of several free and commercially available programs. Popular free analysis packages include Analog and Webalizer. Log analysis should be done offline so the web server machine is not burdened by processing the log files. Most log analysis packages understand the Common Log Format. The fields in the log lines are explained in in the following: </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<example>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm195.54.228.42 - - [24/Mar/2007:23:05:11 -0400] "GET /sander/feed/ HTTP/1.1" 200 9747<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm64.34.165.214 - - [24/Mar/2007:23:10:11 -0400] "GET /sander/feed/atom HTTP/1.1" 200 9068<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm60.28.164.72 - - [24/Mar/2007:23:11:41 -0400] "GET / HTTP/1.0" 200 618<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm85.140.155.56 - - [24/Mar/2007:23:14:12 -0400] "GET /sander/2006/09/27/44/ HTTP/1.1" 200 14172<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm85.140.155.56 - - [24/Mar/2007:23:14:15 -0400] "GET /sander/2006/09/21/gore-tax-pollution/ HTTP/1.1" 200 15147<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm74.6.72.187 - - [24/Mar/2007:23:18:11 -0400] "GET /sander/2006/09/27/44/ HTTP/1.0" 200 14172<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm74.6.72.229 - - [24/Mar/2007:23:24:22 -0400] "GET /sander/2006/11/21/os-java/ HTTP/1.0" 200 13457
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</example>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<table><tr><td><p> <strong>Field</strong> </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> <strong>Content</strong> </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> <strong>Explanation</strong> </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> Client IP </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> 195.54.228.42 </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> IP address where the request originated </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> RFC 1413 ident </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> - </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> Remote user identity as reported by their identd </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> username </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> - </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> Remote username as authenticated by Apache </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> timestamp </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> [24/Mar/2007:23:05:11 -0400] </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> Date and time of request </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> Request </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> &quot;GET /sander/feed/ HTTP/1.1&quot; </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> Request line </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> Status Code </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> 200 </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> Response code </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr><tr><td><p> Content Bytes </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> 9747 </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td><td><p> Bytes transferred w/o headers </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</td></tr></table>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</section>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<section id="Rotating Log Files">
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<title>Rotating Log Files</title>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>There are several reasons to rotate logfiles. Even though almost no operating systems out there have a hard file size limit of two Gigabytes anymore, log files simply become too large to handle over time. Additionally, any periodic log file analysis should not be performed on files to which the server is actively writing. Periodic logfile rotation helps keep the analysis job manageable, and allows you to keep a closer eye on usage trends. </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>On unix systems, you can simply rotate logfiles by giving the old file a new name using mv. The server will keep writing to the open file even though it has a new name. When you send a graceful restart signal to the server, it will open a new logfile with the configured name. For example, you could run a script from cron like this: </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<example>
a0e0d20b666cfc453ac76506079eb50e03997eefdougmAPACHE=/usr/local/apache2<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougmHTTPD=$APACHE/bin/httpd<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougmmv $APACHE/logs/access_log $APACHE/logarchive/access_log-‘date +%F‘<br />
a0e0d20b666cfc453ac76506079eb50e03997eefdougm$HTTPD -k graceful
a0e0d20b666cfc453ac76506079eb50e03997eefdougm</example>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>This approach also works on Windows, just not as smoothly. While the httpd process on your Windows server will keep writing to the log file after it has been renamed, the Windows Service that runs Apache can not do a graceful restart. Restarting a Service on Windows means stopping it and then starting it again. The advantage of a graceful restart is that the httpd child processes get to complete responding to their current requests before they exit. Meanwhile, the httpd server becomes immediately available again to serve new requests. The stop-start that the Windows Service has to perform will interrupt any requests currently in progress, and the server is unavailable until it is started again. Plan for this when you decide the timing of your restarts. </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<p>A second approach is to use piped logs. From the <code>CustomLog</code>, <code>TransferLog</code> or <code>ErrorLog</code> directives you can send the log data into any program using a pipe character (<code>|</code>). For instance: </p>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
a0e0d20b666cfc453ac76506079eb50e03997eefdougm<example>CustomLog "|/usr/local/apache2/bin/rotatelogs /var/log/access_log 86400" common</example>
a0e0d20b666cfc453ac76506079eb50e03997eefdougm
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme<p>The program on the other end of the pipe will receive the Apache log data on its stdin stream, and can do with this data whatever it wants. The rotatelogs program that comes with Apache seamlessly turns over the log file based on time elapsed or the amount of data written, and leaves the old log files with a timestamp suffix to its name. This method for rotating logfiles works well on unix platforms, but is currently broken on Windows. </p>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm</section>
ea6ff3396df1d6d43ee0ecfa3e26ada981d8e9a3sctemme<section id="Logging and Performance">
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm<title>Logging and Performance</title>
dd7c683f683624b082d430935b594df7406782c2dougm<p>Writing entries to the Apache log files obviously takes some effort, but the information gathered from the logs is so valuable that under normal circumstances logging should not be turned off. For optimal performance, you should put your disk-based site content on a different physical disk than the server log files: the access patterns are very different. Retrieving content from disk is a read operation in a fairly random pattern, and log files are written to disk sequentially. </p>
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm<p>Do not run a production server with your error <code>LogLevel</code> set to debug. This log level causes a vast amount of information to be written to the error log, including, in the case of SSL access, complete dumps of BIO read and write operations. The performance implications are significant: use the default warn level instead. </p>
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm<p>If your server has more than one virtual host, you may give each virtual host a separate access logfile. This makes it easier to analyze the logfile later. However, if your server has many virtual hosts, all the open logfiles put a resource burden on your system, and it may be preferable to log to a single file. Use the <code>%v</code> format character at the start of your <a href="/httpd/LogFormat" class="nonexistent">LogFormat</a> and starting 2.3.8 of your <code>ErrorLogFormat</code> to make httpd print the hostname of the virtual host that received the request or the error at the beginning of each log line. A simple Perl script can split out the log file after it rotates: one is included with the Apache source under <code>support/split-logfile</code>. </p>
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm<p>You can use the <code>BufferedLogs</code> directive to have Apache collect several log lines in memory before writing them to disk. This might yield better performance, but could affect the order in which the server&apos;s log is written. </p>
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm
dd7c683f683624b082d430935b594df7406782c2dougm</section>
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm</section>
22357f10585a847ebf7b084cbe1db07ba071aeb6dougm<section id="Generating A Test Load">
dd7c683f683624b082d430935b594df7406782c2dougm<title>Generating A Test Load</title>
dd7c683f683624b082d430935b594df7406782c2dougm<p>It is useful to generate a test load to monitor system performance under realistic operating circumstances. Besides commercial packages such as <a href="/httpd/LoadRunner" class="nonexistent">LoadRunner</a>, there are a number of freely available tools to generate a test load against your web server. </p>
dd7c683f683624b082d430935b594df7406782c2dougm<ul><li>Apache ships with a test program called ab, short for Apache Bench. It can generate a web server load by repeatedly asking for the same file in rapid succession. You can specify a number of concurrent connections and have the program run for either a given amount of time or a specified number of requests. </li>
dd7c683f683624b082d430935b594df7406782c2dougm<li>Another freely available load generator is http load11 . This program works with a URL file and can be compiled with SSL support. </li>
dd7c683f683624b082d430935b594df7406782c2dougm<li>The Apache Software Foundation offers a tool named flood12 . Flood is a fairly sophisticated program that is configured through an XML file. </li>
dd7c683f683624b082d430935b594df7406782c2dougm<li>Finally, JMeter13 , a Jakarta subproject, is an all-Java load-testing tool. While early versions of this application were slow and difficult to use, the current version 2.1.1 seems to be versatile and useful. </li>
dd7c683f683624b082d430935b594df7406782c2dougm<li><p>ASF external projects, that have proven to be quite good: grinder, httperf, tsung, <a href="/httpd/FunkLoad" class="nonexistent">FunkLoad</a> </p>
dd7c683f683624b082d430935b594df7406782c2dougm</li>
dd7c683f683624b082d430935b594df7406782c2dougm</ul>
f4311d5c9112156f84d47a1ca2ff6811de838031rpluem<p>When you load-test your web server, please keep in mind that if that server is in production, the test load may negatively affect the server’s response. Also, any data traffic you generate may be charged against your monthly traffic allowance. </p>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben</section>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben</section>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<section id="Configuring for Performance">
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<title>Configuring for Performance</title>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<section id="Apache Configuration">
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<title>Apache Configuration</title>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<p>The Apache 2.2 httpd is by default a pre-forking web server. When the server starts, the parent process spawns a number of child processes that do the actual work of servicing requests. But Apache httpd 2.0 introduced the concept of the Multi-Processing Module (MPM). Developers can write MPMs to suit the process- or threadingarchitecture of their specific operating system. Apache 2 comes with special MPMs for Windows, OS/2, Netware and BeOS. On unix-like platforms, the two most popular MPMs are Prefork and Worker. The Prefork MPM offers the same pre-forking process model that Apache 1.3 uses. The Worker MPM runs a smaller number of child processes, and spawns multiple request handling threads within each child process. In 2.3+ MPMs are no longer hard-wired. They too can be exchanged via <a href="/httpd/LoadModule" class="nonexistent">LoadModule</a>. The default MPM in 2.3 is the event MPM. </p>
f4311d5c9112156f84d47a1ca2ff6811de838031rpluem<p>The maximum number of workers, be they pre-forked child processes or threads within a process, is an indication of how many requests your server can manage concurrently. It is merely a rough estimate because the kernel can queue connection attempts for your web server. When your site becomes busy and the maximum number of workers is running, the machine doesn&apos;t hit a hard limit beyond which clients will be denied access. However, once requests start backing up, system performance is likely to degrade. </p>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<section id="MaxClients">
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<title>MaxClients</title>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<p>The <code>MaxClients</code> directive in your Apache httpd configuration file specifies the maximum number of workers your server can create. It has two related directives, <code>MinSpareServers</code> and <code>MaxSpareServers</code>, which specify the number of workers Apache keeps waiting in the wings ready to serve requests. The absolute maximum number of processes is configurable through the <code>ServerLimit</code> directive. </p>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
176c2742db03fcb7b7d13e6408dd967d87e542e9ben</section>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<section id="Spinning Threads">
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<title>Spinning Threads</title>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<p>For the prefork MPM of the above directives are all there is to determining the process limit. However, if you are running a threaded MPM the situation is a little more complicated. Threaded MPMs support the <code>ThreadsPerChild</code> directive1 . Apache requires that <code>MaxClients</code> is evenly divisible by <code>ThreadsPerChild</code>. If you set either directive to a number that doesn’t meet this requirement, Apache will send a message of complaint to the error log and adjust the <code>ThreadsPerChild</code> value downwards until it is an even factor of <code>MaxClients</code>. </p>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben</section>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<section id="Sizing MaxClients">
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<title>Sizing MaxClients</title>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<p>Optimally, the maximum number of processes should be set so that all the memory on your system is used, but no more. If your system gets so overloaded that it needs to heavily swap core memory out to disk, performance will degrade quickly. The formula for determining <code>MaxClients</code> is fairly simple: </p>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<example>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben total RAM − RAM for OS − RAM for external programs<br />
176c2742db03fcb7b7d13e6408dd967d87e542e9benMaxClients = -------------------------------------------------------<br />
176c2742db03fcb7b7d13e6408dd967d87e542e9ben RAM per httpd process
176c2742db03fcb7b7d13e6408dd967d87e542e9ben</example>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<p>The various amounts of memory allocated for the OS, external programs and the httpd processes is best determined by observation: use the top and free commands described above to determine the memory footprint of the OS without the web server running. You can also determine the footprint of a typical web server process from top: most top implementations have a Resident Size (RSS) column and a Shared Memory column. </p>
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben<p>The difference between these two is the amount of memory per-process. The shared segment really exists only once and is used for the code and libraries loaded and the dynamic inter-process tally, or &apos;scoreboard,&apos; that Apache keeps. How much memory each process takes for itself depends heavily on the number and kind of modules you use. The best approach to use in determining this need is to generate a typical test load against your web site and see how large the httpd processes become. </p>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<p>The RAM for external programs parameter is intended mostly for CGI programs and scripts that run outside the web server process. However, if you have a Java virtual machine running Tomcat on the same box it will need a significant amount of memory as well. The above assessment should give you an idea how far you can push <code>MaxClients</code>, but it is not an exact science. When in doubt, be conservative and use a low <code>MaxClients</code> value. The Linux kernel will put extra memory to good use for caching disk access. On Solaris you need enough available real RAM memory to create any process. If no real memory is available, httpd will start writing ‘No space left on device’ messages to the error log and be unable to fork additional child processes, so a higher <code>MaxClients</code> value may actually be a disadvantage. </p>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben
e0c3fda9f782aee1140d83fbce32672ac299f2a4ben</section>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<section id="Selecting your MPM">
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<title>Selecting your MPM</title>
176c2742db03fcb7b7d13e6408dd967d87e542e9ben<p>The prime reason for selecting a threaded MPM is that threads consume fewer system resources than processes, and it takes less effort for the system to switch between threads. This is more true for some operating systems than for others. On systems like Solaris and AIX, manipulating processes is relatively expensive in terms of system resources. On these systems, running a threaded MPM makes sense. On Linux, the threading implementation actually uses one process for each thread. Linux processes are relatively lightweight, but it means that a threaded MPM offers less of a performance advantage than in other environments. </p>
dd7c683f683624b082d430935b594df7406782c2dougm<p>Running a threaded MPM can cause stability problems in some situations For instance, should a child process of a preforked MPM crash, at most one client connection is affected. However, if a threaded child crashes, all the threads in that process disappear, which means all the clients currently being served by that process will see their connection aborted. Additionally, there may be so-called &quot;thread-safety&quot; issues, especially with third-party libraries. In threaded applications, threads may access the same variables indiscriminately, not knowing whether a variable may have been changed by another thread. </p>
6a26d195dfba3a91f8352cabd4547afa77675bb1aaron<p>This has been a sore point within the PHP community. The PHP processor heavily relies on third-party libraries and cannot guarantee that all of these are thread-safe. The good news is that if you are running Apache on Linux, you can run PHP in the preforked MPM without fear of losing too much performance relative to the threaded option. </p>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm</section>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<section id="Spinning Locks">
e18e68b42830409bf48de0df9eed3fe363664aa7aaron<title>Spinning Locks</title>
3c65aa88903de7330a07e133dfda779842fadad4wrowe<p>Apache httpd maintains an inter-process lock around its network listener. For all practical purposes, this means that only one httpd child process can receive a request at any given time. The other processes are either servicing requests already received or are &apos;camping out&apos; on the lock, waiting for the network listener to become available. This process is best visualized as a revolving door, with only one process allowed in the door at any time. On a heavily loaded web server with requests arriving constantly, the door spins quickly and requests are accepted at a steady rate. On a lightly loaded web server, the process that currently &quot;holds&quot; the lock may have to stay in the door for a while, during which all the other processes sit idle, waiting to acquire the lock. At this time, the parent process may decide to terminate some children based on its <code>MaxSpareServers</code> directive. </p>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm
a1696119fa668c01957eea97a616fcbe95da9492wrowe
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe</section>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<section id="The Thundering Herd">
6b441532f6ac4ebd1c4867ab5f8a0165247b178ewrowe<title>The Thundering Herd</title>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>The function of the &apos;accept mutex&apos; (as this inter-process lock is called) is to keep request reception moving along in an orderly fashion. If the lock is absent, the server may exhibit the Thundering Herd syndrome. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>Consider an American Football team poised on the line of scrimmage. If the football players were Apache processes all team members would go for the ball simultaneously at the snap. One process would get it, and all the others would have to lumber back to the line for the next snap. In this metaphor, the accept mutex acts as the quarterback, delivering the connection &quot;ball&quot; to the appropriate player process. </p>
6b441532f6ac4ebd1c4867ab5f8a0165247b178ewrowe<p>Moving this much information around is obviously a lot of work, and, like a smart person, a smart web server tries to avoid it whenever possible. Hence the revolving door construction. In recent years, many operating systems, including Linux and Solaris, have put code in place to prevent the Thundering Herd syndrome. Apache recognizes this and if you run with just one network listener, meaning one virtual host or just the main server, Apache will refrain from using an accept mutex. If you run with multiple listeners (for instance because you have a virtual host serving SSL requests), it will activate the accept mutex to avoid internal conflicts. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>You can manipulate the accept mutex with the <code>AcceptMutex</code> directive. Besides turning the accept mutex off, you can select the locking mechanism. Common locking mechanisms include fcntl, System V Semaphores and pthread locking. Not all are available on every platform, and their availability also depends on compile-time settings. The various locking mechanisms may place specific demands on system resources: manipulate them with care. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>There is no compelling reason to disable the accept mutex. Apache automatically recognizes the single listener situation described above and knows if it is safe to run without mutex on your platform. </p>
d54a31567fc49f1841d27a14796ae726016c54aadougm
3c65aa88903de7330a07e133dfda779842fadad4wrowe
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe</section>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm</section>
3c65aa88903de7330a07e133dfda779842fadad4wrowe<section id="Tuning the Operating System">
3c65aa88903de7330a07e133dfda779842fadad4wrowe<title>Tuning the Operating System</title>
3c65aa88903de7330a07e133dfda779842fadad4wrowe<p>People often look for the &apos;magic tune-up&apos; that will make their system perform four times as fast by tweaking just one little setting. The truth is, present-day UNIX derivatives are pretty well adjusted straight out of the box and there is not a lot that needs to be done to make them perform optimally. However, there are a few things that an administrator can do to improve performance. </p>
3c65aa88903de7330a07e133dfda779842fadad4wrowe
3c65aa88903de7330a07e133dfda779842fadad4wrowe
3c65aa88903de7330a07e133dfda779842fadad4wrowe<section id="RAM and Swap Space">
3c65aa88903de7330a07e133dfda779842fadad4wrowe<title>RAM and Swap Space</title>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>The usual mantra regarding RAM is &quot;more is better&quot;. As discussed above, unused RAM is put to good use as file system cache. The Apache processes get bigger if you load more modules, especially if you use modules that generate dynamic page content within the processes, like PHP and mod_perl. A large configuration file-with many virtual hosts-also tends to inflate the process footprint. Having ample RAM allows you to run Apache with more child processes, which allows the server to process more concurrent requests. </p>
a1696119fa668c01957eea97a616fcbe95da9492wrowe<p>While the various platforms treat their virtual memory in different ways, it is never a good idea to run with less disk-based swap space than RAM. The virtual memory system is designed to provide a fallback for RAM, but when you don&apos;t have disk space available and run out of swappable memory, your machine grinds to a halt. This can crash your box, requiring a physical reboot for which your hosting facility may charge you. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>Also, such an outage naturally occurs when you least want it: when the world has found your website and is beating a path to your door. If you have enough disk-based swap space available and the machine gets overloaded, it may get very, very slow as the system needs to swap memory pages to disk and back, but when the load decreases the system should recover. Remember, you still have <code>MaxClients</code> to keep things in hand. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>Most unix-like operating systems use designated disk partitions for swap space. When a system starts up it finds all swap partitions on the disk(s), by partition type or because they are listed in the file <code>/etc/fstab</code>, and automatically enables them. When adding a disk or installing the operating system, be sure to allocate enough swap space to accommodate eventual RAM upgrades. Reassigning disk space on a running system is a cumbersome process. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe<p>Plan for available hard drive swap space of at least twice your amount of RAM, perhaps up to four times in situations with frequent peaking loads. Remember to adjust this configuration whenever you upgrade RAM on your system. In a pinch, you can use a regular file as swap space. For instructions on how to do this, see the manual pages for the <code>mkswap</code> and <code>swapon</code> or <code>swap</code> programs. </p>
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe
b40799adcfd0f0a2a465c2934585986f7bbc9bbcwrowe</section>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<section id="ulimit: Files and Processes">
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<title>ulimit: Files and Processes</title>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<p>Given a machine with plenty of RAM and processor capacity, you can run hundreds of Apache processes if necessary. . . and if your kernel allows it. </p>
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme<p>Consider a situation in which several hundred web servers are running; if some of these need to spawn CGI processes, the maximum number of processes would occur quickly. </p>
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme<p>However, you can change this limit with the command </p>
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme<example>
8e09f1830f114c016598a3b76fd6d31e1589c012sctemmeulimit [-H|-S] -u [newvalue]
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme</example>
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme
8e09f1830f114c016598a3b76fd6d31e1589c012sctemme<p>This must be changed before starting the server, since the new value will only be available to the current shell and programs started from it. In newer Linux kernels the default has been raised to 2048. On FreeBSD, the number seems to be the rather unusual 513. In the default user shell on this system, <code>csh</code> the equivalent is <code>limit</code> and works analogous the the Bourne-like <code>ulimit</code>: </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemmelimit [-h] maxproc [newvalue]
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Similarly, the kernel may limit the number of open files per process. This is generally not a problem for pre-forked servers, which just handle one request at a time per process. Threaded servers, however, serve many requests per process and much more easily run out of available file descriptors. You can increase the maximum number of open files per process by running the </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>ulimit -n [newvalue]</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>command. Once again, this must be done prior to starting Apache. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</section>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<section id="Setting User Limits on System Startup">
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<title>Setting User Limits on System Startup</title>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Under Linux, you can set the ulimit parameters on bootup by editing the <code>/etc/security/limits.conf</code> file. This file allows you to set soft and hard limits on a per-user or per-group basis; the file contains commentary explaining the options. To enable this, make sure that the file <code>/etc/pam.d/login</code> contains the line </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>session required /lib/security/pam_limits.so</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>All items can have a &apos;soft&apos; and a &apos;hard&apos; limit: the first is the default setting and the second the maximum value for that item. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>In FreeBSD&apos;s <code>/etc/login.conf</code> these resources can be limited or extended system wide, analogously to <code>limits.conf</code>. &apos;Soft&apos; limits can be specified with <code>-cur</code> and &apos;hard&apos; limits with <code>-max</code>. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Solaris has a similar mechanism for manipulating limit values at boot time: In <code>/etc/system</code> you can set kernel tunables valid for the entire system at boot time. These are the same tunables that can be set with the <code>mdb</code> kernel debugger during run time. The soft and hard limit corresponding to ulimit -u can be set via: </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemmeset rlim_fd_max=65536<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemmeset rlim_fd_cur=2048
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Solaris calculates the maximum number of allowed processes per user (<code>maxuprc</code>) based on the total amount available memory on the system (<code>maxusers</code>). You can review the numbers with </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>sysdef -i | grep maximum</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>but it is not recommended to change them. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</section>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<section id="Turn Off Unused Services and Modules">
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<title>Turn Off Unused Services and Modules</title>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Many UNIX and Linux distributions come with a slew of services turned on by default. You probably need few of them. For example, your web server does not need to be running sendmail, nor is it likely to be an NFS server, etc. Turn them off. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>On Red Hat Linux, the chkconfig tool will help you do this from the command line. On Solaris systems <code>svcs</code> and <code>svcadm</code> will show which services are enabled and disable them respectively. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>In a similar fashion, cast a critical eye on the Apache modules you load. Most binary distributions of Apache httpd, and pre-installed versions that come with Linux distributions, have their modules enabled through the <code>LoadModule</code> directive. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Unused modules may be culled: if you don&apos;t rely on their functionality and configuration directives, you can turn them off by commenting out the corresponding <code>LoadModule</code> lines. Read the documentation on each module’s functionality before deciding whether to keep it enabled. While the performance overhead of an unused module is small, it&apos;s also unnecessary. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</section>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</section>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</section>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<section id="Caching Content">
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<title>Caching Content</title>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Requests for dynamically generated content usually take significantly more resources than requests for static content. Static content consists of simple filespages, images, etc.-on disk that are very efficiently served. Many operating systems also automatically cache the contents of frequently accessed files in memory. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Processing dynamic requests, on the contrary, can be much more involved. Running CGI scripts, handing off requests to an external application server and accessing database content can introduce significant latency and processing load to a busy web server. Under many circumstances, performance can be improved by turning popular dynamic requests into static requests. In this section, two approaches to this will be discussed. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<section id="Making Popular Pages Static">
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<title>Making Popular Pages Static</title>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>By pre-rendering the response pages for the most popular queries in your application, you can gain a significant performance improvement without giving up the flexibility of dynamically generated content. For instance, if your application is a flower delivery service, you would probably want to pre-render your catalog pages for red roses during the weeks leading up to Valentine&apos;s Day. When the user searches for red roses, they are served the pre-rendered page. Queries for, say, yellow roses will be generated directly from the database. The mod_rewrite module included with Apache is a great tool to implement these substitutions. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<section id="Example: A Statically Rendered Blog">
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<title>Example: A Statically Rendered Blog</title>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p><strong>&apos;we should provide a more useful example here. One showing how to make Wordpress or Drupal suck less.</strong>&apos; </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>Blosxom is a lightweight web log package that runs as a CGI. It is written in Perl and uses plain text files for entry input. Besides running as CGI, Blosxom can be run from the command line to pre-render blog pages. Pre-rendering pages to static HTML can yield a significant performance boost in the event that large numbers of people actually start reading your blog. </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>To run blosxom for static page generation, edit the CGI script according to the documentation. Set the $static dir variable to the <code>DocumentRoot</code> of the web server, and run the script from the command line as follows: </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>$ perl blosxom.cgi -password='whateveryourpassword'</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>This can be run periodically from Cron, after you upload content, etc. To make Apache substitute the statically rendered pages for the dynamic content, we’ll use mod_rewrite. This module is included with the Apache source code, but is not compiled by default. It can be built with the server by passing the option <code>--enable-rewrite[=shared]</code> to the configure command. Many binary distributions of Apache come with mod_rewrite included. The following is an example of an Apache virtual host that takes advantage of pre-rendered blog pages: </p>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<example>Listen *:8001<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme&lt;VirtualHost *:8001&gt;<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<indent>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme ServerName blog.sandla.org:8001<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme ServerAdmin sander@temme.net<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme DocumentRoot "/home/sctemme/inst/blog/httpd/htdocs"<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme &lt;Directory "/home/sctemme/inst/blog/httpd/htdocs"&gt;<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme <indent>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme Options +Indexes<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme Order allow,deny<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme Allow from all<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme RewriteEngine on<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme RewriteCond %{REQUEST_FILENAME} !-f<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme RewriteCond %{REQUEST_FILENAME} !-d<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme RewriteRule ^(.*)$ /cgi-bin/blosxom.cgi/$1 [L,QSA]<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme </indent>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme &lt;/Directory&gt;<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme RewriteLog /home/sctemme/inst/blog/httpd/logs/rewrite_log<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme RewriteLogLevel 9<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme ErrorLog /home/sctemme/inst/blog/httpd/logs/error_log<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme LogLevel debug<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme CustomLog /home/sctemme/inst/blog/httpd/logs/access_log common<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme ScriptAlias /cgi-bin/ /home/sctemme/inst/blog/bin/<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme &lt;Directory "/home/sctemme/inst/blog/bin"&gt;<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme <indent>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme Options +ExecCGI<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme Order allow,deny<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme Allow from all<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme </indent>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme &lt;/Directory&gt;<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</indent>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme&lt;/VirtualHost&gt;
dd9940ba9b4d9c09f034b910d1569db4a5111c75dougm</example>
e62985c7a1b46a5036a247f35bddac1308985758dougm
e8f95a682820a599fe41b22977010636be5c2717jim<p>The <code>RewriteCond</code> and <code>RewriteRule</code> directives say that, if the requested resource does not exist as a file or a directory, its path is passed to the Blosxom CGI for rendering. Blosxom uses Path Info to specify blog entries and index pages, so this means that if a particular path under Blosxom exists as a static file in the file system, the file is served instead. Any request that isn&apos;t pre- rendered is served by the CGI. This means that individual entries, which show the comments, are always served by the CGI which in turn means that your comment spam is always visible. This configuration also hides the Blosxom CGI from the user-visible URL in their Location bar. mod_rewrite is a fantastically powerful and versatile module: investigate it to arrive at a configuration that is best for your situation. </p>
e8f95a682820a599fe41b22977010636be5c2717jim
98f81eac9530d487f05013cda9df99755bb59689trawick
98f81eac9530d487f05013cda9df99755bb59689trawick</section>
98f81eac9530d487f05013cda9df99755bb59689trawick</section>
98f81eac9530d487f05013cda9df99755bb59689trawick<section id="Caching Content With mod_cache">
98f81eac9530d487f05013cda9df99755bb59689trawick<title>Caching Content With mod_cache</title>
98f81eac9530d487f05013cda9df99755bb59689trawick<p>The mod_cache module provides intelligent caching of HTTP responses: it is aware of the expiration timing and content requirements that are part of the HTTP specification. The mod_cache module caches URL response content. If content sent to the client is considered cacheable, it is saved to disk. Subsequent requests for that URL will be served directly from the cache. The provider module for mod_cache, mod_disk_cache, determines how the cached content is stored on disk. Most server systems will have more disk available than memory, and it&apos;s good to note that some operating system kernels cache frequently accessed disk content transparently in memory, so replicating this in the server is not very useful. </p>
98f81eac9530d487f05013cda9df99755bb59689trawick<p>To enable efficient content caching and avoid presenting the user with stale or invalid content, the application that generates the actual content has to send the correct response headers. Without headers like <code>Etag:</code>, <code>Last-Modified:</code> or <code>Expires:</code>, mod_cache can not make the right decision on whether to cache the content, serve it from cache or leave it alone. When testing content caching, you may find that you need to modify your application or, if this is impossible, selectively disable caching for URLs that cause problems. The mod_cache modules are not compiled by default, but can be enabled by passing the option <code>--enable-cache[=shared]</code> to the configure script. If you use a binary distribution of Apache httpd, or it came with your port or package collection, it may have mod_cache already included. </p>
98f81eac9530d487f05013cda9df99755bb59689trawick
98f81eac9530d487f05013cda9df99755bb59689trawick
98f81eac9530d487f05013cda9df99755bb59689trawick<section id="Example: wiki.apache.org">
e62985c7a1b46a5036a247f35bddac1308985758dougm<title>Example: wiki.apache.org</title>
98f81eac9530d487f05013cda9df99755bb59689trawick<p><strong>&apos;Is this still the case? Maybe we should give a better example here too.</strong> </p>
e62985c7a1b46a5036a247f35bddac1308985758dougm<p>The Apache Software Foundation Wiki is served by <a href="/httpd/MoinMoin">MoinMoin</a>. <a href="/httpd/MoinMoin">MoinMoin</a> is written in Python and runs as a CGI. To date, any attempts to run it under mod_python has been unsuccessful. The CGI proved to place an untenably high load on the server machine, especially when the Wiki was being indexed by search engines like Google. To lighten the load on the server machine, the Apache Infrastructure team turned to mod_cache. It turned out <a href="/httpd/MoinMoin">MoinMoin</a> needed a small patch to ensure proper behavior behind the caching server: certain requests can never be cached and the corresponding Python modules were patched to send the proper HTTP response headers. After this modification, the cache in front of the Wiki was enabled with the following configuration snippet in <code>httpd.conf</code>: </p>
e62985c7a1b46a5036a247f35bddac1308985758dougm
8464a9c46b967001e38fe3c8afff51a649e9de51dougm<example>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougmCacheRoot /raid1/cacheroot<br />
d94fd18ee21dc9b8c1f422144a881e941687d41fdougmCacheEnable disk /<br />
462f3213ebe7eb2a3527530497d0428e2298a034jorton# A page modified 100 minutes ago will expire in 10 minutes<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemmeCacheLastModifiedFactor .1<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme# Always check again after 6 hours<br />
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemmeCacheMaxExpire 21600
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme</example>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme<p>This configuration will try to cache any and all content within its virtual host. It will never cache content for more than six hours (the <code>CacheMaxExpire</code> directive). If no <code>Expires:</code> header is present in the response, mod_cache will compute an expiration period from the <code>Last-Modified:</code> header. The computation using <code>CacheLastModifiedFactor</code> is based on the assumption that if a page was recently modified, it is likely to change again in the near future and will have to be re-cached. </p>
8464a9c46b967001e38fe3c8afff51a649e9de51dougm<p>Do note that it can pay off to <em>disable</em> the <code>ETag:</code> header: For files smaller than 1k the server has to calculate the checksum (usually MD5) and then send out a <code>304 Not Modified</code> response, which will take waste some CPU and still saturate the same amount of network resources for the transfer (one TCP packet). For resources larger than 1k it might prove CPU expensive to calculate the header for each request. Unfortunately there does currently not exist a way to cache these headers. </p>
e8f95a682820a599fe41b22977010636be5c2717jim<example>
3c65aa88903de7330a07e133dfda779842fadad4wrowe&lt;FilesMatch \.(jpe?g|png|gif|js|css|x?html|xml)&gt;<br />
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<indent>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm FilesETag None<br />
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm</indent>
3c65aa88903de7330a07e133dfda779842fadad4wrowe&lt;/FilesMatch&gt;
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm</example>
3c65aa88903de7330a07e133dfda779842fadad4wrowe
8464a9c46b967001e38fe3c8afff51a649e9de51dougm<p>This will disable the generation of the <code>ETag:</code> header for most static resources. The server does not calculate these headers for dynamic resources. </p>
3c65aa88903de7330a07e133dfda779842fadad4wrowe
3c65aa88903de7330a07e133dfda779842fadad4wrowe
8464a9c46b967001e38fe3c8afff51a649e9de51dougm</section>
3c65aa88903de7330a07e133dfda779842fadad4wrowe</section>
e18e68b42830409bf48de0df9eed3fe363664aa7aaron</section>
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<section id="Further Considerations">
d94fd18ee21dc9b8c1f422144a881e941687d41fdougm<title>Further Considerations</title>
e62985c7a1b46a5036a247f35bddac1308985758dougm<p>Armed with the knowledge of how to tune a sytem to deliver the desired the performance, we will soon discover that <em>one</em> system might prove a bottleneck. How to make a system fit for growth, or how to put a number of systems into tune will be discussed in <a href="/httpd/PerformanceScalingOut">PerformanceScalingOut</a>. </p>
9e530d1e49062250c345bfd45810e145b4f435eddougm</section>
e62985c7a1b46a5036a247f35bddac1308985758dougm</manualpage>
1eddce0da057f6fa5c5e9dde32e9dc6596616b12sctemme