<H1 ALIGN="CENTER">Apache module mod_proxy</H1>
This module is contained in the <code>mod_proxy.c</code> file for Apache 1.1,
or the <code>modules/proxy</code> subdirectory for Apache 1.2, and
is not compiled in by default. It provides for an <STRONG>HTTP 1.0</STRONG> caching proxy
server. It is only available in Apache 1.1 and later. Common configuration
questions are addressed <a href="#configs">after the directive
descriptions</a>.
<
p>This module was experimental in Apache
1.1.x. As of Apache 1.2, mod_proxy
stability is <
EM>greatly</
EM> improved.<
p>
This module implements a proxy/cache for Apache. It implements proxying
capability for several protocols, including <code>CONNECT</code> (for SSL).
The module can be configured to connect to other proxy modules for these
and other protocols.
<ul>
<li><a href="#proxyrequests">ProxyRequests</a>
<li><a href="#proxyremote">ProxyRemote</a>
<li><a href="#proxypass">ProxyPass</a>
<li><a href="#proxyblock">ProxyBlock</a>
<li><a href="#noproxy">NoProxy</a>
<li><a href="#proxydomain">ProxyDomain</a>
<li><a href="#cacheroot">CacheRoot</a>
<li><a href="#cachesize">CacheSize</a>
<li><a href="#cachemaxexpire">CacheMaxExpire</a>
<li><a href="#cachedefaultexpire">CacheDefaultExpire</a>
<li><a href="#cachelastmodifiedfactor">CacheLastModifiedFactor</a>
<li><a href="#cachegcinterval">CacheGcInterval</a>
<li><a href="#cachedirlevels">CacheDirLevels</a>
<li><a href="#cachedirlength">CacheDirLength</a>
<li><a href="#nocache">NoCache</a>
</ul>
<
A name="proxyrequests"><
h2>ProxyRequests</
h2></
A>
<
strong>Syntax:</
strong> ProxyRequests <
em>
on/
off</
em><
br>
<
strong>Default:</
strong> <
code>ProxyRequests Off</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> ProxyRequests is only available in
This allows or prevents Apache from functioning as a proxy
server. Setting ProxyRequests to 'off' does not disable use of the
<a href="#proxypass">ProxyPass</a> directive.
<
A name="proxyremote"><
h2>ProxyRemote</
h2></
A>
<
strong>Syntax:</
strong> ProxyRemote <
em><match> <remote-server></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> ProxyRemote is only available in
This defines remote proxies to this proxy. &lt;match&gt; is either the
name of a URL-scheme that the remote server supports, or a partial URL
for which the remote server should be used, or '*' to indicate the
server should be contacted for all requests. &lt;remote-server&gt; is a
partial URL for the remote server. Syntax:
<pre>
  &lt;remote-server&gt; = &lt;protocol&gt;://&lt;hostname&gt;[:port]
</pre>
&lt;protocol&gt; is the protocol that should be used to communicate
with the remote server; only "http" is supported by this module.
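<p>As a hedged illustration of the forms &lt;match&gt; can take, the lines
below sketch one rule per form: a partial URL, the '*' wildcard, and a
URL-scheme name. The host names and port numbers are hypothetical
placeholders, not servers named by this documentation.</p>
<pre>
  ProxyRemote http://goodguys.example.com/ http://mirrorguys.example.com:8000
  ProxyRemote * http://cleversite.example.com
  ProxyRemote ftp http://ftpproxy.example.com:8080
</pre>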
In the last example, the proxy will forward FTP requests, encapsulated
as yet another HTTP proxy request, to another proxy which can handle them.
<
A name="proxypass"><
h2>ProxyPass</
h2></
A>
<
strong>Syntax:</
strong> ProxyPass <
em><path> <url></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> ProxyPass is only available in
This directive allows remote servers to be mapped into the space of the local
server; the local server does not act as a proxy in the conventional sense,
but appears to be a mirror of the remote server. <path> is the name of
a local virtual path; <url> is a partial URL for the remote server.
A local request for a URL beneath &lt;path&gt; is
internally converted into a proxy request to the corresponding URL beneath &lt;url&gt;.
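<p>A minimal sketch of such a mapping, assuming for illustration that the
local server answers at <code>http://www.example.org/</code> and that the
remote server is <code>http://foo.example.com</code> (both names are
hypothetical):</p>
<pre>
  ProxyPass /mirror/foo http://foo.example.com
</pre>
<p>Under these assumptions, a local request for
<code>http://www.example.org/mirror/foo/bar</code> would be converted
internally into a proxy request to <code>http://foo.example.com/bar</code>.</p>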
<
A name="proxyblock"><
h2>ProxyBlock</
h2></
A>
<
strong>Syntax:</
strong> ProxyBlock <
em><
word/
host/
domain list></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> ProxyBlock is only available in
The ProxyBlock directive specifies a list of words, hosts
and/
or domains,
separated by spaces. HTTP, HTTPS, and FTP document requests to matched words,
hosts or domains are <
em>blocked</
em> by the proxy server. The proxy module
will also attempt to determine IP addresses of list items which may be
hostnames during startup, and cache them for match test as well. Example:
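<p>A list of this kind might look like the following sketch. The first two
host names are hypothetical placeholders; <code>rocky.wotsamattau.edu</code>
is used only so that the entry fits the 'wotsamattau' remark in this
section.</p>
<pre>
  ProxyBlock joes-garage.example.com some-host.example.co.uk rocky.wotsamattau.edu
</pre>
<p>Because the module tries to resolve list items at startup, a host such as
<code>rocky.wotsamattau.edu</code> would presumably also be matched when it
is referenced by IP address, provided the lookup succeeded.</p>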
Note that 'wotsamattau' would also be sufficient to match 'wotsamattau.edu'.<p>
Note also that <code>ProxyBlock *</code> blocks connections to all sites.
<
A name="noproxy"><
h2>NoProxy</
h2></
A>
<
strong>Syntax:</
strong> NoProxy { <
A HREF="#domain"><
em><Domain></
em></
A>
| <
A HREF="#subnet"><
em><SubNet></
em></
A>
| <
A HREF="#ipaddr"><
em><IpAddr></
em></
A>
| <
A HREF="#hostname"><
em><Hostname></
em></
A> }<br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> NoProxy is only available in
This directive is only useful for Apache proxy servers within intranets.
The NoProxy directive specifies a list of subnets, IP addresses, hosts
and/
or domains, separated by spaces. A request to a host which matches
one or more of these is always served directly, without forwarding to
the configured ProxyRemote proxy server(s).
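<p>A sketch of how these directives might be combined on an intranet proxy,
assuming a hypothetical firewall proxy at <code>firewall.example.com</code>
on port 81, a local domain <code>.example.com</code>, and an illustrative
local subnet:</p>
<pre>
  ProxyRemote  *  http://firewall.example.com:81
  NoProxy         .example.com 192.168.112.0/21
</pre>
<p>With these settings, every request would be forwarded to the firewall
proxy except those for hosts in <code>.example.com</code> or in the listed
subnet, which would be served directly.</p>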
Each argument to the NoProxy directive is one of the following types:
<!-- ===================== Domain ======================= --> <
DD>A <
EM>Domain</
EM> is a partially qualified DNS domain name, preceded
by a period.
It represents a list of hosts which logically belong to the same DNS
domain or zone (i.e., the hostnames all end in <EM>Domain</EM>).<BR>
To distinguish <
EM>Domain</
EM>s from <
A HREF="#hostname"><
EM>Hostname</
EM></
A>s (both
syntactically and semantically; a DNS domain can have a DNS A record,
too!), <
EM>Domain</
EM>s are always written
with a leading period.<
BR>
Note: Domain name comparisons are done without regard to the case,
and <
EM>Domain</
EM>s are always assumed to be anchored in the root
of the DNS tree, therefore two domains <SAMP>.MyDomain.com</SAMP> and
<SAMP>.mydomain.com.</SAMP> (note the trailing period) are
considered equal. Since a domain comparison does not involve a DNS
lookup, it is much more efficient than subnet comparison.
<!-- ===================== SubNet ======================= --> <
DD>A <
EM>SubNet</
EM> is a partially qualified internet address in
numeric (dotted quad) form, optionally followed by a slash and the
netmask, specified as the number of significant bits in the
<
EM>SubNet</
EM>. It is used to represent a subnet of hosts which can
be reached over a common network interface. In the absence of the
explicit net mask it is assumed that omitted (or zero valued)
trailing digits specify the mask. (In this case, the netmask can
only be multiples of 8 bits wide.)<
BR>
<
DT><
SAMP>192.168</
SAMP> or <
SAMP>192.168.0.0</
SAMP>
<
DD>the subnet 192.168.0.0 with an implied netmask of 16 valid bits
(sometimes used in the netmask form <
SAMP>255.255.0.0</
SAMP>)
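<DT>A <EM>SubNet</EM> written with an explicit <SAMP>/21</SAMP> suffix
<DD>a subnet with a netmask of 21 valid bits (also used in the form 255.255.248.0)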
As a degenerate case, a <
EM>SubNet</
EM> with 32 valid bits is the
equivalent to an <
EM>IPAddr</
EM>, while a <
EM>SubNet</
EM> with zero
valid bits (
e.g., 0.0.0.0/0) is the same as the constant
<
EM>_Default_</
EM>, matching any IP address.
<!-- ===================== IPAddr ======================= --> <
DD>A <
EM>IPAddr</
EM> represents a fully qualified internet address in
numeric (dotted quad) form. Usually, this address represents a
host, but there need not necessarily be a DNS domain name
connected with the address.<
BR>
Example: 192.168.123.7<
BR>
Note: An <
EM>IPAddr</
EM> does not need to be resolved by the DNS
system, so it can result in more efficient Apache performance.
<
p><
strong>See Also:</
strong>
<!-- ===================== Hostname ======================= -->
<DT><A NAME="hostname"><EM>Hostname</EM></A>
<
DD>A <
EM>Hostname</
EM> is a fully qualified DNS domain name which can
be resolved to one or more <
A HREF="#ipaddr"><
EM>IPAddrs</
EM></
A> via the DNS domain name service.
It represents a logical host (in contrast to
<
A HREF="#domain"><
EM>Domain</
EM></
A>s, see
above) and must be resolvable to at least one <
A HREF="#ipaddr"><
EM>IPAddr</
EM></
A> (or often to a list of hosts
with different <
A HREF="#ipaddr"><
EM>IPAddr</
EM></
A>'s).<
BR>
Note: In many situations, it is more effective to specify an
<
A HREF="#ipaddr"><
EM>IPAddr</
EM></
A> in place of a
<
EM>Hostname</
EM> since a DNS lookup
can be avoided. Name resolution in Apache can take a considerable amount
of time when the connection to the name server uses a slow PPP link.<BR>
Note: <
EM>Hostname</
EM> comparisons are done without regard to the case,
and <
EM>Hostname</
EM>s are always assumed to be anchored in the root
of the DNS tree.
<
p><
strong>See Also:</
strong>
<
A name="proxydomain"><
h2>ProxyDomain</
h2></
A>
<
strong>Syntax:</
strong> ProxyDomain <
em><Domain></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> ProxyDomain is only available in
This directive is only useful for Apache proxy servers within intranets.
The ProxyDomain directive specifies the default domain to which the Apache
proxy server will belong. If a request to a host without a domain name
is encountered, a redirection response to the same host
with the configured <
em>Domain</
em> appended will be generated.
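<p>To make the redirection concrete, here is a minimal sketch that assumes a
hypothetical local domain <code>.example.com</code>:</p>
<pre>
  ProxyDomain .example.com
</pre>
<p>Under this assumption, a proxy request for <code>http://www/</code> would
be answered with a redirect to <code>http://www.example.com/</code>.</p>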
<
A name="cacheroot"><
h2>CacheRoot</
h2></
A>
<
strong>Syntax:</
strong> CacheRoot <
em><directory></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheRoot is only available in
Sets the name of the directory to contain cache files; this must be
writable by the httpd server.
<
A name="cachesize"><
h2>CacheSize</
h2></
A>
<
strong>Syntax:</
strong> CacheSize <
em><size></
em><
br>
<
strong>Default:</
strong> <
code>CacheSize 5</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheSize is only available in
Sets the desired space usage of the cache, in KB (1024-byte units). Although
usage may grow above this setting, the garbage collection will delete files
until the usage is at or below this setting.
<
A name="cachegcinterval"><
h2>CacheGcInterval</
h2></
A>
<
strong>Syntax:</
strong> CacheGcInterval <
em><time></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheGcInterval is only available in
Check the cache every <time> hours, and delete files if the space
usage is greater than that set by CacheSize.
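<p>The cache directives described so far could be combined as in the sketch
below. The directory path and the numbers are illustrative assumptions, not
recommended values.</p>
<pre>
  CacheRoot        /usr/local/apache/proxy
  CacheSize        5000
  CacheGcInterval  4
</pre>
<p>This would ask for cache files to be kept under the named directory,
aim for about 5000 KB of space usage, and check that usage every 4 hours.</p>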
<
A name="cachemaxexpire"><
h2>CacheMaxExpire</
h2></
A>
<
strong>Syntax:</
strong> CacheMaxExpire <
em><time></
em><
br>
<
strong>Default:</
strong> <
code>CacheMaxExpire 24</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheMaxExpire is only available in
Cacheable HTTP documents will be retained for at most &lt;time&gt; hours without
checking the origin server. Thus documents can be at most &lt;time&gt;
hours out of date. This restriction is enforced even if an expiry date
was supplied with the document.
<
A name="cachelastmodifiedfactor"><
h2>CacheLastModifiedFactor</
h2></
A>
<
strong>Syntax:</
strong> CacheLastModifiedFactor <
em><factor></
em><
br>
<
strong>Default:</
strong> <
code>CacheLastModifiedFactor 0.1</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheLastModifiedFactor is only available in
If the origin HTTP server did not supply an expiry date for the
document, then estimate one using the formula
<pre>
  expiry-period = time-since-last-modification * &lt;factor&gt;
</pre>
For example, if the document was last modified 10 hours ago, and
&lt;factor&gt; is 0.1, then the expiry period will be set to 10*0.1 = 1 hour.
<
p>If the expiry-period would be longer than that set by CacheMaxExpire,
then the latter takes precedence.
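<p>A worked sketch of how the two settings interact, using the default values
given in this document and document ages chosen purely for illustration:</p>
<pre>
  CacheLastModifiedFactor 0.1
  CacheMaxExpire          24
</pre>
<p>A document last modified 10 hours ago would get an estimated expiry period
of 10*0.1 = 1 hour. A document last modified 300 hours ago would get an
estimate of 300*0.1 = 30 hours, which exceeds CacheMaxExpire, so 24 hours
would be used instead.</p>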
<
A name="cachedirlevels"><
h2>CacheDirLevels</
h2></
A>
<
strong>Syntax:</
strong> CacheDirLevels <
em><levels></
em><
br>
<
strong>Default:</
strong> <
code>CacheDirLevels 3</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheDirLevels is only available in
CacheDirLevels sets the number of levels of subdirectories in the cache.
Cached data will be saved this many directory levels below CacheRoot.
<
A name="cachedirlength"><
h2>CacheDirLength</
h2></
A>
<
strong>Syntax:</
strong> CacheDirLength <
em><length></
em><
br>
<
strong>Default:</
strong> <
code>CacheDirLength 1</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheDirLength is only available in
CacheDirLength sets the number of characters in proxy cache subdirectory names.
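<p>As a rough sketch of how CacheDirLevels and CacheDirLength shape the cache
tree, consider the defaults together with a hypothetical CacheRoot:</p>
<pre>
  CacheRoot       /usr/local/apache/proxy
  CacheDirLevels  3
  CacheDirLength  1
</pre>
<p>With these values a cached file would sit three directories below
CacheRoot, each directory name being one character long, for example
<code>/usr/local/apache/proxy/a/b/c/</code>. The letters are placeholders;
the actual names are chosen by the module.</p>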
<
A name="cachedefaultexpire"><
h2>CacheDefaultExpire</
h2></
A>
<
strong>Syntax:</
strong> CacheDefaultExpire <
em><time></
em><
br>
<
strong>Default:</
strong> <
code>CacheDefaultExpire 1</
code><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> CacheDefaultExpire is only available in
If the document is fetched via a protocol that does not support expiry times,
then use <time> hours as the expiry time.
<
a href="#cachemaxexpire">CacheMaxExpire</
a> does <
strong>not</
strong> override this setting.
<
A name="nocache"><
h2>NoCache</
h2></
A>
<
strong>Syntax:</
strong> NoCache <
em><
word/
host/
domain list></
em><
br>
<
strong>Default:</
strong> <
EM>None</
EM><
br>
<
strong>Context:</
strong> server config, virtual host<
br>
<
strong>Override:</
strong> <
EM>Not applicable</
EM><
br>
<
strong>Status:</
strong> Base<
br>
<
strong>Module:</
strong> mod_proxy<
br>
<
strong>Compatibility:</
strong> NoCache is only available in
The NoCache directive specifies a list of words, hosts
and/
or domains, separated
by spaces. HTTP and non-passworded FTP documents from matched words, hosts or
domains are <
em>not</
em> cached by the proxy server. The proxy module will
also attempt to determine IP addresses of list items which may be hostnames
during startup, and cache them for match test as well. Example:
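<p>A list of this kind might look like the following sketch. The first two
host names are hypothetical placeholders; <code>bullwinkle.wotsamattau.edu</code>
is used only so that the entry fits the 'wotsamattau' remark in this
section.</p>
<pre>
  NoCache joes-garage.example.com some-host.example.co.uk bullwinkle.wotsamattau.edu
</pre>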
Note that 'wotsamattau' would also be sufficient to match 'wotsamattau.edu'.<p>
Note also that <code>NoCache *</code> disables caching completely.<p>
<
a name="configs"><
h2>Common configuration topics</
h2></
a>
<ul>
<li><a href="#access">Controlling access to your proxy</a>
<li><a href="#shortname">Using Netscape hostname shortcuts</a>
<li><a href="#mimetypes">Why doesn't file type <EM>xxx</EM> download via FTP?</a>
<li><a href="#startup">Why does Apache start more slowly when using the proxy module?</a>
<li><a href="#socks">Can I use the Apache proxy module with my SOCKS proxy?</a>
<li><a href="#intranet">What other functions are useful for an intranet proxy server?</a>
</ul>
<
h2><
a name="access">Controlling access to your proxy</
a></
h2>
You can control who can access your proxy via the normal &lt;Directory&gt;
control block using the following example:<p>
<pre>
&lt;Directory proxy:*&gt;
&lt;Limit GET PUT POST DELETE CONNECT OPTIONS&gt;
order deny,allow
deny from [machines you'd like *not* to allow by IP address or name]
allow from [machines you'd like to allow by IP address or name]
&lt;/Limit&gt;
&lt;/Directory&gt;
</pre>
A &lt;Files&gt; block will also work, and is the only method known to work
for all possible URLs in Apache versions earlier than 1.2b10.<p>
<
h2><
a name="shortname">Using Netscape hostname shortcuts</
a></
h2>
There is an optional patch to the proxy module to allow Netscape-like
hostname shortcuts to be used. It's available
<
h2><
a name="mimetypes">Why doesn't file type <
EM>xxx</
EM> download via FTP?</
a></
h2>
You probably don't have that particular file type defined as
<EM>application/octet-stream</EM> in your proxy's mime.types configuration
file. A useful line can be<p>
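<p>As a sketch only, assuming the type in question is
<code>application/octet-stream</code> and the file being edited is the
proxy's <code>mime.types</code>, a line of the following shape would map
common binary extensions to that type. The extension list is illustrative and
can be adjusted.</p>
<pre>
  application/octet-stream   bin dms lha lzh exe class tgz taz
</pre>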
<h2><a name="startup">Why does Apache start more slowly when using the proxy module?</a></h2>
If you're using the <
code>ProxyBlock</
code> or <
code>NoCache</
code>
directives, hostnames' IP addresses are looked up and cached during
startup for later match test. This may take a few seconds (or more)
depending on the speed with which the hostname lookups occur.<
p>
<
h2><
a name="socks">Can I use the Apache proxy module with my SOCKS proxy?</
a></
h2>
Yes. Just build Apache with the rule <
code>SOCKS4=yes</
code> in your
<
EM>Configuration</
EM> file, and follow the instructions there. SOCKS5
capability can be added in a similar way (there's no <
code>SOCKS5</
code>
rule yet), so use the <
code>EXTRA_LDFLAGS</
code> definition, or build Apache
normally and run it with the <
EM>runsocks</
EM> wrapper provided with SOCKS5,
if your OS supports dynamically linked libraries.<
p>
Some users have reported problems when using SOCKS version 4.2 on Solaris.
The problem was solved by upgrading to SOCKS 4.3.<
p>
Remember that you'll also have to grant access to your Apache proxy machine by
permitting connections on the appropriate ports in your SOCKS daemon's
configuration.
<
h2><
a name="intranet">What other functions are useful for an intranet proxy server?</
a></
h2>
<
p>An Apache proxy server situated in an intranet needs to forward external
requests through the company's firewall. However, when it has to access
resources within the intranet, it can bypass the firewall when accessing
hosts. The <
A HREF="#noproxy">NoProxy</
A> directive is useful for specifying
which hosts belong to the intranet and should be accessed directly.</
p>
<
p>Users within an intranet tend to omit the local domain name from their
requests. Some proxy servers let them get
away with this and simply serve the request, implying a configured
local domain. When the <
A HREF="#proxydomain">ProxyDomain</
A> directive
is used and the server is <
A HREF="#proxyrequests">configured for
proxy service</
A>, Apache can return a redirect response and send the client
to the correct, fully qualified, server address. This is the preferred method
since the user's bookmark files will then contain fully qualified hosts.</
p>