"100+ Auto-Installing Software Titles For Your Web Site"
======================================================
Module mod_rewrite Tutorial (Part 4):
Special Directives and Examples
------------------------------------------------------
by
Dirk Brockhausen
------------------------------------------------------

In this final part of our tutorial we will take
a look at those special directives we haven't covered
yet.

These directives cannot be defined on directory level.

This means that you will have to be able to edit the
Apache webserver's configuration file (httpd.conf).
These permissions will usually only be assigned to
users "root" or "admin".

If you wish to log all operations effected by
mod_rewrite you can activate logging with the
following entries:

RewriteLog /usr/local/apache/logs/mod_rewrite_log
RewriteLogLevel 1

These entries are not written into the file
".htaccess" but in "Section 2: 'Main' server
configuration" of file "httpd.conf".

All mod_rewrite manipulations will be logged
in this file. The log file can have any name you
prefer. It can be referenced as an absolute path or
relative to ServerRoot.

If you wish to maintain separate log files for
individual virtual hosts, you will have to place the
pertinent entries in "Section 3: Virtual Hosts",
e.g.:


ServerAdmin [email protected]
DocumentRoot /usr/www/htdocs/yourdomain
ServerName yourdomain.com
RewriteLog /usr/apache/logs/yourdomain_mod_rewrite_log
RewriteLogLevel 1


(Note: If your email reader or browser wraps these
lines take care to enter them unwrapped in your file!)

The RewriteLogLevel can be defined within a range of
1 to 8. Normally, 1 will do fine. Higher levels are
only required for debugging purposes.

--------

Another directive which is very handy for cloaking
purposes are the so-called Rewriting Maps. These are
files consisting of key/value pairs, e.g. in the
simple format of an ordinary text file:

cde2c920.infoseek.com spider
205.226.201.32 spider
cde2c923.infoseek.com spider
205.226.201.35 spider
cde2c981.infoseek.com spider
205.226.201.129 spider
cde2cb23.infoseek.com spider
205.226.203.35 spider

These keys are, as you can see, hostnames or IPs.
In this simplistic example the value is always the
same, namely "spider".

This directive is entered either in the server
section 2 or in the virtual host section 3 in file
"httpd.conf":

RewriteMap botBase txt:/www/yourdomain/spiderspy.txt

The Rewriting Map will then be available across your
server.

The other directives are entered in file ".htaccess":

RewriteCond   ${botBase:%{REMOTE_HOST}} =spider [OR]
RewriteCond   ${botBase:%{REMOTE_ADDR}} =spider
RewriteRule   ^(.*)\.htm$  $1.htm [L]
RewriteRule   ^.*\.htm$  index.html [L]

The conditions will make the system check whether the
required access is generated by a spider. To this
effect a lookup of file "spiderspy.txt" is triggered.

If the key is found, the value "spider" is returned
and the condition is rendered as true.

Next, the first RewriteRule will be executed. This one
determines that the called for ".htm" page will be fed
to the spider. The variable $1 is equal to the part in
parentheses of "^(.*)\.htm$", i.e. the file name will
remain the same.

If the URL is called by a normal human visitor, rule 2
applies: the user will be redirected to page
"index.html".

As the ".htm" pages will only be read by spiders, they
can be optimized accordingly for the search engines.

You may also use a file in dbm format instead of an
ordinary text file. The binary data base format helps
accelerate the lookup which is particularly important
if you are operating from very large spider lists.

This example given above offers a simple cloaking
functionality. All ordinary visitors will always be
redirected to the site's "index.html" page and there
is no access logging beyond the mod_rewrite logs.

However, it does go to show how you can effectively
replace several lines of Perl code with just a few
lines of mod_rewrite.

Our last example will illustrate this in some greater
detail.

----

The objective is to present site visitors with your
"Picture of the Day". Visitors will click a link, e.g.:
< http://www.yourdomain.com/pic.html >
which will display a different picture every day.

We will work from these server variables:

TIME_MON
TIME_DAY

In file ".htaccess" we will enter the following
single code line:

RewriteRule ^pic.html$ pic-%{TIME_MON}-%{TIME_DAY}.html

(Note: If your email reader or browser wraps this line
take care to enter it unwrapped in your file!)

The URL called for will be rewritten, e.g. to:

pic-08-28.html
pic-08-29.html
pic-08-30.html
etc.

So all you have to do is upload the pertinent files
once, after which you won't need to tend to their
daily assignation anymore.

Obviously the time variables can also be used for
other periodicities.

------

With this final example our mod_rewrite tutorial has
come to its end.

Of course, we have not tackled each and every
directive, variable, etc. here.

Rather, we suggest you view this tutorial as a general
introduction intended to help you as a start off point
towards a more in-depth study of the mod_rewrite
module, enabling you to customize it according to your
specific requirements.

------------------------------------------------------

[Main text: 749 words/5122 characters]
======================================================
This text may freely be republished or distributed
provided the following resource box is included intact
either at the beginning or the end of the article and
a complimentary copy or notice (link) is sent to the
author at the address specified below:

------------------------------------------------------
Dirk Brockhausen is the co-founder and principal of
fantomaster.com Ltd. (UK) and fantomaster.com GmbH
(Belgium), a company specializing in webmasters
software development, industrial-strength cloaking and
search engine positioning services. He holds a
doctorate in physics and has worked as an SAP
consultant and software developer since 1994. He is
also Technical Editor of fantomNews, a free newsletter
focusing on search engine optimization, available at:
< http://fantomaster.com/fantomnews-sub.html >
You can contact him at
mailto:[email protected]
(c) copyright 2000 by fantomaster.com
------------------------------------------------------


[ Back ] [ Main Menu ]



Download Fuse Node.js Compiler