Introduction to Webalizer

Webalizer is a free log file analysis tool that generates highly detailed usage summaries for Web and FTP servers in easily comprehensible graphical and tabular formats. The logs are a statistical summary of user traffic on your server.

Webalizer reports are generated at a scheduled time in accordance with the configuration options that you set for the site at the time of creating the site.

Features of Webalizer

Some of the unique features of Webalizer are:

  1. Fast processing of log files

Log files, over a period of time, grow to very large sizes. A relatively small web site that has several thousand visitors a month will have approximately 25 megabytes of log files. Most log file analysis tools are slow, processing the log files at speeds of only 2-3 megabytes per minute. Webalizer can be very fast; on a 200Mhz Pentium machine, over 10,000 records can be processed in one second, with a 40 Megabyte file taking roughly 15 seconds (over 150,000 records).

  1. Portable

Webalizer is compatible with a range of hardware configurations ( Intel x86, Power PC, Sparc, Mips4 ) and operating systems (Linux, Solaris, OS/2) and can be easily ported from one system to another.

  1. Configurable reports

Webalizer can be configured with custom options at any time by modifying the Webalizer configuration file.

  1. Incremental processing of logs

Incremental processing of logs enables large log files to be split up into multiple files for processing, without any loss of data.

Supported log formats

Server logs follow defined standards in the way information is stored in a log file.  

Webalizer supports the following log formats.

  1. Common Log File (CLF) format

The CLF format specifies data such as the machine name or IP address from which the user made the request, as well as the information on the resource requested. This is the most commonly used format.

  1. Combined log format

An extension of the Common Log File format, with additional information on referrals, user agents and cookies.

  1. Xferlog format

The xferlog format is generated by the wu-ftpd utility and contains information on inbound and outbound requests.

  1. Squid proxy logs

Squid logs record access information, system configuration errors and resource consumption (for example, memory, disk space).

Report generation

Webalizer provides Web and FTP log reports in HTML format that can be viewed with any browser. It provides a highly detailed report of the activity on your server, manifest in a variety of reports.

When a log report is requested, Webalizer performs the following actions to produce the report.

  1. Reads the Webalizer configuration information (contained in the webalizer.conf file) for the site. Webalizer can read both GZ and ZIP log file formats. If it gets a GZ or ZIP format, it decompresses the file on the fly.

  2. Creates an output directory to store the log reports. If no directory is specified, the current working directory is used.

  3. Reads the history file (contains totals for previous months) for the site, if it exists.

  4. If incremental processing of logs is enabled, then it looks for a data file, named webalizer.current, that contains information about the previous run.

  5. Processes the server log file. If the log file contains information for multiple months, a separate HTML file is created for each month.

  6. Creates the usage summary (encapsulated in the index.htm file). The usage summary shows totals by month, and links to the HTML page of each month.

  7. A new history file is saved to disk, which contains the totals generated by Webalizer during the current run.

  8. If incremental processing is enabled, a data file (webalizer.current) is written, which captures necessary information about the current run.

For more information about Webalizer, visit the Home page of Webalizer.