Introduction to Webalizer
Webalizer is a free log file analysis application that generates highly detailed usage summaries for Web and FTP servers in easily comprehensible graphical and tabular formats. The logs generated are a statistical encapsulation of user traffic on your server.
Webalizer reports are generated at a scheduled time and in accordance with the configuration options you configured for the site at the time of creating the site.
Note: The webalizer.conf file for the Web server is located at /home/virtual/<site_name>/etc/webalizer/web, where <site_name> is the name of the site for which you configured Webalizer. The webalizer.conf file for the FTP server is located at /home/virtual/<site_name>/etc/webalizer/ftp, where <site_name> is the name of the site for which you configured Webalizer.
Fast processing of log files
Log files, over a period of time, grow to very large sizes. A relatively small Web site that has several thousand visitors a month will have approximately 25 megabytes of log files. Most log file analysis tools are slow, processing the log files at speeds of only 2-3 megabytes per minute. Webalizer can be very fast; on a 200Mhz Pentium machine, over 10,000 records can be processed in one second, with a 40 Megabyte file taking roughly 15 seconds (over 150,000 records).
Portable
Webalizer is compatible with a range of hardware configurations ( Intel x86, Power PC, Sparc, Mips4 ) and operating systems (Linux, Solaris, OS/2) and can be easily ported from one system to another.
Configurable reports
Webalizer can be configured with custom options at any time by modifying the Webalizer configuration file.
Incremental processing of logs
Incremental processing of logs enables large log files to be split up into multiple files for processing, without any loss of data.
Server logs follow defined standards in the way information is stored in a log file.
Webalizer supports the following log formats.
Common Log File (CLF) format
The CLF format specifies data such as the machine name or IP address from which the user made the request, as well as the information on the resource requested. This is the most commonly used format.
Combined log format
An extension of the Common Log File format, with additional information on referrals, user agents and cookies.
Xferlog format
The xferlog format is generated by the wu-ftpd and contains information on inbound and outbound requests.
Squid proxy logs
Squid logs record access information, system configuration errors and resource consumption (for example, memory, disk space).
Webalizer provides Web and FTP log reports in HTML format that can be viewed with any browser. It provides a highly detailed report of the activity on your server, manifest in a variety of reports.
When a log report is requested, Webalizer performs the following actions to produce the report.
Reads for the Webalizer configuration information (contained in the webalizer.conf file) for the site. Webalizer can read both GZ and ZIP file formats. If it gets a GZ or ZIP format, it decompresses the file on the fly.
Creates an output directory to store the log reports. If no directory is specified, the current working directory is used.
Reads the history file (contains totals for previous months) for the site, if it exists.
If incremental processing of logs is enabled, then it looks for a data file, named webalizer.current, that contains information about the previous run.
Processes the Web server log file. If the log file contains information for multiple months, a separate HTML file is created for each month.
Creates the usage summary (encapsulated in the index.htm file). The usage summary shows totals by month, and links to the HTML page of each month.
A new history file is saved to disk, which contains the totals generated by Webalizer during the current run.
If incremental processing is enabled, a data file (webalizer.current) is written, which captures necessary information about the current run.