Webserver Log Analysis

Choice
I have chosen Webalizer over Analog, because I had trouble getting Analog to work as I wanted... (not very much trouble, but I'm extremely lazy!)

Installation

 * 1) emerge webalizer   <- (or perhaps # emerge app-admin/webalizer)

Create a cron script for automatic analysis. When my logs grow in size, perhaps it should only be run once a day (e.g. use )


 * 1) vim /etc/cron.hourly/analyze_apache_logs.sh

#!/bin/sh webalizer

Configuration
LogFile       /var/log/apache2/access_log OutputDir     /var/www/path/to/webalizer
 * 1) mkdir /var/www/path/to/webalizer
 * 2) chown schmid:users /var/www/path/to/webalizer
 * 3) vim /etc/webalizer.conf

/etc/webalizer.conf:

LogFile       /var/log/apache2/access_log OutputDir     /var/www/schmid.dk/public/private/analysis HostName      schmid.dk PageType        htm* PageType       php

DNSCache       /tmp/dns_cache.db DNSChildren     5
 * 1) these are necessary for DNS names to appear in the analysis

CountryGraph   yes DailyGraph     yes DailyStats     no HourlyGraph     yes HourlyStats    no GraphLegend     yes

TopKSites      0 TopURLs        30 TopKURLs       0 TopCountries   10 TopEntry       0 TopExit        0

HideSite       localhost HideReferrer   *.schmid.dk* HideURL        *createpic.php*

GroupSite      inktomisearch.com HideSite       inktomisearch.com GroupSite      googlebot.com HideSite       googlebot.com