Lightsquid is a handy log analyzer for squid proxy. However, the project is not maintained since 2009.

Today, I found lightsquid doesn't work in 2021. Why?

# Lightsquid Error

When I run lightparser, the output looks like this:

$./lightparser.pl access.log.66666 >>> use file :: /var/log/squid/access.log.66666 run TIME: 1 sec LightSquid parser statistic report 3312 lines processed (average 3312.00 lines per second) 0 lines parsed 0 lines recovered 0 lines notrecovered 0 lines skiped by bad year 3312 lines skiped by date filter 0 lines skiped by Denied filter 0 lines skiped by skipURL filter WARNING !!!!, parsed 0 lines from total : 3312 please check confiuration !!!! may be wrong log format selected ? Seems that all lines are filtered by the date filter. I double checked the log format and make sure it is correct and the same with previous days. # Analyze the Problem Look at this piece of code (I am a Perl layman): here my$filterdatestop =timelocal(59,59,23,31,12-1,2020-1900)+1000;

$fToday=1 if ($ARGV[0] eq "today");
$fToday=1 if ($ARGV[0] eq "yesterday");

if ($fToday) { ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime;

$filterdate=sprintf("%04d%02d%02d",$year+1900,$mon+1,$mday);;
$filterdatestart=timelocal( 0, 0, 0,$mday,$mon,$year);
$filterdatestop =timelocal(59,59,23,$mday,$mon,$year);
print ">>> filter today: $filterdate\n" if ($debug);
}

if ($ARGV[0] eq "yesterday") {$filterdatestart=$filterdatestart-(24*60*60);$filterdatestop =$filterdatestop -(24*60*60); ($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst) = localtime($filterdatestart);$filterdate=sprintf("%04d%02d%02d",$year+1900,$mon+1,$mday);; print ">>> filter yesterday:$filterdate\n" if ($debug); } if ($ARGV[0] =~ m/^(\d\d\d\d)(\d\d)(\d\d)$/) {$filterdate=$ARGV[0];$filterdatestart=timelocal( 0, 0, 0,$3,$2-1,$1);$filterdatestop =timelocal(59,59,23,$3,$2-1,$1); print ">>> filter date:$filterdate\n" if ($debug); } if ($ARGV[0] =~ m/access\.log\.(\d)/) {
$filename=$ARGV[0];
$catname="zcat" if ($ARGV[0] =~ m/\.gz$/);$catname="bzcat" if ($ARGV[0] =~ m/\.bz2$/);
}

print ">>> use file :: $logpath/$filename\n" if ($debug); ... #speed optimization for FILTERDATE mode$Ltimestamp=substr $_,0,11; if ($Ltimestamp<$filterdatestart or$Ltimestamp>$filterdatestop) { print ">>>> skipDafteFilter URL$Lurl\n$_" if ($debug2 >= 2 );
$skipfilterdatecntr++; next; }; Found two issues: 1. A hardcode 2020 in $filterdatestop which makes lightsquid not work in 2021
2. today argument cannot work with customized file at the same time

## First Issue

Obviously, $filterdatestop is the culprit: my$filterdatestop =timelocal(59,59,23,31,12-1,2020-1900)+1000;

A quick fix is to change the hardcoded 2020 to 2100. It seems that the month and year parameter are weird...

I read the document of Perl's "Time::Local" and found that the design is really complex and hard to understand (personally I think it is anti-pattern, why not make it a simple design which supports only absolute year?):

• Years greater than 999 are interpreted as being the actual year, rather than the offset from 1900. Thus, 1964 would indicate the year Martin Luther King won the Nobel prize, not the year 3864.
• Years in the range 100..999 are interpreted as offset from 1900, so that 112 indicates 2012. This rule also applies to years less than zero (but see note below regarding date range).
• Years in the range 0..99 are interpreted as shorthand for years in the rolling "current century," defined as 50 years on either side of the current year. Thus, today, in 1999, 0 would refer to 2000, and 45 to 2045, but 55 would refer to 1955. Twenty years from now, 55 would instead refer to 2055. This is messy, but matches the way people currently think about two digit dates. Whenever possible, use an absolute four digit year instead.

Then I try to understand what these lines means:

use Time::Local;

my $date1 = timelocal(59,59,23,31,12-1,2020-1900); my$date2 = timelocal(59,59,23,31,12-1,2020-1900)+1000;
my $date3 = timelocal(59,59,23,31,11,2020); print "$date1\n";
print "$date2\n"; print "$date3\n";

Output:

1609430399
1609431399
1609430399

The solution is easy, change this line to(note that the month is zero indexed (0..11)), and now it works until 2100:

my $filterdatestop =timelocal(59,59,23,31,11,2100)+1000; ## Second Issue Another problem is that today argument cannot work with customized filename: $fToday=1 if ($ARGV[0] eq "today");$fToday=1 if ($ARGV[0] eq "yesterday"); if ($fToday) {
...
}

if ($ARGV[0] eq "yesterday") { ... } if ($ARGV[0] =~ m/^(\d\d\d\d)(\d\d)(\d\d)$/) {$filterdate=$ARGV[0];$filterdatestart=timelocal( 0, 0, 0,$3,$2-1,$1);$filterdatestop =timelocal(59,59,23,$3,$2-1,$1); print ">>> filter date:$filterdate\n" if ($debug); } if ($ARGV[0] =~ m/access\.log\.(\d)/) {
$filename=$ARGV[0];
$catname="zcat" if ($ARGV[0] =~ m/\.gz$/);$catname="bzcat" if ($ARGV[0] =~ m/\.bz2$/);
}

print ">>> use file :: $logpath/$filename\n" if ($debug); Apparently, only one argument ARGV[0] is applied. If you wrote the command like this and not fixed the first issue: $./lightparser.pl access.log.66666 today
or
$./lightparser.pl today access.log.66666 You will encounter the "parsed 0 lines" error. # Summary Just fix the first issue and you will make lightsquid works longer :-) $ ./lightparser.pl access.log.66666
>>> use file :: /var/log/squid/access.log.66666
>>> Make Report 20210102 (1577 - log line parsed)
>>> Make Report 20210103 (1572 - log line parsed)
run TIME: 1 sec
LightSquid parser statistic report

3312 lines processed (average 3312.00 lines per second)
3149 lines parsed
0 lines recovered
0 lines notrecovered
0 lines skiped by bad year
0 lines skiped by date filter
163 lines skiped by Denied filter
0 lines skiped by skipURL filter


Fixed version: https://github.com/finisky/lightsquid-1.8.1