Nick Finck says, "when it comes to web statistics, be very skeptical." I'm inclined to agree with Nick, but I also believe that web stats are a very powerful and useful tool. But, like any power tool, one needs to know how to use it before running around and applying it willy-nilly to whatever one comes across. Thus, I thought it might be nice to spend a few moments expounding on the subject of web stats.
But when using server logs (or other forms of web statistics), you are observing real-life behavior. This direct connection to what the user is actually doing makes the use of server logs an important tool. Important, but, as pointed out by Nick and Tim Bray, not necessarily straightforward.
A more complicated case comes with dynamic pages. Are page.cgi?id=1 and page.cgi?id=2 the same page or two different pages? Obviously, this depends on your setup, but you better be able to tell your stats package which is which.
I remember one case where, at first glance, it looked like one prominent feature on the site wasn't being used much at all. But, I realized that I needed to tell the stats system to account for the query string, and lo and behold, those pages were being used. Good thing we didn't take rash action before I figured out how to use the system!
I've recently written about my purchase of ClickTracks, which is neat click stream analysis software. This gives a page-by-page account of where users are clicking. This will be, I think, very helpful in helping us gauge the success of re-designs and newsletter campaigns. It also presents the data in a very nice, non-threatening way. But, it doesn't meet all of the criteria I laid out for my ideal stats system. So, we use a second (and maybe a third) server log analysis tool to help with these other items. The tools each give us an different viewport onto what is happening on the site.
We store queries entered into the site's search engine to get a better idea about the terms people use to search our site. We check those logs to see if the users are actually finding what they're looking for, and if they're entering queries that we don't have answers for. It turns out they are, and so we're working to address this by adding a different type of search functionality (long story).
Our website is a portal, even though I don't care for that term. So, we care about the usage of the resources we link to. I built a little home-grown stats collection tool to track this information. This gives us a real nice view into how our site gets used, and by whom. We can see the effect of, for example, training activity on usage. And, if we see an unexpected spike in usage, we can investigate. Often, there is an interesting story behind dramatically increasing usage.
Assuming each view is actually shedding light on your goals, the more views you can muster, the better.
What do you think? Did I miss any obvious points? Let me know!
Posted by Karl
May 23, 2005 08:35 PM