Actionable Analytics

Thoughts on Web Measurement & Optimisation – by Jonny Longden
  • Home
  • About Me
« How to build a digital measurement framework
Essential guide to data accuracy in web analytics »

Errors of causation in web analytics

PostDateIcon July 16th, 2009 | PostAuthorIcon Author: Jonny

The other day I was presenting the findings of some analysis to a client. The focus of this analysis was to discover the behavioural factors affecting checkout completion rates in order to shed some light on why people drop out. For example, within this analysis I was able to say fairly basic things such as:

  • Visitors who spend a lot of time on the site before their purchase are less likely to drop out of the checkout process than those who’s session is shorter
  • Visitors who land on the homepage are more likely to drop out of the process than those who land on a product page

Now, the client immediately got rather excited about this and began to say, regarding the first point, “Wow, so if we can increase the time on site then we can improve our drop-out rates. Excellent, how do we increase time on site?”. Had I allowed it, this person would no doubt have been rushing back to the marketing team with a new objective to get the time on site up!

So what’s wrong with this? Well, apart from the very numerous issues with dwell-time associated with this specific example, this represents a very common misunderstanding in web analysis. Put very simply:

Your customer didn’t complete their purchase because they were on your site for a long time. They were probably on your site for a long time because they are interested in your products and your site is relevant to them which, in turn, means they are more likely to complete their purchase. Increasing dwell-time per se doesn’t make any sense in this example because it isn’t the cause.

To provide a simpler example of this: you might notice that people who dress smartly often have quite tidy hair as well. Does this mean that dressing smartly causes tidy hair? If I put a suit on, will my hair instantly become much tidier because I’m wearing a suit? No, there is some other factor (the person’s need to look smart) that is driving both of these things.

This leads to all kinds of problems in web analysis, some of which are quite subtle. Furthermore, this problem is inextricably bound up with our obsession with click-stream; if we can’t see beyond the web analysis tool then we have to find our causes within it. The biggest danger is that we stop being able to see our visitors as real people with real needs, and instead just view them as lines of data or collections of behaviours.

A couple more examples of how this can cause problems:

  • You notice that direct traffic is of a higher quality than other sources. Does this mean that you should simply get more people to come to you direct? You could do this by displaying your URL as a static image in non-clickable banners, meaning that people have to physically type it into the browser. Again, no. Real direct traffic is direct because of brand familiarity and relevance, which may have nothing to do with advertising. The pure fact that it’s direct is of little relevance. (by the way, be careful – direct traffic isn’t always what it seems)
  • You notice a correlation between downloads of your latest white paper and calls to your salesteam. Excellent, the white paper is a succesful acquisition tool and is driving leads! Or is it? Which way round is it really? Are people calling you because they downloaded the white paper, or did they look at your site and dowload the white paper because they called you?

Remember, correlation does not imply causation! You can avoid this by remembering that your customers are real people with needs, desires, habits and lifestyles. They are not lines of data with dwell-times, page counts and completion rates. These things are only behavioural indications of something else more complex that is happening. Look beyond the click-stream and understand how your customers think and feel.

  • LinkedIn
  • Digg
  • Twitter
  • Delicious
  • Technorati Favorites
  • Google Bookmarks
  • StumbleUpon
  • Facebook
  • MySpace
  • Reddit
  • Bebo
  • FriendFeed
  • Slashdot
  • Share/Bookmark
PostCategoryIcon Posted in Analysis | PostTagIcon Tags: Analysis, analytics, behaviour, causation, cause, correlation, direct traffic, dwell-time, Measurement, Metrics
« How to build a digital measurement framework
Essential guide to data accuracy in web analytics »

3 Responses to “Errors of causation in web analytics”

  • Christian says:
    July 31, 2009 at 2:12 pm

    Excellent point! However, your post then begs the question: How do you, with the aviailable data, identify the causes of conversion? Is it at all possible to do with web analytics? If not, what additional data would you want in order to determine the causes? Or, is the problem much more fundamental, namely that you are pointing to a limitation of statistics in general (hence your reference to the article from wikipedia?

  • Jonny says:
    July 31, 2009 at 4:54 pm

    Hi Christian,
    Good question. I think the answer is two-fold:

    1 – If we are ever going to get close to why people behave the way they do then we need to utilise qualitative data as well as click-stream, and sometimes even instead of click-stream. The only sure-fire way to get the bottom of some of these things is to ask people.
    2 – In some cases the reasons for behaviour are simply too complex to understand, or at least to draw out using data. Take buying a car; we can spend a lot of time and money trying to measure the exact detail of multi-channel buying behaviour, but at the end of the day this process may be completely unique for every single person. Sometimes we have to take what we have and make the most of it – assumptions guided by insight!

    Thanks for commenting

  • Justyn says:
    January 7, 2010 at 5:39 am

    You’re exactly right! I have a client who is using a static web page for email campaigns. Since a LARGE group of their clients use Outlook, the browser isn’t passing any information about a referral to us. So the customer is getting the email, clicking an ad, and our analytics show a “direct” referral. You can imagine their relief when I explained the issue and their email campaigns were actually performing above standard.

    My solution is to have them put tagged links throughout their email campaign so it doesn’t matter who referred them…I tell the analytics EXACTLY from where I got the click.

    Here’s the tool I use to build those tags out: http://www.google.com/support/googleanalytics/bin/answer.py?answer=55578&hl=en

    Thanks for making people challenge assumptions…

Leave a Reply

Click here to cancel reply.

Subscribe
Subscribe
Recent Posts
  • Web Analytics Vendor Review – Sophus3
  • Web Analytics – Art or Science?
  • Essential guide to an effective SEO backlink strategy
  • Web Analytics as the Enabler of Performance Marketing
  • Why goals are so important and how to create them
Recent Comments
  • Michelle Tee on Essential guide to an effective SEO backlink strategy
  • Buddy Scalera on Google Analytics vs. Omniture Site Catalyst
  • Justyn on Web Analytics – Art or Science?
  • Justyn on Analytics Direct Traffic is NOT What You Think It Is
  • Justyn on Errors of causation in web analytics
Categories
  • Analysis
  • Measurement
  • Metrics
  • Reviews
  • SEO
  • Strategy
  • Tools
Tags
accuracy Analysis analytics anchor text backlink behaviour benchmark causation cause cookies correlation customer journey data direct traffic dwell-time engagement goals google analytics KPI link building marketing Measurement measurement framework Metrics omniture optimisation performance marketing quality referrer reporting review ROI search sem SEO sophus3 Strategy targets Tools web analysis
Tweets
  • RT @immeria: Good read: "Defining Analytics: Data, Information and Knowledge" http://ow.ly/1ohflv #measure 8 hours ago
  • RT @RRS_ATL: Understanding Statistical Significance with Omniture Test & Target http://tinyurl.com/yaaqxdd #measure 9 hours ago
  • #iphone users - DON'T INSTALL software update 3.1.3 - it will destroy your phone!! Thanks very much Apple! 2010/02/07
  • RT @Web_Stats: Leveraging Standard Deviation data with Google Analytics - http://www.webfuel.ca/2poExcellent Blog Post #measure #ga 2010/02/05
  • RT @Al3x4nd3rR: @jonnylongden This was dismissed by @omniturecare a couple of days ago.. 2010/02/05
Blogroll
  • Analytics Talk
  • Anil Batra
  • Dave Chaffey
  • Google Analytics Blog
  • Immeria
  • Juice Analytics
  • Kaizen Analytics
  • Lies, Damned Lies by Ian Thomas
  • Measuring Success by Brian Clifton
  • Multichannel Marketing Metrics by Akin Arikan
  • Occam’s Razor by Avinash Kaushik
  • Omniture Industry Insights
  • Web Analysts Info
  • Web Analytics and Optimization Blog
  • Web Analytics Demystified
  • Web Analytics Management
  • Webtrends Blog
  • Words Go Further
Recently on Last.fm

    Disclaimer: The views and opinions expressed in this blog are mine alone and do not necessarily reflect the opinions of my employer or my clients