My Photo

Clicky

  • Clicky Web Analytics

Your email address:


Powered by FeedBlitz

« Omniture Implementation Toolkit Release w. HBX Transition Guide + X Change Scholarship | Main | X Change Xperts »

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d83454a6d169e200e5536875a28833

Listed below are links to weblogs that reference Web Analytics - Behavioral Segmentation:

Comments

Hi Gary,
Thanks very much for this series of posts, which I find very, very interesting!

I have a couple of questions. First of all, I am a bit confused about the two approaches to segmentation that you mention. You say there is a difference between (1) starting with behavioral segmentation and (2) starting with survey segmentation. You also say that whereas the former approach works, the latter doesn’t.

Why is that the case? To my mind, at the end of the day you only have ONE data set containing the relations between behavior, demographics and psychographics – namely the survey data enhanced with the respondents’ clickstreams. Surely it must be this data set you use to construct your rich segments, right? And if so, does it matter where you start?

My other question: What exactly is the purpose of your segmentation? Do you want to use your segmentation for real-time targeting? (For example, if behavior X-Y-Z is typically associated with Segment A, then each time behavior X-Y-Z appears, we will show some unique content which we expect Segment A wants). Alternatively, do you want to use the segmentation for evaluating a campaign or to see which content on your site works best for whom?

I think, in the latter case, it isn’t always necessary to predict the demographics/psychographics of unknown visitors. Prediction is only relevant if you want to plan future actions. If, on the other hand, you want to evaluate a campaign or some content, you can always simply run an online survey at the same time, and then integrate the data with click streams. In this way you don’t have to predict; rather you can see directly how many people from this or that segment came from this or that source and saw this or that content. Of course, you may want to take into account that the survey data are not necessarily representative of all visitors, but this can be done by weighting the data.

Perhaps I should mention that I work for a web analytics vendor which offers a survey module that allows customers to build, launch and automatically integrate online surveys with behavioral data. My colleagues and I have carried out many consultancy projects where we have first launched a survey and then used data mining to segment and analyze the relationships between responses and behavior (see an example here:
http://www.netminers.dk/cms.ashx/!lang=en/analysis/webmapping.html ).

Thansk again for a great post!

Christian,

I've been hearing about your tool - you'll have to give me a demo sometime! Great comment.

You're comment about the difference in starting points would require a very long answer to deal with and I am going to be talking more about that. However, the short answer is that in most cases you are driving the core segmentation with either the survey data or the behavioral data - not both in a consolidated data set. And you are then "coloring" the segments with the second data. It ends up making a pretty substantial difference which is primary and which is secondary and my experience has been that it ends up being easier to color behavioral segments with demographic and psychographic data than to color traditional segments with behavioral data. I'm not sure I have a complete explanation for why that might be so (though I do have some thoughts).

Why wouldn't you just use both data sets in the initial segmentation creation? You can (and you would if you were doing customer segmentation and routinely had demographic and customer data), but doing so adds risks in terms of applying the segments to all visitors. This combined data set will actually produce the BEST and richest segmentation - but it comes with trade-offs in terms of your ability to extrapolate your segments to all online visitors.

I think a full-on behavioral segmentation (like a traditional segmentation) has myriad uses. I like to integrate it into management reporting, use it for targeting, and for ongoing analysis (and not just of campaigns). Cutting almost any true deep-dive analysis by rich behavioral segments will add significant analytic value. And management reporting that includes the segmentation is often much more interesting and comprehensible.

I do think that survey's are grossly underused for targeted analysis. And I find the lack of flexibility around implementing lots of one-off and targeted surveys a problem with many of our clients. There aren't many analytic deep-dives where I wouldn't love to be able to target a survey and add in highly customized survey data.

Hi Gary,

Many thanks for your reply – and interesting that you’ve been hearing about our tool considering our remote location in the northern periphery of Europe! I suppose the Internet has finally turned all of us into McLuhan’s Global Village. Yes, we definitely have to arrange a demo soon…

Anyway, I also wanted to say that I now understand what you mean by “coloring” the segments and using behavior as primary variables for segmentation.

I’m not entirely convinced, though, that this is the best approach. I fear it will collapse the differences in behavior between the psychographic/demographic segments from the survey. This could happen if, for example, there is a big difference between those who participate in the survey and those who do not. This would enable you to see immediately which behavioral segments would be less likely to respond to your survey, but I also fear that too many of the psychographic/demographic segments could end up in the same behavioral segment, if you understand what I mean.

I totally agree that the other approach is problematic when it comes to extrapolating the survey segments to the rest of the visitors... I suppose it is a give-and-take situation.

In any case I look very much forward to reading your upcoming posts on the issues. Perhaps I will be converted then… :-)

Christian

Interesting article. I agree that “segmentation” in the web analytics world is a highly over-used term. To summarise the problems of determining a “proper” segmentation :-

• The majority of web analytic tools do not provide the data
• Even if they did, then :-
o You’ll (probably) need some sort of ETL process to get the data into a form amenable to the analysis tools.
o Computing segments is expensive and you may need to sample.
o A specialist analysis tool can then be used to define the segments.

We’ve considered building an analysis tool into our product (we’re a vendor in the web analytics space) but to-date have decided that it would not be a good use of our resources. The reasons for this are :-
• The process of defining the business rules to create the segments is a one-off process. It is not something that will be undertaken on a daily or even weekly basis.
• We provide the data (down to the click-level) in a standard relational database. Further, we can apply flexible business rules to the low-grained data (e.g. to determine the “session style”) and this information can be de-normalised up to the visitor level. This reduces the amount of subsequent manipulation that is needed. It also reduces the amount of data that needs to be handled by an order of magnitude
• There are companies who specialise in all the weird and wonderful segmentation algorithms.

So we’ve made sure that it’s (relatively) easy to apply a segmentation and left the segmentation definition to the specialists. It may be that in the future we review this (e.g. if we determine a set of segmentation algorithms that behave extraordinarily well with the shape of web analytics data).

As web analytics becomes more and more mainstream I think this approach is the right one. Web analytics is really a form of business intelligence applied to the on-line channel. It makes sense to leverage as much prior art as possible.

I’ll also mention in passing that we allow users to define reports in SQL (full SQL level access). These SQL reports can be saved and scheduled in exactly the same manner as other reports. We also provide a drag-and-drop report wizard.

Guy
Site Intelligence Ltd (www.site-intelligence.co.uk)

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.