![]() |
||||||||||
|
||||||||||||||||||||||||||||||||||||
|
|
To take a very simple example, users of a technology product answered questions about where the product is located (home/work or elsewhere), their type of use (Business use only, Personal use only and both) and their volume of use (captured in a diary). The clusters are defined using these variables, which are known as indicators. In practice clusters can be defined on many more indicators than this. The model works by identifying segments which can be used to predict (i.e. reform) the underlying data. A number of other variables are used to describe the clusters: Model of Equipment, Region, Age, Job title and number of employees. Latent Class Analysis allows these types of variables (known in the Latent Class terminology as Covariates) to be used to more accurately predict cluster membership if the relationship with the clusters is statistically significant.
Including descriptive, as well as indicator variables, as we have done in this case can produce better clusters, which make better use of the available data and are easier to explain. Statistical tests show which indicators and covariates make a significant contribution to the cluster model. Rather than allocating each case to one cluster, the LC Clustering approach assesses the probability that every case (user) belongs to every cluster. For a model which converges well, these probabilities are usually close to 100% for the cluster a particular user is most associated with and 0% for the other clusters. This gives much more accurate cluster averages when analysing subgroups than with other methods. These probabilities can be turned into bi-plots such as the one below showing cluster membership for the subgroups of interest.
The main benefits of Latent Class Models are their flexibility, in terms of:
The very latest implementation of Latent Class Segmentation models (which only became available in 2005) allows continuous or ordinal factors to be formed along with the clusters. It also allows multilevel modelling. Continuous Factors This is a very powerful extension of traditional segmentation, made possible by recent advances in computer processing performance. It addresses the common concern about traditional segmentation; namely that they tend to focus on the obvious relationships in data rather than reveal more subtle patterns in data. Using this powerful extension to Latent Class Analysis means that these subtler patterns are more likely to be revealed. Using continuous factors as “intercepts” in ratings based cluster models, also enables scale rater bias to be factored out of the segmentation solution. This eliminates the age old problem of segments defined more on people’s tendency to generally rate high or low on a rating scale. Taking this out of the equation means that segments are free to focus on the relative ups and downs of ratings across different factors, rather than on the obvious differences in rating scale use overall. Ordinal Factors
In this sense, identifying ordinal factor such as this in data can be thought of as creating a composite ordinal measure. The segments in this case are different levels of this measure (low through to high). In the same study, we might find that a separate, independent ordinal factor exists which classifies organisations/consumers into:
Combining the possible levels of these factors might create 6 segments which not only meet the needs of the Business development manager commissioning the research, but also provides the best fit to the data. Multilevel Latent Class Models Taking the countries example, multilevel modelling allows clusters to be defined on individual responses, but with a higher level of additional clustering of the countries, according to the relative size of the individual-level clusters within those counties. For example, we might find there are 8 consumer segments and 3 groups of countries, consisting of Northern Europe (group 1), Southern Europe (Group 2) with Greece separate (3). The Northern Europe group might consist mainly of three consumer segments, with only a small proportion in the remaining segments. Southern Europe might mainly consist of 3 different consumer segments, while Greece might have two other dominant segments which are unique to Greece. Nested segmentations of this sort can be much more informative to marketers than a simple segmentation across the whole sample. Please contact Gary Bennett for further information (garyb@logitresearch.com).
|
| © Logit Research 2004 | Site by Ocean | |||