Guttman scale
In the analysis of multivariate observations designed to assess subjects with respect to an attribute, a Guttman Scale is a single ordinal scale for the assessment of the attribute, from which the original observations may be reproduced. The discovery of a Guttman Scale in data depends on their multivariate distribution's conforming to a particular structure. Hence, a Guttman Scale is a hypothesis about the structure of the data, formulated with respect to a specified attribute and a specified population and cannot be constructed for any given set of observations. Contrary to a widespread belief, a Guttman Scale is not limited to dichotomous variables and does not necessarily determine an order among the variables. But if variables are all dichotomous, the variables are indeed ordered by their sensitivity in recording the assessed attribute, as illustrated by Example 1.
Deterministic model
Example 1: Dichotomous variablesA Guttman scale may be hypothesized for the following five questions that concern the attribute "acceptance of social contact with immigrants", presented to a suitable population:
- Would you accept immigrants as residents in your country?
- Would you accept immigrants as residents in your town?
- Would you accept immigrants as residents in your neighborhood?
- Would you accept immigrants as next-door neighbors?
- Would you accept an immigrant as your child's spouse?
Table 1. Hypothesized Responses to the Five Social Distance Variables Form a Guttman Scale
Every row in the shaded part of Table 1 is the response profile of any number of respondents. Every profile in this table indicates acceptance of immigrants in all senses indicated by the previous profile, plus an additional sense in which immigrants are accepted. If, in a large number of observations, only the profiles listed in Table 1 are observed, then the Guttman Scale Hypothesis is supported, and the values of the Scale have the following properties:
- They assess the strength of the attribute "acceptance of social contact with immigrants";
- They reproduce the original observations.
While other scaling techniques produce a single scale by summing up respondents' scores—a procedure that assumes, often without justification, that all observed variables have equal weights — Guttman scale avoids weighting the observed variables; thus 'respecting' data for what they are. If a Guttman Scale is confirmed, the measurement of the attribute is intrinsically one-dimensional; the unidimensionality is not forced by summation or averaging. This feature renders it appropriate for the construction of replicable scientific theories and meaningful measurements, as explicated in Facet Theory.
Ordinal Variables
Given a data set of N subjects observed with respect to n ordinal variables, each having any finite number of numerical categories ordered by increasing strength of a pre-specified attribute, let aij be the score obtained by subject i on variable j, and define the list of scores that subject i obtained on the n variables, ai=ai1…ain, to be the profile of subject i..Define:
Two profiles, as and at are equal, denoted as=at, iff asj=atj for all j=1…n
Profile as is greater than Profile at, denoted as>at, iff asj ≥ atj for all j=1…n and asj'
Profiles as and at are comparable, denoted asSat, iff as=at; or as>at; or at>as
Profiles as and at are incomparable, denoted as$at, iff they are not comparable with respect to a given attribute, Guttman Scale is defined simply thus:
Definition: Guttman Scale is a data set in which all profile-pairs are comparable.
Example: Non-dichotomous variables
Consider the following four variables that assess arithmetic skills among a population P of pupils:V1: Can pupil perform addition of numbers? No=1; Yes, but only of two-digit numbers=2; Yes=3.
V2: Does pupil know the multiplication table? No=1; Yes=2.
V3: Can pupil perform multiplication of numbers? No=1; Yes, but only of two-digit numbers=2; Yes=3.
V4: Can pupil perform long division? No=1; Yes=2.
Data collected for the above four variables among a population of school children may be hypothesized to exhibit the Guttman Scale shown below in Table 2:
Table 2. Data of the Four Ordinal Arithmetic Skill Variables are Hypothesized to Form a Guttman Scale
V1 | V2 | V3 | V4 | Possible Scale Score |
1 | 1 | 1 | 1 | 4 |
2 | 1 | 1 | 1 | 5 |
2 | 2 | 1 | 1 | 6 |
3 | 2 | 1 | 1 | 7 |
3 | 2 | 2 | 1 | 8 |
3 | 2 | 3 | 1 | 9 |
3 | 2 | 3 | 2 | 10 |
The set profiles hypothesized to occur illustrates the defining feature of the Guttman Scale, namely, that any pair of profiles are comparable. Here too, if the hypothesis is confirmed, a single scale-score reproduces a subject's responses in all the variables observed.
Any ordered set of numbers could serve as scale. In this illustration we chose the sum of profile-scores. According to Facet Theory, only in data that conform to a Guttman scale such a summation may be justified.
Reproducibility
In practice, perfect Guttman Scales are rare, but approximate ones, have been found in specific populations with respect to attributes such as religious practices, narrowly defined domains of knowledge, specific skills, and ownership of household appliances. When data do not conform to a Guttman Scale, they may either represent a Guttman Scale with noise, or they may have a more complex structure requiring multiple scaling for identifying the scales intrinsic to them.The extent to which a data set conforms to a Guttman Scale can be estimated from the coefficient of reproducibility of which there are a few versions, depending on statistical assumptions and limitations. Guttman's original definition of the reproducibility coefficient, CR is simply 1 minus the ratio of the number of errors to the number of entries in the data set. And, to ensure that there is a range of responses the coefficient of scalability is used.
In Guttman scaling is found the beginnings of item response theory which, in contrast to classical test theory, acknowledges that items in questionnaires do not all have the same level of difficulty. Non-deterministic models have been developed such as the Mokken scale and the Rasch model. Guttman Scale has been generalized to the theory and procedures of Multiple Scaling which identifies the minimum number of scales needed for satisfactory reproducibility.
As a procedure that ties substantive contents with logical aspects of data, Guttman Scale heralded the advent of Facet Theory developed by Louis Guttman and his associates.
Guttman Scale in Qualitative Variables
Guttman's original definition of a Scale allows also for the exploratory scaling analysis of qualitative variables. This definition of Guttman Scale relies on the prior definition of a simple function.For a totally ordered set X, say, 1,2,…,m, and another finite set, Y, with k elements k ≤ m, a function from X to Y is a simple function if X can be partitioned into k intervals which are in a one-to-one correspondence with the values of Y.
A Guttman Scale may then be defined for a data set of n variables, with the jth variable having kj categories, Thus:
Definition: Guttman Scale is a data set for which there exists an ordinal variable, X, with a finite number m of categories, say, 1,…,m with m≥ maxj and a permutation of subjects' profiles such that each variable in the data set is a simple function of X.
Despite its seeming elegance and appeal for exploratory research, this definition has not been sufficiently studied or applied
Further Reading
1. Coombs, C. H., Coombs, L. C., & Lingoes, J. C.. Stochastic cumulative scales. In S. Shye, Theory construction and data analysis in the behavioral sciences. San Francisco: Jossey-Bass.2. Goodman, L. A.. A new model for scaling response patterns: An application of the quasi-independence concept. Journal of the American Statistical Association, 70, 755–768.
3. Guttman, L.. A basis for scaling qualitative data. American Sociological Review, 9, 139–150.
4. Green, B.F.. A method of scalogram analysis using summary statistics. Psychometrika, 21, 79-88.