Income inequality metrics
Income inequality metrics or income distribution metrics are used by social scientists to measure the distribution of income and economic inequality among the participants in a particular economy, such as that of a specific country or of the world in general. While different theories may try to explain how income inequality comes about, income inequality metrics simply provide a system of measurement used to determine the dispersion of incomes. The concept of inequality is distinct from poverty and fairness.
Income distribution has always been a central concern of economic theory and economic policy. Classical economists such as Adam Smith, Thomas Malthus and David Ricardo were mainly concerned with factor income distribution, that is, the distribution of income between the main factors of production: land, labour and capital. Income distribution is often related to wealth distribution, although separate factors influence wealth inequality.
Modern economists have also addressed this issue, but have been more concerned with the distribution of income across individuals and households. Important theoretical and policy concerns include the relationship between income inequality and economic growth. The article economic inequality discusses the social and policy aspects of income distribution questions.
Defining income
All of the metrics described below are applicable to evaluating the distributional inequality of various kinds of resources. Here the focus is on income as a resource. As there are various forms of "income", the kind of income under investigation has to be clearly described. One form of income is the total amount of goods and services that a person receives, and thus there is not necessarily money or cash involved. If a subsistence farmer in Uganda grows his own grain, it will count as income. Services like public health and education are also counted. Often expenditure or consumption is used to measure income. The World Bank uses the so-called "living standard measurement surveys" to measure income. These consist of questionnaires with more than 200 questions. Surveys have been completed in most developing countries.
Applied to the analysis of income inequality within countries, "income" often stands for the taxed income per individual or per household. Here, income inequality measures also can be used to compare the income distributions before and after taxation in order to measure the effects of progressive tax rates.
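As a rough illustration of such a before/after comparison, the following Python sketch computes the Gini index for a handful of made-up incomes before and after a hypothetical three-bracket tax schedule. The incomes, the bracket limits and the rates are assumptions chosen only for illustration and do not describe any real tax system.

```python
def gini(incomes):
    """Gini index of individual incomes (0 = perfect equality)."""
    xs = sorted(incomes)
    n = len(xs)
    total = sum(xs)
    # Rank-weighted formula for data sorted in ascending order.
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n


# Hypothetical progressive schedule: (upper limit of bracket, marginal rate).
BRACKETS = [(10_000, 0.0), (50_000, 0.2), (float("inf"), 0.4)]


def after_tax(income):
    """Income left after applying the hypothetical marginal rates."""
    tax = 0.0
    lower = 0.0
    for upper, rate in BRACKETS:
        if income > lower:
            tax += (min(income, upper) - lower) * rate
        lower = upper
    return income - tax


pre_tax = [10_000, 20_000, 40_000, 80_000, 160_000]  # made-up incomes
post_tax = [after_tax(x) for x in pre_tax]

print(f"Gini before tax: {gini(pre_tax):.3f}")
print(f"Gini after tax:  {gini(post_tax):.3f}")
```

With a schedule of this shape the after-tax Gini index comes out lower than the before-tax one, which is the effect the comparison is meant to capture.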
Properties of inequality metrics
In the discrete case, an economic inequality index may be represented by a function I(x), where x is a set of n economic values x = {x1, x2, ..., xn}, with xi being the economic value associated with "economic agent" i. In the economic literature on inequality, the first four properties listed below are generally postulated for any measure of inequality; the remaining ones are further properties that are often considered useful:
- Anonymity or symmetry
- Scale independence or homogeneity
- Population independence
- Transfer principle
- Non-negativity
- Egalitarian zero
- Bounded above by maximum inequality
- Subgroup decomposability
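The properties listed above can be checked numerically for a concrete index. The sketch below does so for the coefficient of variation, using the usual textbook readings of the properties (these readings, the test incomes and the helper name `cv` are assumptions of this example, not definitions taken from the text). The coefficient of variation is not bounded above, so that property is not checked.

```python
import statistics


def cv(incomes):
    """Coefficient of variation: population standard deviation / mean."""
    return statistics.pstdev(incomes) / statistics.fmean(incomes)


x = [10.0, 20.0, 30.0, 40.0]  # made-up incomes

# Anonymity: reordering the agents does not change the index.
anonymity = abs(cv(x) - cv(list(reversed(x)))) < 1e-12

# Scale independence: multiplying every income by a constant changes nothing.
scale = abs(cv(x) - cv([3.0 * v for v in x])) < 1e-12

# Population independence: replicating the whole population changes nothing.
population = abs(cv(x) - cv(x + x)) < 1e-12

# Transfer principle: moving 5 units from the richest to the poorest agent
# (without reversing their ranks) lowers the index.
transfer = cv([15.0, 20.0, 30.0, 35.0]) < cv(x)

# Non-negativity and egalitarian zero.
non_negative = cv(x) >= 0.0
egalitarian_zero = cv([7.0, 7.0, 7.0]) == 0.0

print(anonymity, scale, population, transfer, non_negative, egalitarian_zero)
```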
Common income inequality metrics
An additional property of an inequality metric that may be desirable from an empirical point of view is that of 'decomposability'. This means that if a particular economy is broken down into sub-regions, and an inequality metric is computed for each sub-region separately, then the measure of inequality for the economy as a whole should be a weighted average of the regional inequalities plus a term proportional to the inequality in the averages of the regions. Of the Gini, Hoover and Theil indexes discussed below, only the Theil index has this property.
Because these income inequality metrics are summary statistics that seek to aggregate an entire distribution of incomes into a single index, the information on the measured inequality is reduced. This information reduction of course is the goal of computing inequality measures, as it reduces complexity.
A weaker reduction of complexity is achieved if income distributions are described by shares of total income. Rather than indicating a single measure, the society under investigation is split into segments, such as quintiles. Usually each segment contains the same share of income earners. In the case of an unequal income distribution, the shares of income available in each segment are different.
In many cases the inequality indices mentioned above are computed from such segment data without evaluating the inequalities within the segments. The higher the number of segments, the closer the measured inequality of distribution gets to the real inequality.
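The effect of the number of segments can be sketched in Python. The example below computes a Gini index from segment totals only, for a made-up population with incomes 1 to 100, and compares 2, 5, 10 and 100 equally sized segments. The function name and the data are assumptions of this example.

```python
def grouped_gini(incomes, segments):
    """Gini index computed from segment totals only.

    Assumes len(incomes) is divisible by the number of segments.
    Inequality inside each segment is ignored.
    """
    xs = sorted(incomes)
    n = len(xs)
    size = n // segments
    total = sum(xs)
    cumulative = 0.0
    weighted = 0.0
    for s in range(segments):
        segment_income = sum(xs[s * size:(s + 1) * size])
        previous = cumulative
        cumulative += segment_income
        # Trapezoid under the Lorenz curve for this segment.
        weighted += (previous + cumulative) * size
    return 1.0 - weighted / (n * total)


incomes = list(range(1, 101))  # made-up incomes 1, 2, ..., 100
results = {k: grouped_gini(incomes, k) for k in (2, 5, 10, 100)}
for k, value in results.items():
    print(f"{k:3d} segments: Gini = {value:.4f}")
```

With 100 segments every individual forms a segment of their own, so the last value is the Gini index of the ungrouped data; the coarser segmentations approach it from below.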
Quintile measures of inequality satisfy the transfer principle only in its weak form, because changes in income distribution outside the relevant quintiles are not picked up by these measures; only the distribution of income between the very rich and the very poor matters, while inequality in the middle plays no role.
Details of the individual inequality measures are described in their respective articles. The following subsections cover them only briefly.
Gini index
The range of the Gini index is between 0 and 1, where 0 indicates perfect equality and 1 indicates maximum inequality. The Gini index is the most frequently used inequality index. The reason for its popularity is that it is easy to understand how to compute the Gini index as a ratio of two areas in Lorenz curve diagrams. As a disadvantage, the Gini index only maps a number to the properties of a diagram, but the diagram itself is not based on any model of a distribution process. The "meaning" of the Gini index can only be understood empirically. Additionally, the Gini index does not capture where in the distribution the inequality occurs. As a result, two very different distributions of income can have the same Gini index.
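The ratio-of-areas construction can be sketched in a few lines of Python. The example builds the Lorenz curve from individual incomes, integrates it with the trapezoid rule, and then shows two made-up distributions of different shape that share the same Gini index. Function names and data are assumptions of this example.

```python
def lorenz_points(incomes):
    """Points (population share, cumulative income share) of the Lorenz curve."""
    xs = sorted(incomes)
    total = sum(xs)
    points = [(0.0, 0.0)]
    cumulative = 0.0
    for i, x in enumerate(xs, start=1):
        cumulative += x
        points.append((i / len(xs), cumulative / total))
    return points


def gini_from_lorenz(incomes):
    """Area between diagonal and Lorenz curve, divided by the area under the diagonal."""
    pts = lorenz_points(incomes)
    area_under_curve = sum(
        (p1 - p0) * (l0 + l1) / 2.0
        for (p0, l0), (p1, l1) in zip(pts, pts[1:])
    )
    return (0.5 - area_under_curve) / 0.5


# Two differently shaped made-up distributions.
two_classes = [1, 1, 3, 3]   # half the population earns three times the other half
one_outlier = [2, 2, 2, 6]   # a single high earner

print(gini_from_lorenz(two_classes))
print(gini_from_lorenz(one_outlier))
```

Both distributions yield a Gini index of 0.25, although the inequality sits in different parts of the distribution.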
20:20 Ratio
The 20:20 or 20/20 ratio compares how much richer the top 20% of a given population are than the bottom 20%. This can be more revealing of the actual impact of inequality in a population, as it reduces the effect on the statistics of outliers at the top and bottom and prevents the middle 60% from statistically obscuring inequality that is otherwise obvious in the field. The measure is used for the United Nations Development Programme Human Development Indicators. The 20:20 ratio shows, for example, that Japan and Sweden have a low equality gap, where the richest 20% earn only 4 times as much as the poorest 20%, whereas in the UK the ratio is 7 times and in the US 8 times. Some believe the 20:20 ratio is a more useful measure as it correlates well with measures of human development and social stability including the index of child well-being, index of health and social problems, population in prison, physical health, mental health and many others.
Palma ratio
The Palma ratio is defined as the share of gross national income of the richest 10% of the population divided by the share of the poorest 40%. It is based on the work of Chilean economist Gabriel Palma, who found that middle class incomes almost always represent about half of gross national income while the other half is split between the richest 10% and poorest 40%, but the share of those two groups varies considerably across countries. The Palma ratio addresses the Gini index's over-sensitivity to changes in the middle of the distribution and insensitivity to changes at the top and bottom, and therefore more accurately reflects income inequality's economic impacts on society as a whole. Palma has suggested that distributional politics pertains mainly to the struggle between the rich and poor, and who the middle classes side with.
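Both the 20:20 ratio and the Palma ratio compare the income of a top group with that of a bottom group, so one helper can cover both. The sketch below is a minimal version for individual incomes; it assumes the population size makes the percentage cuts fall on whole persons, and the data are made up.

```python
def share_ratio(incomes, top_pct, bottom_pct):
    """Income of the top `top_pct` percent divided by income of the bottom `bottom_pct` percent.

    Assumes len(incomes) * pct / 100 is a whole number of at least 1
    and that the bottom group has a positive total income.
    """
    xs = sorted(incomes)
    n = len(xs)
    top_n = n * top_pct // 100
    bottom_n = n * bottom_pct // 100
    return sum(xs[n - top_n:]) / sum(xs[:bottom_n])


incomes = list(range(1, 11))  # made-up incomes 1, 2, ..., 10

ratio_20_20 = share_ratio(incomes, 20, 20)  # (9 + 10) / (1 + 2)
palma = share_ratio(incomes, 10, 40)        # 10 / (1 + 2 + 3 + 4)

print(f"20:20 ratio: {ratio_20_20:.2f}")
print(f"Palma ratio: {palma:.2f}")
```

Because both groups are measured against the same total income, the ratio of the group totals equals the ratio of their income shares.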
Hoover index
The Hoover index is the simplest of all inequality measures to calculate: it is the proportion of all income which would have to be redistributed to achieve a state of perfect equality. In a perfectly equal world, no resources would need to be redistributed to achieve equal distribution: a Hoover index of 0. In a world in which all income was received by just one family, almost 100% of that income would need to be redistributed in order to achieve equality. The Hoover index then ranges between 0 and 1, where 0 indicates perfect equality and 1 indicates maximum inequality.
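A minimal Python sketch of this definition for individual incomes follows. It uses the common formulation as half the sum of absolute deviations from the mean, divided by total income; the function name and test data are assumptions of this example.

```python
def hoover(incomes):
    """Share of total income that would have to move to equalise all incomes."""
    total = sum(incomes)
    mean = total / len(incomes)
    # Every unit above the mean must be handed to someone below the mean,
    # so only half of the absolute deviations count.
    return 0.5 * sum(abs(x - mean) for x in incomes) / total


equal = hoover([5, 5, 5, 5])
one_family = hoover([0, 0, 0, 100])

print(equal)       # perfect equality
print(one_family)  # one of four families receives everything
```

In the second case the single recipient keeps an equal share of 25, so 75% of the income has to be redistributed. With n families the value is 1 - 1/n, which approaches 1 as the population grows.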
Galt score
The Galt score is a simple ratio of a company's CEO pay to the pay of that company's median worker. A company which pays its CEO many times more than its median employee will have a high Galt score. It is named for the fictional character John Galt in Ayn Rand's novel Atlas Shrugged.
The score is calculated using the total compensation of the CEO, including salary, bonuses, the value of stock awards and employee stock options, as well as non-equity incentive plan compensation, and nonqualified deferred compensation.
Coefficient of variation
The coefficient of variation is the square root of the variance of the incomes divided by the mean income. It has the advantages of being mathematically tractable and its square is subgroup decomposable, but it is not bounded from above.
Wage share
The wage share is the ratio between compensation of employees and GDP. In other words, it is the total of employees' income divided by the national income.
Theil index
A Theil index of 0 indicates perfect equality. A Theil index of 1 indicates that the distributional entropy of the system under investigation is close to that of a system with an 82:18 distribution. This is slightly more unequal than the inequality in a system to which the "80:20 Pareto principle" applies. The Theil index can be transformed into an Atkinson index, which has a range between 0 and 1, where 0 indicates perfect equality and 1 indicates maximum inequality. The Theil index is an entropy measure. As for any resource distribution and with reference to information theory, "maximum entropy" occurs once income earners cannot be distinguished by their resources, i.e. when there is perfect equality. In real societies people can be distinguished by their different resources, with the resources being incomes. The more "distinguishable" they are, the lower is the "actual entropy" of a system consisting of income and income earners. Also based on information theory, the gap between these two entropies can be called "redundancy". It behaves like a negative entropy.
The term "Theil entropy" has also been used for the Theil index. This caused confusion. As an example, Amartya Sen commented on the Theil index, "given the association of doom with entropy in the context of thermodynamics, it may take a little time to get used to entropy as a good thing." It is important to understand that an increasing Theil index does not indicate an increasing entropy; instead it indicates an increasing redundancy.
High inequality yields high Theil redundancies. High redundancy means low entropy. But this does not necessarily imply that a very high inequality is "good", because very low entropies also can lead to explosive compensation processes. Neither does using the Theil index necessarily imply that a very low inequality is "good", because high entropy is associated with slow, weak and inefficient resource allocation processes.
There are three variants of the Theil index. When applied to income distributions, the first Theil index relates to systems within which incomes are stochastically distributed to income earners, whereas the second Theil index relates to systems within which income earners are stochastically distributed to incomes.
A third "symmetrized" Theil index is the arithmetic average of the two previous indices. The formula of the third Theil index has some similarity with the Hoover index. As in case of the Hoover index, the symmetrized Theil index does not change when swapping the incomes with the income earners. How to generate that third Theil index by means of a spreadsheet computation directly from distribution data is shown below.
An important property of the Theil index which makes its application popular is its decomposability into a between-group and a within-group component. For example, the Theil index of overall income inequality can be decomposed into the between-region and within-region components of inequality, while the relative share attributable to the between-region component suggests the relative importance of the spatial dimension of income inequality.
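The variants and the decomposition can be sketched in Python for individual, strictly positive incomes. The sketch assumes that the first and second Theil index correspond to the forms commonly called Theil T and Theil L (mean log deviation); the function names and the two made-up "regions" are likewise assumptions of this example.

```python
import math


def theil_t(xs):
    """Theil T: income-weighted mean of log(income / mean income)."""
    mu = sum(xs) / len(xs)
    return sum((x / mu) * math.log(x / mu) for x in xs) / len(xs)


def theil_l(xs):
    """Theil L (mean log deviation): mean of log(mean income / income)."""
    mu = sum(xs) / len(xs)
    return sum(math.log(mu / x) for x in xs) / len(xs)


def theil_symmetrized(xs):
    """Arithmetic average of the two indices above."""
    return 0.5 * (theil_t(xs) + theil_l(xs))


def decompose_theil_t(groups):
    """Split Theil T into (within-group, between-group) components."""
    everyone = [x for g in groups for x in g]
    total = sum(everyone)
    mu = total / len(everyone)
    within = 0.0
    between = 0.0
    for g in groups:
        income_share = sum(g) / total
        mu_g = sum(g) / len(g)
        within += income_share * theil_t(g)            # weighted regional index
        between += income_share * math.log(mu_g / mu)  # inequality of regional means
    return within, between


regions = [[10.0, 20.0, 30.0], [40.0, 80.0, 120.0, 160.0]]  # made-up data
within, between = decompose_theil_t(regions)
overall = theil_t([x for g in regions for x in g])

print(f"within = {within:.4f}, between = {between:.4f}, overall = {overall:.4f}")
```

The within-group and between-group components add up to the overall index, which is the decomposability property described above.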
Comparison of the Theil index and the Hoover index
The Theil index indicates the distributional redundancy of a system, within which incomes are assigned to income earners in a stochastic process. In comparison, the Hoover index indicates the minimum size of the income share of a society, which would have to be redistributed in order to reach maximum entropy. Not to exceed that minimum size would require a perfectly planned redistribution. Therefore, the Hoover index is the "non-stochastic" counterpart to the "stochastic" Theil index. Applying the Theil index to allocation processes in the real world does not imply that these processes are stochastic: the Theil yields the distance between an ordered resource distribution in an observed system to the final stage of stochastic resource distribution in a closed system. Similarly, applying the Hoover index does not imply that allocation processes occur in a perfectly planned economy: the Hoover index yields the distance between the resource distribution in an observed system to the final stage of a planned "equalization" of resource distribution. For both indices, such an equalization only serves as a reference, not as a goal.
For a given distribution the Theil index can be larger than the Hoover index or smaller than the Hoover index:
- For high inequalities the Theil index is larger than the Hoover index.
- For low inequalities the Theil index is smaller than the Hoover index.
In order to increase the redundancy in the distribution category of a society as a closed system, entropy needs to be exported from the subsystem operating in that economic category to other subsystems with other entropy categories in the society. For example, social entropy may increase. In the real world, however, societies are open systems, although their openness is restricted by the entropy exchange capabilities of the interfaces between the society and its environment. For societies whose resource distribution is, in terms of entropy, similar to that of a reference society with a 73:27 split, the Hoover index and the Theil index are equal, at a value of around 46%.
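The comparison can be checked for simple two-group reference societies. The sketch below assumes an "a:b split" means that a share a of the people holds a share b = 1 - a of the income while the remaining share b of the people holds a share a of the income, and it assumes the symmetrized Theil index. Under these assumptions the Hoover index is a - b and the Theil index is (a - b) * ln(a / b), so the two coincide where a / b equals Euler's number.

```python
import math


def hoover_split(a):
    """Hoover index of an a:(1 - a) split, for 0.5 <= a < 1."""
    return a - (1.0 - a)


def theil_split(a):
    """Symmetrized Theil index of an a:(1 - a) split, for 0.5 <= a < 1."""
    b = 1.0 - a
    return (a - b) * math.log(a / b)


# The indices are equal where ln(a / b) = 1, i.e. a = e / (1 + e).
crossing = math.e / (1.0 + math.e)

for a in (0.60, 0.73, 0.82, 0.90):
    print(f"{a:.2f}: Hoover = {hoover_split(a):.3f}, Theil = {theil_split(a):.3f}")
print(f"crossing at a = {crossing:.4f}, index = {hoover_split(crossing):.4f}")
```

Below the crossing point the Theil index is smaller than the Hoover index, above it the Theil index is larger, and an 82:18 split gives a Theil index close to 1.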
Ratios
Another common class of metrics is to take the ratio of the income of two different groups, generally "higher over lower". This compares two parts of the income distribution, rather than the distribution as a whole; equality between these parts corresponds to 1:1, while the more unequal the parts, the greater the ratio. These statistics are easy to interpret and communicate, because they are relative, but, since they do not fall on an absolute scale, do not provide an absolute measure of inequality.
Ratio of percentiles
It is particularly common to compare a given percentile to the median; compare the seven-number summary, which summarizes a distribution by certain percentiles. While such ratios do not represent the overall level of inequality in the population as a whole, they provide measures of the shape of the income distribution. For example, in the period 1967–2003, the US income ratio between the median and the 10th and 20th percentiles did not change significantly, while the ratio between the median and the 80th, 90th, and 95th percentiles increased. This reflects that the increase in the Gini coefficient of the US in this time period is due to gains by upper income earners, rather than to losses by lower income earners.
Share of income
A related class of ratios is "income share": the percentage of national income that a subpopulation accounts for. Taking the ratio of income share to subpopulation size corresponds to a ratio of mean subpopulation income relative to mean income. Because income distribution is generally positively skewed, the mean is higher than the median, so ratios to the mean are lower than ratios to the median. Income share is particularly used to measure the fraction of income accruing to top earners (the top 10%, 1%, 0.1%, 0.01%, and also the "top 100" earners or the like; in the US the top 400 earners are 0.0002% of earners) in order to study concentration of income, that is, wealth condensation, or rather income condensation. For example, the US income share of top earners was approximately constant from the mid-1950s to the mid-1980s, then increased from the mid-1980s through the 2000s; this increased inequality was reflected in the Gini coefficient. In 2007, for instance, the top decile of US earners accounted for 49.7% of total wages, and the top 0.01% of US earners accounted for 6% of total wages.
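Percentile ratios and income shares can be sketched in Python as follows. The example uses a simple nearest-rank percentile and made-up incomes 1 to 100; both helper functions are assumptions of this example, and other percentile conventions would give slightly different numbers.

```python
def percentile(sorted_xs, p):
    """Nearest-rank percentile for an integer p between 1 and 100."""
    n = len(sorted_xs)
    rank = (p * n + 99) // 100  # ceiling of p * n / 100
    return sorted_xs[rank - 1]


def top_share(sorted_xs, top_pct):
    """Share of total income received by the top `top_pct` percent.

    Assumes len(sorted_xs) * top_pct / 100 is a whole number of at least 1.
    """
    n = len(sorted_xs)
    k = n * top_pct // 100
    return sum(sorted_xs[n - k:]) / sum(sorted_xs)


incomes = sorted(range(1, 101))  # made-up incomes 1, 2, ..., 100
median = percentile(incomes, 50)

p90_to_median = percentile(incomes, 90) / median
median_to_p10 = median / percentile(incomes, 10)

share_top10 = top_share(incomes, 10)
# Income share divided by population share equals
# mean income of the subpopulation divided by overall mean income.
relative_mean = share_top10 / 0.10

print(p90_to_median, median_to_p10)
print(f"top 10% share = {share_top10:.4f}, relative mean = {relative_mean:.4f}")
```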
Spreadsheet computations
The Gini coefficient, the Hoover index and the Theil index as well as the related welfare functions can be computed together in a spreadsheet. The welfare functions serve as alternatives to the median income.
Group | Members per Group | Income per Group | Income per Individual | Relative Deviation | Accumulated Income | Gini | Hoover | Theil |
1 | A1 | E1 | Ē1 = E1/A1 | D1 = E1/ΣE - A1/ΣA | K1 = E1 | G1 = (2 * K1 - E1) * A1 | H1 = abs(D1) | T1 = ln(Ē1) * D1 |
2 | A2 | E2 | Ē2 = E2/A2 | D2 = E2/ΣE - A2/ΣA | K2 = E2 + K1 | G2 = (2 * K2 - E2) * A2 | H2 = abs(D2) | T2 = ln(Ē2) * D2 |
3 | A3 | E3 | Ē3 = E3/A3 | D3 = E3/ΣE - A3/ΣA | K3 = E3 + K2 | G3 = (2 * K3 - E3) * A3 | H3 = abs(D3) | T3 = ln(Ē3) * D3 |
4 | A4 | E4 | Ē4 = E4/A4 | D4 = E4/ΣE - A4/ΣA | K4 = E4 + K3 | G4 = (2 * K4 - E4) * A4 | H4 = abs(D4) | T4 = ln(Ē4) * D4 |
Totals | ΣA | ΣE | Ē = ΣE/ΣA | | | ΣG | ΣH | ΣT |
Inequality Measures | | | | | | Gini = 1 - ΣG/ΣA/ΣE | Hoover = ΣH / 2 | Theil = ΣT / 2 |
Welfare Function | | | | | | WG = Ē * (1 - Gini) | WH = Ē * (1 - Hoover) | WT = Ē * (1 - Theil) |
In the table, the columns "Members per Group" and "Income per Group" are used for data input. From these data the inequality measures as well as the related welfare functions are computed.
In the example given here, "Theil index" stands for the arithmetic mean of a Theil index computed for the distribution of income within a society to the individuals in that society and a Theil index computed for the distribution of the individuals in the society to the income of that society. The difference between the Theil index and the Hoover index is the weighting of the relative deviation D. For the Hoover index the relative deviation D per group is weighted with its own sign. For the Theil index the relative deviation D per group is weighted with the information size provided by the income per individual in that group.
For the computation the society usually is divided into income groups. Often there are four or five groups consisting of a similar number of individuals in each group. In other cases the groups are created based on income ranges which leads to having different numbers of individuals in the different groups. The table above shows a computation of inequality indices for four groups. For each group the number of individuals per group A and the total income in that group E is specified.
The parameter pairs A and E need to be sorted for the computation of the Gini coefficient. A and E have to be sorted so that the values in the column "Income per individual" are lined up in ascending order.
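The spreadsheet computation described in this section can also be sketched in Python. The per-group terms below are the standard grouped-data forms of the Gini coefficient (trapezoid rule on the Lorenz curve), the Hoover index and the symmetrized Theil index, and they are assumed here to correspond to the columns of the spreadsheet. The function name and the sample groups are made up, and every group needs a positive income for the logarithm.

```python
import math


def grouped_indices(groups):
    """Gini, Hoover and symmetrized Theil from (members, income) pairs."""
    # Sort by income per individual, as required for the Gini column.
    rows = sorted(groups, key=lambda g: g[1] / g[0])
    sum_a = sum(a for a, _ in rows)
    sum_e = sum(e for _, e in rows)
    accumulated = 0.0
    sum_g = sum_h = sum_t = 0.0
    for a, e in rows:
        d = e / sum_e - a / sum_a         # relative deviation D
        accumulated += e                  # accumulated income K
        sum_g += (2.0 * accumulated - e) * a
        sum_h += abs(d)                   # D weighted with its own sign
        sum_t += math.log(e / a) * d      # D weighted with ln(income per individual)
    return {
        "gini": 1.0 - sum_g / (sum_a * sum_e),
        "hoover": sum_h / 2.0,
        "theil": sum_t / 2.0,
    }


# Four made-up groups of ten members each, deliberately unsorted.
groups = [(10, 40.0), (10, 10.0), (10, 30.0), (10, 20.0)]
result = grouped_indices(groups)

for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

Only the Gini term depends on the order of the rows; the Hoover and Theil sums are unaffected by sorting.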
Proper use
- When using income metrics, it has to be made clear how income should be defined. Should it include capital gains, imputed house rents from home ownership, and gifts? If these income sources or alleged income sources are ignored, how might this bias the analysis? How should non-paid work be handled? Wealth or consumption may be more appropriate measures in some situations. Broader quality of life metrics might be useful.
- The comparison of inequality measures requires that the segmentation of compared groups into quintiles should be similar.
- Distinguish properly whether the basic unit of measurement is households or individuals. The Gini value for households is always lower than for individuals because of income pooling and intra-family transfers. In addition, households have a varying number of members. The metrics will be influenced either upward or downward depending on which unit of measurement is used.
- Consider life cycle effects. In most Western societies, an individual tends to start life with little or no income, gradually increase income till about age 50, after which incomes will decline, eventually becoming negative. This affects the conclusions which can be drawn from a measured inequality. It has been estimated that 30% of measured income inequality is due to the inequality an individual experiences as they go through the various stages of life.
- Clarify whether real or nominal income distributions should be used. What effect will inflation have on absolute measures? Do some groups feel the effect of inflation more than others?
- When drawing conclusions from inequality measurements, consider how the benefits of government spending should be allocated. How does the existence of a social security safety net influence the definition of absolute measures of poverty? Do government programs support some income groups more than others?
- Inequality metrics measure inequality. They do not measure possible causes of income inequality. Some alleged causes include: life cycle effects, inherited characteristics, willingness to take chances, the leisure/industriousness choice, inherited wealth, economic circumstances, education and training, discrimination, and market imperfections.
- Inequality metrics are anonymous. They ignore certain effects of income mobility, in which the identity of "who is rich" and "who is poor" is considered. For example, at a particular time, Alice may have $10 and Bob may have $2. At some time later, Bob may have $10 and Alice may have $2. The inequality index will be the same in both cases and rather high. However, the inequality of the average will be zero, since Alice's and Bob's average holdings are equal. The $8 which has changed hands is a measure of wealth mobility and the average inequality is generally higher than the inequality of the average.
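The Alice and Bob example from the last point can be worked through in Python. The sketch uses the Hoover index as the inequality index; this choice is an assumption of the example, and any other anonymous index would show the same pattern.

```python
def hoover(incomes):
    """Share of the total that would have to move to equalise all holdings."""
    total = sum(incomes)
    mean = total / len(incomes)
    return 0.5 * sum(abs(x - mean) for x in incomes) / total


period_1 = {"Alice": 10.0, "Bob": 2.0}
period_2 = {"Alice": 2.0, "Bob": 10.0}

# Inequality in each period, ignoring who holds which amount.
inequality_1 = hoover(list(period_1.values()))
inequality_2 = hoover(list(period_2.values()))
average_inequality = (inequality_1 + inequality_2) / 2.0

# Inequality of each person's holdings averaged over both periods.
average_holdings = [(period_1[name] + period_2[name]) / 2.0 for name in period_1]
inequality_of_average = hoover(average_holdings)

print(f"average inequality:    {average_inequality:.3f}")
print(f"inequality of average: {inequality_of_average:.3f}")
```

The index is the same in both periods, while the inequality of the averaged holdings is zero because Alice and Bob each hold 6 on average.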
Inequality, growth, and progress
Evidence from a broad panel of recent academic studies shows that there is a nonlinear relation between income inequality and the rate of growth and investment. Very high inequality slows growth; moderate inequality encourages growth. Studies differ on the effect of very low inequality. Robert J. Barro of Harvard University found in his study "Inequality and Growth in a Panel of Countries" that higher inequality tends to retard growth in poor countries and encourage growth in well-developed regions.
In their study for the World Institute for Development Economics Research, Giovanni Andrea Cornia and Julius Court reach slightly different conclusions. The authors recommend pursuing moderation in the distribution of wealth as well, and in particular avoiding the extremes. Both very high egalitarianism and very high inequality cause slow growth. Considering the inequalities in economically well developed countries, public policy should target an 'efficient inequality range'. The authors claim that this efficiency range lies roughly between Gini coefficient values of 0.25 and 0.40. It has also been shown that in perfect markets inequality does not influence growth.
The precise shape of the inequality-growth curve obviously varies across countries depending upon their resource endowment, history, remaining levels of absolute poverty and available stock of social programs, as well as on the distribution of physical and human capital.