Condorcet's jury theorem
Condorcet's jury theorem is a political science theorem about the relative probability of a given group of individuals arriving at a correct decision. The theorem was first expressed by the Marquis de Condorcet in his 1785 work Essay on the Application of Analysis to the Probability of Majority Decisions.
The assumptions of the simplest version of the theorem are that a group wishes to reach a decision by majority vote. One of the two outcomes of the vote is correct, and each voter has an independent probability p of voting for the correct decision. The theorem asks how many voters we should include in the group. The result depends on whether p is greater than or less than 1/2:
- If p is greater than 1/2, then adding more voters increases the probability that the majority decision is correct. In the limit, the probability that the majority votes correctly approaches 1 as the number of voters increases.
- On the other hand, if p is less than 1/2, then adding more voters makes things worse: the optimal jury consists of a single voter.
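The two cases above can be illustrated empirically. The following is a minimal Monte Carlo sketch, not part of the original theorem; the function name `simulate_majority`, the trial count and the fixed seed are illustrative assumptions.

```python
import random

def simulate_majority(n, p, trials=20000, seed=0):
    """Estimate the chance that a strict majority of n independent voters is correct.

    Each voter is correct with probability p; n is assumed to be odd.
    """
    rng = random.Random(seed)  # fixed seed so that runs are repeatable
    wins = 0
    for _ in range(trials):
        correct_votes = sum(rng.random() < p for _ in range(n))
        if 2 * correct_votes > n:
            wins += 1
    return wins / trials

# Competent voters (p = 0.6) benefit from a larger jury,
# while incompetent voters (p = 0.4) do worse as the jury grows.
for p in (0.6, 0.4):
    for n in (1, 11, 51):
        print(f"p={p}, n={n}: {simulate_majority(n, p):.3f}")
```

With these settings the estimate for p = 0.6 rises as n grows, while the estimate for p = 0.4 falls, in line with the two bullet points.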
Proofs
Proof 1: Calculating the probability that two additional voters change the outcome
To avoid the need for a tie-breaking rule, we assume n is odd. Essentially the same argument works for even n if ties are broken by fair coin-flips. Now suppose we start with n voters, and let m of these voters vote correctly.
Consider what happens when we add two more voters. The majority vote changes in only two cases:
- m was one vote too small to get a majority of the n votes, but both new voters voted correctly.
- m was just equal to a majority of the n votes, but both new voters voted incorrectly.
The rest of the time, the two new votes either cancel each other out or do not change which side holds the majority. So we only need to consider the case in which a single vote among the first n separates a correct majority from an incorrect one. Restricting our attention to this case, we can imagine that the first n-1 votes cancel out and that the deciding vote is cast by the n-th voter. In this case the probability of getting a correct majority is just p. Now suppose we send in the two extra voters. The probability that they change an incorrect majority to a correct majority is $(1 - p)p^2$, while the probability that they change a correct majority to an incorrect majority is $p(1 - p)^2$. The first of these probabilities is greater than the second if and only if p > 1/2, proving the theorem.
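The argument of Proof 1 can be checked numerically. The sketch below compares the actual change in the majority probability when going from n to n + 2 voters with the change predicted by the two pivotal cases; the function names are illustrative assumptions, not notation from the proof.

```python
from math import comb

def majority_prob(n, p):
    """Exact probability that more than half of n (odd) independent voters are correct."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(n // 2 + 1, n + 1))

def gain_from_two_voters(n, p):
    """Net change predicted by the two pivotal cases of Proof 1."""
    k = (n - 1) // 2  # number of correct votes that is one short of a majority
    one_short = comb(n, k) * p**k * (1 - p)**(n - k)
    just_enough = comb(n, k + 1) * p**(k + 1) * (1 - p)**(n - k - 1)
    # Gain when both newcomers are right, loss when both are wrong.
    return one_short * p**2 - just_enough * (1 - p)**2

n = 5
for p in (0.4, 0.6):
    direct = majority_prob(n + 2, p) - majority_prob(n, p)
    print(p, direct, gain_from_two_voters(n, p))
```

The two printed numbers agree for each p, and their common sign is positive for p = 0.6 and negative for p = 0.4.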
Proof 2: Calculating the probability that the decision is correct
This proof is direct; it just sums up the probabilities of the majorities. Each term of the sum multiplies the number of combinations of a majority by the probability of that majority. Each majority is counted using a combination, n items taken k at a time, where n is the jury size, and k is the size of the majority. Probabilities range from 0 (the vote is always wrong) to 1 (the vote is always right). Each person decides independently, so the probabilities of their decisions multiply. The probability of each correct decision is p. The probability of an incorrect decision, q, is the complement of p, i.e. 1 − p. The power notation, i.e. $p^x$, is shorthand for p multiplied by itself x times. Committee or jury accuracies can be easily estimated by using this approach in computer spreadsheets or programs.
First let us take the simplest case of n = 3, p = 0.8. We need to show that 3 people have a higher than 0.8 chance of being right. Indeed:
$$0.8^3 + 3 \times 0.8^2 \times 0.2 = 0.512 + 0.384 = 0.896.$$
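The sum described in this proof can be sketched as a short program. This is a minimal illustration assuming an odd jury size; the function name `majority_prob` is hypothetical.

```python
from math import comb

def majority_prob(n, p):
    """Sum, over every majority size k, of C(n, k) * p^k * q^(n - k)."""
    q = 1 - p               # probability that a single voter is wrong
    majority = n // 2 + 1   # smallest number of votes that forms a majority
    return sum(comb(n, k) * p**k * q**(n - k) for k in range(majority, n + 1))

print(round(majority_prob(3, 0.8), 3))  # 0.896

# The same function gives the accuracy of larger juries.
for n in (5, 7, 9):
    print(n, round(majority_prob(n, 0.8), 4))
```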
Asymptotics
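The probability of a correct majority decision P(n, p), when the individual probability p is close to 1/2, grows linearly in terms of p − 1/2. For n voters, each having probability p of deciding correctly, and for odd n:
$$P(n, p) = \frac{1}{2} + c_1 \left(p - \frac{1}{2}\right) + c_3 \left(p - \frac{1}{2}\right)^3 + O\left(\left(p - \frac{1}{2}\right)^5\right),$$
where
$$c_1 = \binom{n}{\lfloor n/2 \rfloor} \frac{\lfloor n/2 \rfloor + 1}{4^{\lfloor n/2 \rfloor}} = \sqrt{\frac{2n}{\pi}} \left(1 + \frac{1}{4n} + O(n^{-2})\right),$$
and the asymptotic approximation in terms of n is very accurate. The expansion is only in odd powers and $c_3 < 0$. In simple terms, this says that when the decision is difficult (p close to 1/2), the gain by having n voters grows proportionally to $\sqrt{n}$.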
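A numerical check of the linear regime is sketched below. It assumes that the linear coefficient is c_1 = C(n, ⌊n/2⌋)(⌊n/2⌋ + 1)/4^⌊n/2⌋ with the approximation sqrt(2n/π)(1 + 1/(4n)); the function names are illustrative.

```python
from math import comb, pi, sqrt

def majority_prob(n, p):
    """Exact probability that more than half of n (odd) independent voters are correct."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(n // 2 + 1, n + 1))

def c1_exact(n):
    """Slope of the majority probability at p = 1/2 (assumed closed form)."""
    k = n // 2
    return comb(n, k) * (k + 1) / 4**k

def c1_asymptotic(n):
    """Large-n approximation of the slope, of order sqrt(n)."""
    return sqrt(2 * n / pi) * (1 + 1 / (4 * n))

n, p = 101, 0.51
linear = 0.5 + c1_exact(n) * (p - 0.5)
print("c1 exact:     ", c1_exact(n))
print("c1 asymptotic:", c1_asymptotic(n))
print("linear approximation:", linear, "exact:", majority_prob(n, p))
```

For p this close to 1/2 the linear term accounts for almost all of the gain over 1/2, and the small remaining gap is negative, as expected from a negative cubic coefficient.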
Non-uniform probabilities
Condorcet's theorem assumes that all voters have the same competence, i.e., the probability of deciding correctly is uniform among all voters. In practice, different voters have different competence levels. A stronger version of the theorem requires only that the average of the individual competence levels of the voters is slightly greater than half.
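When competences differ, the number of correct votes is no longer binomial, but the majority probability can still be computed exactly for independent voters. The following is a minimal sketch using a simple dynamic programme over the voters; the function name and the example competence levels are illustrative assumptions, and the code does not prove the stronger version of the theorem.

```python
def majority_prob_hetero(probs):
    """Probability that more than half of the independent voters are correct,
    where voter i is correct with probability probs[i] (odd number of voters)."""
    dist = [1.0]  # dist[k] = probability of exactly k correct votes so far
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            new[k] += mass * (1 - p)   # this voter is wrong
            new[k + 1] += mass * p     # this voter is right
        dist = new
    n = len(probs)
    return sum(dist[n // 2 + 1:])

# Three voters with an average competence of 0.65.
print(majority_prob_hetero([0.9, 0.6, 0.45]))
```

With equal competences the function reduces to the uniform case, for example `majority_prob_hetero([0.8, 0.8, 0.8])` reproduces the value for n = 3, p = 0.8.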
Correlated votes
Condorcet's theorem assumes that the votes are statistically independent. But real votes are not independent: voters are often influenced by other voters, causing a peer pressure effect. The non-asymptotic part of Condorcet's jury theorem does not hold for correlated votes in general. This is not necessarily a problem since the theorem may still hold under sufficiently general assumptions. A strong version of the theorem does not require voter independence, but takes into account the degree to which votes may be correlated.
In a jury comprising an odd number of jurors $n$, let $p$ be the probability of a juror voting for the correct alternative and $c$ be the (second-order) correlation coefficient between any two correct votes. If all higher-order correlation coefficients in the Bahadur representation of the joint probability distribution of votes are equal to zero, and $(p, c)$ is an admissible pair, then:
The probability of the jury collectively reaching the correct decision under simple majority is given by:
$$P(n, p, c) = I_p\left(\tfrac{n+1}{2}, \tfrac{n+1}{2}\right) + 0.5\,c\,(n-1)(0.5 - p)\,\frac{\partial I_p\left(\tfrac{n+1}{2}, \tfrac{n+1}{2}\right)}{\partial p},$$
where $I_p$ is the regularized incomplete beta function.
Example: Take a jury of three jurors (n = 3), with individual competence p = 0.55 and second-order correlation c = 0.4. Then the probability that the jury reaches the correct decision is 0.54505. The competence of the jury is lower than the competence of a single juror, which equals 0.55. Moreover, enlarging the jury by two jurors (n = 5) decreases the jury competence to approximately 0.5196.
Note that p = 0.55 and c = 0.4 is an admissible pair of parameters. For n = 5 and p = 0.55, the maximum admissible second-order correlation coefficient equals approximately 0.43.
The above example shows that when the individual competence is low but the correlation is high:
- The collective competence under simple majority may fall below that of a single juror,
- Enlarging the jury may decrease its collective competence.
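The effect of correlation can be explored numerically. The sketch below assumes that the jury competence takes the form P(n, p, c) = I_p(a, a) + 0.5 c (n − 1)(0.5 − p) ∂I_p(a, a)/∂p with a = (n + 1)/2, and uses the standard identity that I_p(a, a) equals the probability of a strict majority among n independent voters. The function names and the parameter values p = 0.55, c = 0.4 are illustrative assumptions, and the code does not check whether a pair (p, c) is admissible.

```python
from math import comb

def majority_prob(n, p):
    """I_p((n+1)/2, (n+1)/2): majority probability for n independent voters (n odd)."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(n // 2 + 1, n + 1))

def majority_prob_derivative(n, p):
    """Derivative of majority_prob with respect to p."""
    k = n // 2
    return n * comb(n - 1, k) * (p * (1 - p))**k

def correlated_jury_competence(n, p, c):
    """Assumed formula: independent-vote term plus a correlation correction."""
    correction = 0.5 * c * (n - 1) * (0.5 - p) * majority_prob_derivative(n, p)
    return majority_prob(n, p) + correction

for n in (3, 5):
    print(n, correlated_jury_competence(n, 0.55, 0.4))
```

Under these assumptions the result is about 0.545 for three jurors and about 0.520 for five, both below the single-juror competence of 0.55, whereas setting c = 0 recovers the independent case.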
Indirect majority systems
Condorcet's theorem considers a direct majority system, in which all votes are counted directly towards the final outcome. Many countries use an indirect majority system, in which the voters are divided into groups. The voters in each group decide on an outcome by an internal majority vote; then, the groups decide on the final outcome by a majority vote among them. For example, suppose there are 15 voters. In a direct majority system, a decision is accepted whenever at least 8 votes support it. Suppose now that the voters are grouped into 3 groups of size 5 each. A decision is accepted whenever at least 2 groups support it, and in each group, a decision is accepted whenever at least 3 voters support it. Therefore, a decision may be accepted even if only 6 voters support it. Boland, Proschan and Tong prove that, when the voters are independent and p > 1/2, a direct majority system (as in Condorcet's theorem) always has a higher chance of accepting the correct decision than any indirect majority system.
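The 15-voter example can be worked through numerically. The following sketch compares the direct system with the 3-groups-of-5 indirect system for independent voters; the function names and the choice p = 0.6 are illustrative assumptions.

```python
from math import comb

def majority_prob(n, p):
    """Probability that more than half of n (odd) independent voters are correct."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(n // 2 + 1, n + 1))

def indirect_majority_prob(groups, group_size, p):
    """Two-tier vote: each group decides by majority, then the groups vote."""
    group_correct = majority_prob(group_size, p)  # chance that one group is right
    return majority_prob(groups, group_correct)   # majority among the groups

p = 0.6
print("direct, 15 voters:       ", majority_prob(15, p))
print("indirect, 3 groups of 5: ", indirect_majority_prob(3, 5, p))
```

For p = 0.6 the direct system comes out ahead, consistent with the result of Boland, Proschan and Tong quoted above.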
Berg and Paroush consider multi-tier voting hierarchies, which may have several levels with different decision-making rules in each level. They study the optimal voting structure, and compare the competence against the benefit of time-saving and other expenses.
Other limitations
Condorcet's theorem is correct given its assumptions, but its assumptions are unrealistic in practice. Besides the issue of correlated votes, some objections that are commonly raised are:
1. The notion of "correctness" may not be meaningful when making policy decisions, as opposed to deciding questions of fact. Some defenders of the theorem hold that it is applicable when voting is aimed at determining which policy best promotes the public good, rather than at merely expressing individual preferences. On this reading, what the theorem says is that although each member of the electorate may only have a vague perception of which of two policies is better, majority voting has an amplifying effect. The "group competence level", as represented by the probability that the majority chooses the better alternative, increases towards 1 as the size of the electorate grows, assuming that each voter is more often right than wrong.
2. The theorem doesn't directly apply to decisions between more than two outcomes. This critical limitation was in fact recognized by Condorcet, and in general it is very difficult to reconcile individual decisions between three or more outcomes, although List and Goodin present evidence to the contrary. This limitation may also be overcome by means of a sequence of votes on pairs of alternatives, as is commonly realized via the legislative amendment process.
3. Sincere voting, in which every juror votes according to their own beliefs, might not be a Nash equilibrium under certain circumstances.
Despite these objections, Condorcet's jury theorem provides a theoretical basis for democracy, even if a somewhat idealized one, as well as a basis for deciding questions of fact by jury trial, and as such continues to be studied by political scientists.
The theorem in other disciplines
The Condorcet jury theorem has recently been used to conceptualize score integration when several physician readers independently evaluate images for disease activity. This task arises in central reading performed during clinical trials and has similarities to voting. According to the authors, the application of the theorem can translate individual reader scores into a final score in a fashion that is mathematically sound, mathematically tractable for further analysis, and consistent with the scoring task at hand.
The Condorcet jury theorem is also used in ensemble learning in the field of machine learning. An ensemble method combines the predictions of many individual classifiers by majority voting. Assuming that each of the individual classifiers predicts with slightly greater than 50% accuracy and that their predictions are independent, the accuracy of the ensemble will be far greater than that of the individual classifiers.