Bertrand's ballot theorem

In combinatorics, Bertrand's ballot problem is the question: "In an election where candidate A receives p votes and candidate B receives q votes with p > q, what is the probability that A will be strictly ahead of B throughout the count?" The answer is
$$\frac{p-q}{p+q}.$$
The result was first published by W. A. Whitworth in 1878, but is named after Joseph Louis François Bertrand who rediscovered it in 1887.
In Bertrand's original paper, he sketches a proof based on a general formula for the number of favourable sequences using a recursion relation. He remarks that it seems probable that such a simple result could be proved by a more direct method. Such a proof was given by Désiré André, based on the observation that the unfavourable sequences can be divided into two equally probable cases, one of which is easily computed; he proves the equality by an explicit bijection. A variation of his method is popularly known as André's reflection method, although André did not use any reflections.

Example

Suppose there are 5 voters, of whom 3 vote for candidate A and 2 vote for candidate B. There are ten possibilities for the order of the votes cast:

AAABB
AABAB
ABAAB
BAAAB
AABBA
ABABA
BAABA
ABBAA
BABAA
BBAAA

For the order AABAB, the tally of the votes as the election progresses is:

Candidate	A	A	B	A	B
A	1	2	2	3	3
B	0	0	1	1	2

For each column the tally for A is larger than the tally for B, so A is always strictly ahead of B. For the order AABBA the tally of the votes as the election progresses is:

Candidate	A	A	B	B	A
A	1	2	2	2	3
B	0	0	1	2	2

For this order, B is tied with A after the fourth vote, so A is not always strictly ahead of B.
Of the 10 possible orders, A is always ahead of B only for AAABB and AABAB. So the probability that A will always be strictly ahead is
$$\frac{2}{10} = \frac{1}{5},$$
and this is indeed equal to $\frac{3-2}{3+2}$, as the theorem predicts.
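The enumeration in this example can be checked mechanically. The following is a minimal brute-force sketch, not an efficient method; the function name strictly_ahead_probability is chosen here for illustration and does not come from the source.

```python
from fractions import Fraction
from itertools import combinations


def strictly_ahead_probability(p, q):
    """Brute-force probability that A is strictly ahead throughout the count."""
    n = p + q
    favourable = 0
    total = 0
    # Each counting order is determined by the positions of A's votes.
    for a_positions in combinations(range(n), p):
        a_set = set(a_positions)
        lead = 0  # (votes for A) - (votes for B) so far
        always_ahead = True
        for i in range(n):
            lead += 1 if i in a_set else -1
            if lead <= 0:
                always_ahead = False
                break
        favourable += always_ahead
        total += 1
    return Fraction(favourable, total)


print(strictly_ahead_probability(3, 2))  # 1/5
```

The running time grows like the binomial coefficient of the vote counts, so this is only practical for small p and q.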

Equivalent problems

Rather than computing the probability that a random vote counting order has the desired property, one can instead compute the number of favourable counting orders, then divide by the total number of ways in which the votes could have been counted. The total number of ways is the binomial coefficient $\binom{p+q}{p}$; Bertrand's proof shows that the number of favourable orders in which to count the votes is $\binom{p+q}{p}\frac{p-q}{p+q}$. Dividing the second quantity by the first indeed gives $\frac{p-q}{p+q}$.
Another equivalent problem is to calculate the number of random walks on the integers that consist of n steps of unit length, beginning at the origin and ending at the point m, that never become negative. Assuming n and m have the same parity and n ≥ m ≥ 0, this number is
$$\binom{n}{\frac{n+m}{2}} - \binom{n}{\frac{n+m}{2}+1} = \frac{m+1}{\frac{n+m}{2}+1}\binom{n}{\frac{n+m}{2}}.$$
When m = 0 and n is even, this gives the Catalan number $\frac{1}{\frac{n}{2}+1}\binom{n}{\frac{n}{2}}$.
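As a sanity check on the random-walk count, the sketch below compares the difference of two binomial coefficients, with (n + m)/2 up-steps in the first and one more in the second, against direct enumeration of all walks. The function names are illustrative, and math.comb requires Python 3.8 or later.

```python
from itertools import product
from math import comb


def walks_formula(n, m):
    """Closed-form count; assumes n and m have the same parity and n >= m >= 0."""
    k = (n + m) // 2  # number of +1 steps needed to end at m
    return comb(n, k) - comb(n, k + 1)


def walks_brute_force(n, m):
    """Count n-step +/-1 walks from 0 to m that never become negative."""
    count = 0
    for steps in product((1, -1), repeat=n):
        position = 0
        for step in steps:
            position += step
            if position < 0:
                break
        else:
            # Reached only if the walk never went below zero.
            if position == m:
                count += 1
    return count


print(walks_formula(6, 0))  # 5, the Catalan number C_3
```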

Proof by reflection

For A to be strictly ahead of B throughout the counting of the votes, there can be no ties. Separate the counting sequences according to the first vote. Any sequence that begins with a vote for B must reach a tie at some point, because A eventually wins. For any sequence that begins with A and reaches a tie, reflect the votes up to the point of the first tie (so that every A becomes a B, and vice versa) to obtain a sequence that begins with B. Hence every sequence that begins with A and reaches a tie is in one-to-one correspondence with a sequence that begins with B, and the probability that a sequence begins with B is $q/(p+q)$, so the probability that A always leads the vote is
$$1 - (\text{probability of sequences that tie at some point}) = 1 - 2\,\frac{q}{p+q} = \frac{p-q}{p+q}.$$
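The reflection step can be made concrete. The sketch below, with helper names invented for illustration, represents a count as a string of the letters A and B, swaps the letters up to the first tie, and leaves the rest unchanged. Because the prefix up to a tie contains equally many A's and B's, the swap preserves both vote totals.

```python
from itertools import combinations


def sequences(p, q):
    """Yield every counting order with p votes for A and q votes for B."""
    n = p + q
    for a_positions in combinations(range(n), p):
        chosen = set(a_positions)
        yield "".join("A" if i in chosen else "B" for i in range(n))


def first_tie(seq):
    """Return the number of votes counted at the first tie, or None."""
    lead = 0
    for count, vote in enumerate(seq, start=1):
        lead += 1 if vote == "A" else -1
        if lead == 0:
            return count
    return None


def reflect(seq):
    """Swap A and B in the part of seq up to and including the first tie."""
    t = first_tie(seq)
    if t is None:
        raise ValueError("sequence never reaches a tie")
    return seq[:t].translate(str.maketrans("AB", "BA")) + seq[t:]


print(reflect("AABBA"))  # BBAAA
```

Applying reflect twice returns the original sequence, which is why the map is a one-to-one correspondence between tied sequences that begin with A and sequences that begin with B.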

Proof by induction

Another method of proof is by mathematical induction:

We loosen the condition p > q to p ≥ q. Clearly, the theorem is correct when p = q, since in this case the first candidate will not be strictly ahead after all the votes have been counted (so the probability is 0).
Clearly the theorem is true if p > 0 and q = 0, when the probability is 1, given that the first candidate receives all the votes; it is also true when p = q > 0, as we have just seen.
Assume it is true both when p = a − 1 and q = b, and when p = a and q = b − 1, with a > b > 0. Then considering the case with p = a and q = b, the last vote counted is either for the first candidate with probability $a/(a+b)$, or for the second with probability $b/(a+b)$. So the probability of the first being ahead throughout the count to the penultimate vote counted (and also after the final vote, since a > b) is:
$$\frac{a}{a+b}\cdot\frac{(a-1)-b}{a+b-1} + \frac{b}{a+b}\cdot\frac{a-(b-1)}{a+b-1} = \frac{a-b}{a+b}.$$
And so it is true for all p and q with p > q > 0.
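The inductive step is a recurrence on the last vote counted, and it can be evaluated directly with exact rational arithmetic. The following is a small sketch of that recurrence; the function name lead_probability is an assumption made for this example.

```python
from fractions import Fraction
from functools import lru_cache


@lru_cache(maxsize=None)
def lead_probability(p, q):
    """Probability that the first candidate is strictly ahead throughout.

    Assumes p >= q >= 0. Base cases follow the induction proof:
    probability 0 when p == q, probability 1 when q == 0 < p.
    """
    if p == q:
        return Fraction(0)
    if q == 0:
        return Fraction(1)
    n = p + q
    # Condition on whether the last vote counted is for the first
    # candidate (probability p/n) or the second (probability q/n).
    return (Fraction(p, n) * lead_probability(p - 1, q)
            + Fraction(q, n) * lead_probability(p, q - 1))


print(lead_probability(3, 2))  # 1/5
```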

Proof by permutation

A simple proof is based on the beautiful Cycle Lemma of Dvoretzky and Motzkin.
Call a ballot sequence dominating if A is strictly ahead of B throughout the counting of the votes. The Cycle Lemma asserts that any sequence of p A's and q B's, where p > q, has precisely p − q dominating cyclic permutations. To see this, just arrange the given sequence of p + q A's and B's in a circle and repeatedly remove adjacent pairs AB until only p − q A's remain. Each of these A's was the start of a dominating cyclic permutation before anything was removed. So p − q out of the p + q cyclic permutations of any arrangement of p A votes and q B votes are dominating.
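The Cycle Lemma is easy to test numerically. The sketch below counts, for a given string of A's and B's, how many of its rotations (indexed by starting position) are dominating; the function names are illustrative only.

```python
def is_dominating(seq):
    """True if A is strictly ahead of B after every vote in seq."""
    lead = 0
    for vote in seq:
        lead += 1 if vote == "A" else -1
        if lead <= 0:
            return False
    return True


def dominating_rotations(seq):
    """Number of starting positions whose cyclic rotation is dominating."""
    return sum(is_dominating(seq[i:] + seq[:i]) for i in range(len(seq)))


print(dominating_rotations("AABAB"))  # 1, which is 3 - 2
```

Rotations are counted by starting position, so a periodic sequence contributes the same string more than once; this matches the way the lemma counts cyclic permutations.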

Bertrand's and André's proofs

Bertrand expressed the solution as
$$\frac{2m-\mu}{\mu},$$
where $\mu = p + q$ is the total number of voters and $m = p$ is the number of voters for the first candidate. He states that the result follows from the formula
$$P_{m+1,\mu+1} = P_{m,\mu} + P_{m+1,\mu},$$
where $P_{m,\mu}$ is the number of favourable sequences, but "it seems probable that such a simple result could be shown in a more direct way". Indeed, a more direct proof was soon produced by Désiré André. His approach is often mistakenly labelled "the reflection principle" by modern authors but in fact uses a permutation. He shows that the "unfavourable" sequences consist of an equal number of sequences that begin with A as those that begin with B. Every sequence that begins with B is unfavourable, and there are $\binom{p+q-1}{q-1}$ such sequences with a B followed by an arbitrary sequence of (q − 1) B's and p A's. Each unfavourable sequence that begins with A can be transformed to an arbitrary sequence of (q − 1) B's and p A's by finding the first B that violates the rule (by causing the vote counts to tie) and deleting it, and interchanging the order of the remaining parts. To reverse the process, take any sequence of (q − 1) B's and p A's and search from the end to find where the number of A's first exceeds the number of B's, and then interchange the order of the parts and place a B in between. For example, the unfavourable sequence AABBABAA corresponds uniquely to the arbitrary sequence ABAAAAB. From this, it follows that the number of favourable sequences of p A's and q B's is
$$\binom{p+q}{q} - 2\binom{p+q-1}{q-1} = \binom{p+q}{q}\,\frac{p-q}{p+q},$$
and thus the required probability is
$$\frac{p-q}{p+q},$$
as expected.
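André's correspondence, as described above, can be written out as a pair of string operations. This is a sketch of that description rather than a transcription of André's paper; the function names and the string representation are assumptions made for the example.

```python
from itertools import combinations


def all_sequences(a_count, b_count):
    """Yield every string with a_count A's and b_count B's."""
    n = a_count + b_count
    for a_positions in combinations(range(n), a_count):
        chosen = set(a_positions)
        yield "".join("A" if i in chosen else "B" for i in range(n))


def first_tie_index(seq):
    """Index of the vote that first ties the count, or None."""
    lead = 0
    for i, vote in enumerate(seq):
        lead += 1 if vote == "A" else -1
        if lead == 0:
            return i
    return None


def andre_forward(seq):
    """Delete the first tying B and interchange the two remaining parts."""
    i = first_tie_index(seq)
    if not seq.startswith("A") or i is None:
        raise ValueError("expected an unfavourable sequence beginning with A")
    return seq[i + 1:] + seq[:i]


def andre_inverse(arbitrary):
    """Scan from the end until A's first exceed B's, then reinsert a B."""
    excess = 0
    for j in range(len(arbitrary) - 1, -1, -1):
        excess += 1 if arbitrary[j] == "A" else -1
        if excess > 0:
            return arbitrary[j:] + "B" + arbitrary[:j]
    raise ValueError("expected more A's than B's")


print(andre_forward("AABBABAA"))  # ABAAAAB
print(andre_inverse("ABAAAAB"))   # AABBABAA
```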

Variant: ties allowed

The original problem is to find the probability that the first candidate is always strictly ahead in the vote count. One may instead consider the problem of finding the probability that the second candidate is never ahead (that is, with ties allowed). In this case, the answer is
$$\frac{p+1-q}{p+1}.$$
The variant problem can be solved by the reflection method in a similar way to the original problem. The number of possible vote sequences is $\binom{p+q}{q}$. Call a sequence "bad" if the second candidate is ever ahead; if the number of bad sequences can be enumerated, then the number of "good" sequences can be found by subtraction and the probability can be computed.
Represent a voting sequence as a lattice path on the Cartesian plane as follows:

Start the path at $(0, 0)$.
Each time a vote for the first candidate is received move right 1 unit.
Each time a vote for the second candidate is received move up 1 unit.

Each such path corresponds to a unique sequence of votes and will end at $(p, q)$. A sequence is 'good' exactly when the corresponding path never goes above the diagonal line y = x; equivalently, a sequence is 'bad' exactly when the corresponding path touches the line y = x + 1.
For each 'bad' path P, define a new path P′ by reflecting the part of P up to the first point it touches the line y = x + 1 across that line. P′ is a path from $(-1, 1)$ to $(p, q)$. The same operation applied again restores the original P. This produces a one-to-one correspondence between the 'bad' paths and the paths from $(-1, 1)$ to $(p, q)$. The number of these paths is $\binom{p+q}{q-1}$, and so that is the number of 'bad' sequences. This leaves the number of 'good' sequences as
$$\binom{p+q}{q} - \binom{p+q}{q-1} = \binom{p+q}{q}\,\frac{p+1-q}{p+1}.$$
Since there are $\binom{p+q}{q}$ sequences altogether, the probability of a sequence being good is $\frac{p+1-q}{p+1}$.
In fact, the solutions to the original problem and the variant problem are easily related. For candidate A to be strictly ahead throughout the vote count, they must receive the first vote, and for the remaining votes (ignoring the first) they must be either strictly ahead or tied throughout the count. Hence the solution to the original problem is
$$\frac{p}{p+q}\cdot\frac{(p-1)+1-q}{(p-1)+1} = \frac{p-q}{p+q},$$
as required.
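Conversely, the tie case can be derived from the non-tie case. Note that the number of non-tie sequences with p + 1 votes for A is equal to the number of tie sequences with p votes for A, since removing the leading A from a non-tie sequence leaves a tie sequence. The number of non-tie sequences with p + 1 votes for A is $\frac{p+1-q}{p+1+q}\binom{p+1+q}{q}$, which by algebraic manipulation is $\frac{p+1-q}{p+1}\binom{p+q}{q}$, so the fraction of tie sequences among all sequences with p votes for A is $\frac{p+1-q}{p+1}$.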
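The count of 'good' sequences in the variant can be checked against enumeration. In the sketch below, never_behind_formula computes the total number of sequences minus the number of reflected 'bad' paths, and never_behind_count enumerates all orders directly; both names are hypothetical, and math.comb requires Python 3.8 or later.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def never_behind_count(p, q):
    """Brute-force count of orders in which B is never ahead of A."""
    n = p + q
    good = 0
    for a_positions in combinations(range(n), p):
        a_set = set(a_positions)
        lead = 0
        for i in range(n):
            lead += 1 if i in a_set else -1
            if lead < 0:
                break
        else:
            # No prefix had B ahead, so the order is 'good'.
            good += 1
    return good


def never_behind_formula(p, q):
    """Total sequences minus 'bad' sequences, for p >= q >= 0."""
    bad = comb(p + q, q - 1) if q > 0 else 0
    return comb(p + q, q) - bad


print(Fraction(never_behind_formula(3, 2), comb(5, 2)))  # 1/2
```

For p = 3 and q = 2 this gives 5 good orders out of 10, in agreement with (p + 1 − q)/(p + 1) = 2/4.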
