Poisson binomial distribution


In probability theory and statistics, the Poisson binomial distribution is the discrete probability distribution of a sum of independent Bernoulli trials that are not necessarily identically distributed. The concept is named after Siméon Denis Poisson.
In other words, it is the probability distribution of the
number of successes in a sequence of n independent yes/no experiments with success probabilities. The ordinary binomial distribution is a special case of the Poisson binomial distribution, when all success probabilities are the same, that is.

Mean and Variance

Since a Poisson binomial distributed variable is a sum of n independent Bernoulli distributed variables, its mean and variance will simply be sums of the mean and variance of the n Bernoulli distributions:
For fixed values of the mean and size, the variance is maximal when all success probabilities are equal and we have a binomial distribution. When the mean is fixed, the variance is bounded from above by the variance of the Poisson distribution with the same mean which is attained asymptotically as n tends to infinity.

Probability Mass Function

The probability of having k successful trials out of a total of n can be written as the sum
where is the set of all subsets of k integers that can be selected from. For example, if n = 3, then. is the complement of, i.e..
will contain elements, the sum over which is infeasible to compute in practice unless the number of trials n is small. However, there are other, more efficient ways to calculate.
As long as none of the success probabilities are equal to one, one can calculate the probability of k successes using the recursive formula
where
The recursive formula is not numerically stable, and should be avoided if is greater than approximately 20. Another possibility is using the discrete Fourier transform.
where and.
Still other methods are described in

Entropy

There is no simple formula for the entropy of a Poisson binomial distribution, but the entropy is bounded above by the entropy of a binomial distribution with the same number parameter and the same mean. Therefore, the entropy is also bounded above by the entropy of a Poisson distribution with the same mean.
The Shepp–Olkin concavity conjecture, due to Lawrence Shepp and Ingram Olkin in 1981, states that the entropy of a Poisson binomial distribution is a concave function of the success probabilities. This conjecture was proved by Erwan Hillion and Oliver Johnson in 2015. The Shepp-Olkin monotonicity conjecture, also from the same 1981 paper, is that the entropy is monotone increasing in, if all. This conjecture was also proved by Hillion and Johnson, in 2019

Chernoff bound

The probability that a Poisson binomial distribution gets large, can be bounded using its moment generating function as follows :
where we took. This is similar to the tail bounds of a binomial distribution.