In probability and statistics, an urn problem is an idealizedmental exercise in which some objects of real interest are represented as colored balls in an urn or other container. One pretends to remove one or more balls from the urn; the goal is to determine the probability of drawing one color or another, or some other properties. A number of important variations are described below. An urn model is either a set of probabilities that describe events within an urn problem, or it is a probability distribution, or a family of such distributions, of random variables associated with urn problems.
Basic urn model
In this basic urn model in probability theory, the urn contains x white and y black balls, well-mixed together. One ball is drawn randomly from the urn and its color observed; it is then placed back in the urn, and the selection process is repeated. Possible questions that can be answered in this model are:
Can I infer the proportion of white and black balls from n observations? With what degree of confidence?
Knowing x and y, what is the probability of drawing a specific sequence ?
If I only observe n balls, how sure can I be that there are no black balls?
beta-binomial distribution: as above, except that every time a ball is observed, an additional ball of the same color is added to the urn. Hence, the number of total marbles in the urn grows. SeePólya urn model.
multinomial distribution: the urn contains balls in more than two colors.
hypergeometric distribution: the balls are not returned to the urn once extracted. Hence, the number of total marbles in the urn decreases. This is referred to as "drawing without replacement", by opposition to "drawing with replacement".
multivariate hypergeometric distribution: as above, but with balls of more than two colors.
geometric distribution: number of draws before the first successful draw.
negative binomial distribution: number of draws before a certain number of failures occurs.
Statistical physics: derivation of energy and velocity distributions.
Pólya urn: each time a ball of a particular colour is drawn, it is replaced along with an additional ball of the same colour.
Hoppe urn: a Pólya urn with an additional ball called the mutator. When the mutator is drawn it is replaced along with an additional ball of an entirely new colour.
: the distribution of the number of occupied urns after the random assignment of k balls into n urns.
Historical remarks
In Ars Conjectandi, Jacob Bernoulli considered the problem of determining, given a number of pebbles drawn from an urn, the proportions of different colored pebbles within the urn. This problem was known as the inverse probability problem, and was a topic of research in the eighteenth century, attracting the attention of Abraham de Moivre and Thomas Bayes. Bernoulli used the Latin word ', which primarily means a clay vessel, but is also the term used in ancient Rome for a vessel of any kind for collecting ballots or lots; the present-day Italian word for ballot box is still '. Bernoulli's inspiration may have been lotteries, elections, or games of chance which involved drawing balls from a container, and it has been asserted that