Minimax estimator

In statistical decision theory, where we are faced with the problem of estimating a deterministic parameter from observations an estimator is called minimax if its maximal risk is minimal among all estimators of. In a sense this means that is an estimator which performs best in the worst possible case allowed in the problem.

Problem setup

Consider the problem of estimating a deterministic parameter from noisy or corrupt data related through the conditional probability distribution. Our goal is to find a "good" estimator for estimating the parameter, which minimizes some given risk function. Here the risk function is the expectation of some loss function with respect to. A popular example for a loss function is the squared error loss, and the risk function for this loss is the mean squared error.
Unfortunately, in general, the risk cannot be minimized since it depends on the unknown parameter itself. Therefore additional criteria for finding an optimal estimator in some sense are required. One such criterion is the minimax criterion.

Definition

Definition : An estimator is called minimax with respect to a risk function if it achieves the smallest maximum risk among all estimators, meaning it satisfies

Least favorable distribution

Logically, an estimator is minimax when it is the best in the worst case. Continuing this logic, a minimax estimator should be a Bayes estimator with respect to a least favorable prior distribution of. To demonstrate this notion denote the average risk of the Bayes estimator with respect to a prior distribution as
Definition: A prior distribution is called least favorable if for every other distribution the average risk satisfies.
Theorem 1: If then:

is minimax.
If is a unique Bayes estimator, it is also the unique minimax estimator.
is least favorable.

Corollary: If a Bayes estimator has constant risk, it is minimax. Note that this is not a necessary condition.
Example 1: Unfair coin: Consider the problem of estimating the "success" rate of a binomial variable,. This may be viewed as estimating the rate at which an unfair coin falls on "heads" or "tails". In this case the Bayes estimator with respect to a Beta-distributed prior, is
with constant Bayes risk
and, according to the Corollary, is minimax.
Definition: A sequence of prior distributions is called least favorable if for any other distribution,
Theorem 2: If there are a sequence of priors and an estimator such that
, then :

is minimax.
The sequence is least favorable.

Notice that no uniqueness is guaranteed here. For example, the ML estimator from the previous example may be attained as the limit of Bayes estimators with respect to a uniform prior, with increasing support and also with respect to a zero-mean normal prior with increasing variance. So neither the resulting ML estimator is unique minimax nor the least favorable prior is unique.
Example 2: Consider the problem of estimating the mean of dimensional Gaussian random vector,. The maximum likelihood estimator for in this case is simply, and its risk is
The risk is constant, but the ML estimator is actually not a Bayes estimator, so the Corollary of Theorem 1 does not apply. However, the ML estimator is the limit of the Bayes estimators with respect to the prior sequence, and, hence, indeed minimax according to Theorem 2. Nonetheless, minimaxity does not always imply admissibility. In fact in this example, the ML estimator is known to be inadmissible whenever. The famous James–Stein estimator dominates the ML whenever. Though both estimators have the same risk when, and they are both minimax, the James–Stein estimator has smaller risk for any finite. This fact is illustrated in the following figure.

Some examples

In general, it is difficult, often even impossible to determine the minimax estimator. Nonetheless, in many cases, a minimax estimator has been determined.
Example 3: Bounded normal mean: When estimating the mean of a normal vector, where it is known that. The Bayes estimator with respect to a prior which is uniformly distributed on the edge of the bounding sphere is known to be minimax whenever. The analytical expression for this estimator is
where, is the modified Bessel function of the first kind of order n.

Asymptotic minimax estimator

The difficulty of determining the exact minimax estimator has motivated the study of estimators of asymptotic minimax – an estimator is called -asymptotic minimax if
For many estimation problems, especially in the non-parametric estimation setting, various approximate minimax estimators have been established. The design of the approximate minimax estimator is intimately related to the geometry, such as the metric entropy number, of.

Randomised minimax estimator

Sometimes, a minimax estimator may take the form of a randomised decision rule. An example is shown on the left. The parameter space has just two elements and each point on the graph corresponds to the risk of a decision rule: the x-coordinate is the risk when the parameter is and the y-coordinate is the risk when the parameter is. In this decision problem, the minimax estimator lies on a line segment connecting two deterministic estimators. Choosing with probability and with probability minimises the supremum risk.

Relationship to robust optimization

is an approach to solve optimization problems under uncertainty in the knowledge of underlying parameters,. For instance, the MMSE Bayesian estimation of a parameter requires the knowledge of parameter correlation function. If the knowledge of this correlation function is not perfectly available, a popular minimax robust optimization approach is to define a set characterizing the uncertainty about the correlation function, and then pursuing a minimax optimization over the uncertainty set and the estimator respectively. Similar minimax optimizations can be pursued to make estimators robust to certain imprecisely known parameters. For instance, a recent study dealing with such techniques in the area of signal processing can be found in.
In R. Fandom Noubiap and W. Seidel an algorithm for calculating a Gamma-minimax decision rule has been developed, when Gamma is given by a finite number of generalized moment conditions. Such a decision rule minimizes the maximum of the integrals of the risk function with respect to all distributions in Gamma. Gamma-minimax decision rules are of interest in robustness studies in Bayesian statistics.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...