Confidence distribution

In statistical inference, a confidence distribution (CD) is often loosely described as a distribution function on the parameter space that can represent confidence intervals of all levels for a parameter of interest. Historically, it has typically been constructed by inverting the upper limits of lower-sided confidence intervals of all levels, and it was commonly associated with a fiducial interpretation, although it is a purely frequentist concept. A confidence distribution is not a probability distribution function of the parameter of interest, but it may still be a useful function for making inferences.
In recent years, there has been a surge of renewed interest in confidence distributions. In the more recent developments, the concept of confidence distribution has emerged as a purely frequentist concept, without any fiducial interpretation or reasoning. Conceptually, a confidence distribution is no different from a point estimator or an interval estimator, but it uses a sample-dependent distribution function on the parameter space to estimate the parameter of interest.
A simple example of a confidence distribution that has been broadly used in statistical practice is a bootstrap distribution. The development and interpretation of a bootstrap distribution does not involve any fiducial reasoning; the same is true for the concept of a confidence distribution. But the notion of a confidence distribution is much broader than that of a bootstrap distribution. In particular, recent research suggests that it encompasses and unifies a wide range of examples, from regular parametric cases to bootstrap distributions, p-value functions, normalized likelihood functions and, in some cases, Bayesian priors and Bayesian posteriors.
Just as a Bayesian posterior distribution contains a wealth of information for any type of Bayesian inference, a confidence distribution contains a wealth of information for constructing almost all types of frequentist inferences, including point estimates, confidence intervals and p-values, among others. Some recent developments have highlighted the potential of the CD concept as an effective inferential tool.

History of the CD concept

Neyman introduced the idea of "confidence" in his seminal paper on confidence intervals, which clarified the frequentist repetition property. According to Fraser, the seed of the confidence distribution can even be traced back to Bayes and Fisher. Some researchers view the confidence distribution as "the Neymanian interpretation of Fisher's fiducial distributions", which was "furiously disputed by Fisher". It is also believed that these "unproductive disputes" and Fisher's "stubborn insistence" might be the reason that the concept of a confidence distribution has long been misconstrued as a fiducial concept and has not been fully developed under the frequentist framework. Indeed, the confidence distribution is a purely frequentist concept with a purely frequentist interpretation, although it also has ties to Bayesian inference concepts and the fiducial arguments.

Definition

Classical definition

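Classically, a confidence distribution is defined by inverting the upper limits of a series of lower-sided confidence intervals. In particular,
for every $\alpha \in (0, 1)$, let $(-\infty, \xi_n(\alpha)]$ be a $100\alpha\%$ lower-sided confidence interval for θ, where $\xi_n(\alpha) = \xi_n(X_n, \alpha)$ is continuous and increasing in α for each sample $X_n$. Then $H_n(\cdot) = \xi_n^{-1}(\cdot)$ is a confidence distribution for θ.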
Efron stated that this distribution "assigns probability 0.05 to θ lying between the upper endpoints of the 0.90 and 0.95 confidence interval, etc." and "it has powerful intuitive appeal".
In the classical literature, the confidence distribution function is interpreted as a distribution function of the parameter θ, which is impossible unless fiducial reasoning is involved since, in a frequentist setting, the parameters are fixed and nonrandom.
Interpreting the CD function entirely from a frequentist viewpoint, rather than as a distribution function of a parameter, is one of the major departures of recent developments from the classical approach. An advantage of treating confidence distributions as a purely frequentist concept is that they are freed from the restrictive, if not controversial, constraints that Fisher set on fiducial distributions.
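The inversion described in this section can be sketched numerically for the simplest case, a normal mean with known standard deviation. This is a minimal illustration, not a general construction: the function names `upper_limit` and `classical_cd` and the summary statistics are hypothetical, and the upper endpoint is taken to be the textbook limit x̄ + z_α σ/√n.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution N(0, 1)


def upper_limit(alpha, xbar, sigma, n):
    """Upper endpoint xi_n(alpha) of the lower-sided interval (-inf, xi_n(alpha)]."""
    return xbar + STD_NORMAL.inv_cdf(alpha) * sigma / sqrt(n)


def classical_cd(theta, xbar, sigma, n):
    """H_n(theta): the inverse of alpha -> xi_n(alpha), written in closed form."""
    return STD_NORMAL.cdf(sqrt(n) * (theta - xbar) / sigma)


# Hypothetical summary statistics, chosen only for illustration.
xbar, sigma, n = 2.0, 1.5, 25

# Evaluating H_n at the upper endpoint xi_n(alpha) should return alpha.
for alpha in (0.05, 0.50, 0.90, 0.95):
    xi = upper_limit(alpha, xbar, sigma, n)
    print(f"alpha={alpha:.2f}  xi_n={xi:.4f}  H_n(xi_n)={classical_cd(xi, xbar, sigma, n):.4f}")

# Mass that H_n places between the upper endpoints of the 0.90 and 0.95 intervals.
mass = (classical_cd(upper_limit(0.95, xbar, sigma, n), xbar, sigma, n)
        - classical_cd(upper_limit(0.90, xbar, sigma, n), xbar, sigma, n))
print(f"mass between the 0.90 and 0.95 endpoints: {mass:.4f}")
```

The last quantity is the one in Efron's description: the CD assigns 0.05 to the region between the two upper endpoints, without any claim that θ itself is random.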

The modern definition

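The following definition applies; Θ is the parameter space of the unknown parameter of interest θ, and χ is the sample space corresponding to data $X_n = \{X_1, \ldots, X_n\}$:
A function $H_n(\cdot) = H_n(X_n, \cdot)$ on $\chi \times \Theta \to [0, 1]$ is called a confidence distribution (CD) for a parameter θ if it satisfies two requirements:
(R1) For each given $X_n \in \chi$, $H_n(\cdot) = H_n(X_n, \cdot)$ is a continuous cumulative distribution function on Θ;
(R2) At the true parameter value $\theta = \theta_0$, $H_n(\theta_0) \equiv H_n(X_n, \theta_0)$, as a function of the sample $X_n$, follows the uniform distribution $U[0, 1]$.
Also, the function H is an asymptotic CD (aCD) if the $U[0, 1]$ requirement holds only asymptotically and the continuity requirement on $H_n(\cdot)$ is dropped.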
In nontechnical terms, a confidence distribution is a function of both the parameter and the random sample, with two requirements. The first requirement is simply that a CD should be a distribution on the parameter space. The second requirement restricts the function so that inferences based on the confidence distribution have the desired frequentist properties. This is similar to the restrictions imposed in point estimation to ensure certain desired properties, such as unbiasedness, consistency and efficiency.
A confidence distribution derived by inverting the upper limits of confidence intervals also satisfies the requirements in the above definition and this version of the definition is consistent with the classical definition.
Unlike in classical fiducial inference, more than one confidence distribution may be available to estimate a parameter under any specific setting. Also unlike in classical fiducial inference, optimality is not part of the requirement. Depending on the setting and the criterion used, sometimes there is a unique "best" confidence distribution. But sometimes there is no optimal confidence distribution available or, in some extreme cases, we may not even be able to find a meaningful confidence distribution. This is no different from the practice of point estimation.
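The second requirement, that the CD evaluated at the true parameter value behaves like a uniform random variable over repeated samples, can be checked by simulation. The sketch below assumes a normal sample with known standard deviation and uses the function H(μ) = Φ(√n(μ − x̄)/σ); the true mean, the sample size, the number of replications and the name `h_phi` are all hypothetical choices made for illustration.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean

STD_NORMAL = NormalDist()


def h_phi(mu, sample, sigma):
    """Candidate CD for a normal mean with known sigma, evaluated at mu."""
    return STD_NORMAL.cdf(sqrt(len(sample)) * (mu - fmean(sample)) / sigma)


rng = random.Random(12345)               # fixed seed so the run is repeatable
mu0, sigma, n, reps = 1.0, 2.0, 10, 5000  # hypothetical simulation settings

# Draw many samples and evaluate the CD at the true value mu0 each time.
u = []
for _ in range(reps):
    sample = [rng.gauss(mu0, sigma) for _ in range(n)]
    u.append(h_phi(mu0, sample, sigma))

# If H_n(mu0) ~ U[0, 1], the mean is near 0.5 and P(U <= q) is near q.
mean_u = fmean(u)
frac_below = {q: sum(v <= q for v in u) / reps for q in (0.1, 0.5, 0.9)}
print(f"mean of H_n(mu0): {mean_u:.3f}")
for q, frac in frac_below.items():
    print(f"fraction with H_n(mu0) <= {q}: {frac:.3f}")
```

Evaluating the same function at a value other than the true mean would give a visibly non-uniform result, which is why the requirement is stated at θ₀ only.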

Examples

Example 1: Normal mean and variance

Suppose a normal sample $X_i \sim N(\mu, \sigma^2)$, i = 1, 2, ..., n, is given.
Variance σ² is known
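Let Φ be the cumulative distribution function of the standard normal distribution, and $F_{t_{n-1}}$ the cumulative distribution function of the Student $t_{n-1}$ distribution. Write $\bar{X}$ for the sample mean and $s^2$ for the sample variance. Both functions $H_\Phi(\mu)$ and $H_t(\mu)$, given by
$$H_\Phi(\mu) = \Phi\left(\frac{\sqrt{n}(\mu - \bar{X})}{\sigma}\right), \quad H_t(\mu) = F_{t_{n-1}}\left(\frac{\sqrt{n}(\mu - \bar{X})}{s}\right),$$
satisfy the two requirements in the CD definition, and they are confidence distribution functions for μ. Furthermore,
$$H_A(\mu) = \Phi\left(\frac{\sqrt{n}(\mu - \bar{X})}{s}\right)$$
satisfies the definition of an asymptotic confidence distribution when n→∞, and it is an asymptotic confidence distribution for μ. Using $H_\Phi(\mu)$ and $H_A(\mu)$ is equivalent to using $N(\bar{X}, \sigma^2/n)$ and $N(\bar{X}, s^2/n)$, respectively, to estimate μ.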
Variance σ² is unknown
For the parameter μ, since $H_\Phi(\mu)$ involves the unknown parameter σ, it violates the two requirements in the CD definition and is no longer a "distribution estimator" or a confidence distribution for μ. However, $H_t(\mu)$ is still a CD for μ and $H_A(\mu)$ is an aCD for μ.
For the parameter σ², the sample-dependent cumulative distribution function
$$H_{\chi^2}(\theta) = 1 - F_{\chi^2_{n-1}}\left(\frac{(n-1)s^2}{\theta}\right)$$
is a confidence distribution function for σ². Here, $F_{\chi^2_{n-1}}$ is the cumulative distribution function of the $\chi^2_{n-1}$ distribution.
In the case when the variance σ² is known, $H_\Phi(\mu)$ is optimal in terms of producing the shortest confidence intervals at any given level. In the case when the variance σ² is unknown, $H_t(\mu)$ is an optimal confidence distribution for μ.
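As a rough illustration of this example, the two normal-based functions for μ can be evaluated on a small data set using only the Python standard library. The sample values, the value of σ and the names `h_phi` and `h_a` are hypothetical. The Student t and chi-square versions are left out because the standard library has no CDF for those distributions; a library such as SciPy (`scipy.stats.t.cdf`, `scipy.stats.chi2.cdf`) would be needed for them.

```python
from math import sqrt
from statistics import NormalDist, fmean, stdev

STD_NORMAL = NormalDist()


def h_phi(mu, sample, sigma):
    """CD for mu when sigma is known: Phi(sqrt(n) * (mu - xbar) / sigma)."""
    return STD_NORMAL.cdf(sqrt(len(sample)) * (mu - fmean(sample)) / sigma)


def h_a(mu, sample):
    """Asymptotic CD for mu: Phi(sqrt(n) * (mu - xbar) / s), s = sample std. dev."""
    return STD_NORMAL.cdf(sqrt(len(sample)) * (mu - fmean(sample)) / stdev(sample))


# Hypothetical data and known sigma, used only for illustration.
sample = [4.1, 5.3, 4.8, 5.9, 5.1, 4.6, 5.4, 4.9]
sigma = 0.6
xbar, n = fmean(sample), len(sample)

# Viewed as a function of mu, the known-sigma CD is the CDF of N(xbar, sigma^2 / n).
as_normal = NormalDist(mu=xbar, sigma=sigma / sqrt(n))

grid = (4.6, 4.8, 5.0, 5.2, 5.4)
for mu in grid:
    print(f"mu={mu:.1f}  known-sigma CD={h_phi(mu, sample, sigma):.4f}  "
          f"N(xbar, sigma^2/n) cdf={as_normal.cdf(mu):.4f}  "
          f"asymptotic CD={h_a(mu, sample):.4f}")
```

The two middle columns agree, which is the sense in which using the known-variance CD amounts to estimating μ with a normal distribution centred at the sample mean.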

Example 2: Bivariate normal correlation

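Let ρ denote the correlation coefficient of a bivariate normal population. It is well known that Fisher's z, defined by the Fisher transformation
$$z = \frac{1}{2}\ln\frac{1+r}{1-r},$$
has the limiting distribution $N\left(\frac{1}{2}\ln\frac{1+\rho}{1-\rho}, \frac{1}{n-3}\right)$ with a fast rate of convergence, where r is the sample correlation and n is the sample size.
The function
$$H_n(\rho) = 1 - \Phi\left(\sqrt{n-3}\left(\frac{1}{2}\ln\frac{1+r}{1-r} - \frac{1}{2}\ln\frac{1+\rho}{1-\rho}\right)\right)$$
is an asymptotic confidence distribution for ρ.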
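A minimal sketch of this asymptotic CD follows, using the identity atanh(x) = ½ ln((1 + x)/(1 − x)) for the Fisher transformation. The function name `fisher_z_acd` and the values of r and n are hypothetical. The check at the end compares the CD with the usual Fisher-z interval endpoints, which should sit at the 2.5% and 97.5% points of the CD.

```python
from math import atanh, sqrt, tanh
from statistics import NormalDist

STD_NORMAL = NormalDist()


def fisher_z_acd(rho, r, n):
    """Asymptotic CD for rho: 1 - Phi(sqrt(n - 3) * (atanh(r) - atanh(rho)))."""
    if not -1.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between -1 and 1")
    return 1.0 - STD_NORMAL.cdf(sqrt(n - 3) * (atanh(r) - atanh(rho)))


# Hypothetical sample correlation and sample size.
r, n = 0.6, 30

# Endpoints of the standard 95% Fisher-z interval, mapped back with tanh.
z_crit = STD_NORMAL.inv_cdf(0.975)
rho_lo = tanh(atanh(r) - z_crit / sqrt(n - 3))
rho_hi = tanh(atanh(r) + z_crit / sqrt(n - 3))

print(f"CD at rho = r:      {fisher_z_acd(r, r, n):.4f}")
print(f"CD at lower limit:  {fisher_z_acd(rho_lo, r, n):.4f}")
print(f"CD at upper limit:  {fisher_z_acd(rho_hi, r, n):.4f}")
```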

Using CD to make inference

Confidence interval

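From the CD definition, it is evident that the intervals $(-\infty, H_n^{-1}(1-\alpha)]$, $[H_n^{-1}(\alpha), \infty)$ and $[H_n^{-1}(\alpha/2), H_n^{-1}(1-\alpha/2)]$ provide 100(1 − α)%-level confidence intervals of different kinds for θ, for any α ∈ (0, 1). Also, $[H_n^{-1}(\alpha_1), H_n^{-1}(1-\alpha_2)]$ is a level 100(1 − α₁ − α₂)% confidence interval for the parameter θ for any α₁ > 0, α₂ > 0 and α₁ + α₂ < 1. Here, $H_n^{-1}(\beta)$ is the 100β% quantile of $H_n(\theta)$; that is, it solves for θ in the equation $H_n(\theta) = \beta$. The same holds for an aCD, where the confidence level is achieved in the limit.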
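Because each interval above only needs quantiles of the CD, a generic routine that solves H_n(θ) = β numerically is enough to produce all of them. The sketch below uses bisection and assumes the CD is continuous and increasing on a bracketing range supplied by the caller; the helper names `cd_quantile` and `cd_interval`, the bracketing range and the summary statistics are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def cd_quantile(cd, beta, lo, hi, tol=1e-10):
    """Solve cd(theta) = beta by bisection on [lo, hi] (cd assumed increasing)."""
    if not cd(lo) <= beta <= cd(hi):
        raise ValueError("beta is not bracketed by cd(lo) and cd(hi)")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cd(mid) < beta:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def cd_interval(cd, alpha1, alpha2, lo, hi):
    """Interval between the alpha1 and (1 - alpha2) quantiles of the CD."""
    return cd_quantile(cd, alpha1, lo, hi), cd_quantile(cd, 1.0 - alpha2, lo, hi)


# Hypothetical setting: normal mean with known sigma.
xbar, sigma, n = 10.0, 2.0, 16


def cd(theta):
    return STD_NORMAL.cdf(sqrt(n) * (theta - xbar) / sigma)


lower, upper = cd_interval(cd, 0.025, 0.025, 0.0, 20.0)   # equal-tailed 95% interval
one_sided_upper = cd_quantile(cd, 0.95, 0.0, 20.0)        # (-inf, H^{-1}(0.95)]
print(f"95% two-sided interval: [{lower:.4f}, {upper:.4f}]")
print(f"95% upper confidence limit: {one_sided_upper:.4f}")
```

For this particular CD the quantiles are also available in closed form as x̄ + z_β σ/√n, which gives a simple way to check the numerical inversion.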

Point estimation

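Point estimators can also be constructed given a confidence distribution estimator for the parameter of interest. For example, given $H_n(\theta)$, the CD for a parameter θ, natural choices of point estimators include the median $M_n = H_n^{-1}(1/2)$, the mean $\bar{\theta}_n = \int_{-\infty}^{\infty} t \, \mathrm{d}H_n(t)$, and the maximum point of the CD density
$$\hat{\theta}_n = \arg\max_\theta h_n(\theta), \quad h_n(\theta) = H_n'(\theta).$$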
Under some modest conditions, among other properties, one can prove that these point estimators are all consistent.
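The three point estimators can be approximated for any CD that can be evaluated on a grid. The sketch below is one possible discretisation, assuming the CD is continuous and increasing and that the chosen range holds almost all of its mass; the function name `cd_point_estimates`, the grid size and the normal CD used to try it out are hypothetical.

```python
from statistics import NormalDist


def cd_point_estimates(cd, lo, hi, steps=4000):
    """Approximate the median, mean and mode of a CD on a grid over [lo, hi]."""
    width = (hi - lo) / steps
    grid = [lo + i * width for i in range(steps + 1)]
    values = [cd(t) for t in grid]

    # Median: first grid point at which the CD reaches 1/2.
    median = next(t for t, v in zip(grid, values) if v >= 0.5)

    # Mean: Riemann-Stieltjes sum of t dH_n(t), using cell midpoints.
    increments = [values[i + 1] - values[i] for i in range(steps)]
    midpoints = [grid[i] + width / 2 for i in range(steps)]
    mean = sum(m * d for m, d in zip(midpoints, increments))

    # Mode: midpoint of the cell with the largest increment,
    # i.e. the maximiser of a finite-difference approximation to the CD density.
    mode = midpoints[max(range(steps), key=increments.__getitem__)]
    return median, mean, mode


# Hypothetical CD: N(xbar, se^2) with xbar = 3.0 and standard error 0.4.
xbar, se = 3.0, 0.4
normal_cd = NormalDist(mu=xbar, sigma=se).cdf

median, mean, mode = cd_point_estimates(normal_cd, xbar - 8 * se, xbar + 8 * se)
print(f"median={median:.3f}  mean={mean:.3f}  mode={mode:.3f}")
```

For a symmetric CD such as this one the three estimators coincide up to grid resolution; for an asymmetric CD they generally differ.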

Hypothesis testing

One can derive a p-value for a test, either one-sided or two-sided, concerning the parameter θ, from its confidence distribution $H_n(\theta)$. Denote by $p_s(C) = H_n(C) = \int_C \mathrm{d}H_n(\theta)$ the probability mass of a set C under the confidence distribution function. This $p_s(C)$ is called "support" in CD inference and is also known as "belief" in the fiducial literature. We have:
For the one-sided test K₀: θ ∈ C vs. K₁: θ ∈ C^c, where C is of the type (−∞, b] or [b, ∞), one can show from the CD definition that $\sup_{\theta \in C} P_\theta(p_s(C) \le \alpha) = \alpha$. Thus, $p_s(C) = H_n(C)$ is the corresponding p-value of the test.
For the singleton test K₀: θ = b vs. K₁: θ ≠ b, $P_{\{K_0:\, \theta = b\}}(2\min\{p_s(C_{lo}), p_s(C_{up})\} \le \alpha) = \alpha$. Thus, $2\min\{p_s(C_{lo}), p_s(C_{up})\} = 2\min\{H_n(b), 1 - H_n(b)\}$ is the corresponding p-value of the test. Here, $C_{lo} = (-\infty, b]$ and $C_{up} = [b, \infty)$.
See Figure 1 from Xie and Singh for a graphical illustration of the CD inference.
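The p-value rules in this section reduce to evaluating the CD at the boundary point b. The following sketch applies them to a normal mean with known standard deviation and compares the results with the familiar z-test p-values; the function names, the `side` argument and the summary statistics are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def one_sided_p_value(cd, b, side="lower"):
    """Support of C = (-inf, b] (side='lower') or C = [b, inf) (side='upper')."""
    if side == "lower":
        return cd(b)
    if side == "upper":
        return 1.0 - cd(b)
    raise ValueError("side must be 'lower' or 'upper'")


def two_sided_p_value(cd, b):
    """p-value for K0: theta = b, computed as 2 * min(H_n(b), 1 - H_n(b))."""
    return 2.0 * min(cd(b), 1.0 - cd(b))


# Hypothetical setting: normal mean with known sigma.
xbar, sigma, n = 0.5, 1.0, 25


def cd(theta):
    return STD_NORMAL.cdf(sqrt(n) * (theta - xbar) / sigma)


b = 0.0
p_lower = one_sided_p_value(cd, b, side="lower")   # K0: theta <= 0
p_two = two_sided_p_value(cd, b)                    # K0: theta = 0

# Classical z-test p-values for the same data, for comparison.
z = sqrt(n) * (xbar - b) / sigma
p_z_one_sided = 1.0 - STD_NORMAL.cdf(z)
p_z_two_sided = 2.0 * (1.0 - STD_NORMAL.cdf(abs(z)))

print(f"one-sided: CD {p_lower:.5f}  z-test {p_z_one_sided:.5f}")
print(f"two-sided: CD {p_two:.5f}  z-test {p_z_two_sided:.5f}")
```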

Implementations

A few statistical programs have implemented the ability to construct and graph confidence distributions.
R, via the concurve, pvaluefunctions, and episheet packages
Excel, via episheet
Stata, via concurve
