Representer theorem

In statistical learning theory, a representer theorem is any of several related results stating that a minimizer of a regularized empirical risk functional defined over a reproducing kernel Hilbert space can be represented as a finite linear combination of kernel products evaluated on the input points in the training set data.

Formal statement

The following Representer Theorem and its proof are due to Schölkopf, Herbrich, and Smola:
Theorem: Consider a positive-definite real-valued kernel on a non-empty set with a corresponding reproducing kernel Hilbert space. Let there be given

a training sample,
a strictly increasing real-valued function, and
an arbitrary error function,

which together define the following regularized empirical risk functional on :
Then, any minimizer of the empirical risk
admits a representation of the form:
where for all.
Proof:
Define a mapping
. Since is a reproducing kernel, then
where is the inner product on.
Given any, one can use orthogonal projection to decompose any into a sum of two functions, one lying in, and the other lying in the orthogonal complement:
where for all.
The above orthogonal decomposition and the reproducing property together show that applying to any training point produces
which we observe is independent of. Consequently, the value of the error function in is likewise independent of. For the second term, since is orthogonal to and is strictly monotonic, we have
Therefore setting does not affect the first term of, while it strictly decreasing the second term. Consequently, any minimizer in must have, i.e., it must be of the form
which is the desired result.

Generalizations

The Theorem stated above is a particular example of a family of results that are collectively referred to as "representer theorems"; here we describe several such.
The first statement of a representer theorem was due to Kimeldorf and Wahba for the special case in which
for. Schölkopf, Herbrich, and Smola generalized this result by relaxing the assumption of the squared-loss cost and allowing the regularizer to be any strictly monotonically increasing function of the Hilbert space norm.
It is possible to generalize further by augmenting the regularized empirical risk functional through the addition of unpenalized offset terms. For example, Schölkopf, Herbrich, and Smola also consider the minimization
i.e., we consider functions of the form, where and is an unpenalized function lying in the span of a finite set of real-valued functions. Under the assumption that the matrix has rank, they show that the minimizer in
admits a representation of the form
where and the are all uniquely determined.
The conditions under which a representer theorem exists were investigated by Argyriou, Micchelli, and Pontil, who proved the following:
Theorem: Let be a nonempty set, a positive-definite real-valued kernel on with corresponding reproducing kernel Hilbert space, and let be a differentiable regularization function. Then given a training sample and an arbitrary error function, a minimizer
of the regularized empirical risk admits a representation of the form
where for all, if and only if there exists a nondecreasing function for which
Effectively, this result provides a necessary and sufficient condition on a differentiable regularizer under which the corresponding regularized empirical risk minimization will have a representer theorem. In particular, this shows that a broad class of regularized risk minimizations have representer theorems.

Applications

Representer theorems are useful from a practical standpoint because they dramatically simplify the regularized empirical risk minimization problem. In most interesting applications, the search domain for the minimization will be an infinite-dimensional subspace of, and therefore the search does not admit implementation on finite-memory and finite-precision computers. In contrast, the representation of afforded by a representer theorem reduces the original minimization problem to a search for the optimal -dimensional vector of coefficients ; can then be obtained by applying any standard function minimization algorithm. Consequently, representer theorems provide the theoretical basis for the reduction of the general machine learning problem to algorithms that can actually be implemented on computers in practice.
The following provides an example of how to solve for the minimizer whose existence is guaranteed by the representer theorem. This method works for any positive definite kernel, and allows us to transform a complicated optimization problem into a simple linear system that can be solved numerically.
Assume that we are using a least squares error function

and a regularization function
for some. By the representer theorem, the minimizer

has the form

for some. Noting that

we see that has the form

where and. This can be factored out and simplified to

Since is positive definite, there is indeed a single global minima for this expression. Let and note that is convex. Then, the global minima, can be solved by setting. Recalling that all positive definite matricies are invertible, we see that

so the minimizer may be found via a linear solve.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...