proved this theorem for the case of a deterministic function in 1930; Aleksandr Khinchin later formulated an analogous result for stationary stochastic processes and published that probabilistic analogue in 1934. Albert Einstein explained, without proofs, the idea in a brief two-page memo in 1914.
For continuous time, the Wiener–Khinchin theorem says that if is a wide-sense stationary process whose autocorrelation function defined in terms of statistical expected value, , exists and is finite at every lag, then there exists a monotone function in the frequency domain such that where the integral is a Riemann–Stieltjes integral. This is a kind of spectral decomposition of the auto-correlation function. F is called the power spectral distribution function and is a statistical distribution function. It is sometimes called the integrated spectrum. The Fourier transform of does not exist in general, because stationary random functions are not generally either square-integrable or absolutely integrable. Nor is assumed to be absolutely integrable, so it need not have a Fourier transform either. But if is absolutely continuous, for example, if the process is purely indeterministic, then is differentiable almost everywhere. In this case, one can define, the power spectral density of, by taking the averaged derivative of. Because the left and right derivatives of exist everywhere, we can put everywhere,, and the theorem simplifies to If now one assumes that r and S satisfy the necessary conditions for Fourier inversion to be valid, the Wiener–Khinchin theorem takes the simple form of saying that r and S are a Fourier-transform pair, and
The case of a discrete-time process
For the discrete-time case, the power spectral density of the function with discrete values is where is the discrete autocorrelation function of, provided this is absolutely integrable. Being a sampled and discrete-time sequence, the spectral density is periodic in the frequency domain. This is due to the problem of aliasing: the contribution of any frequency higher than the Nyquist frequency seems to be equal to that of its alias between 0 and 1. For this reason, the domain of the function is usually restricted to lie between 0 and 1 or between −0.5 and 0.5.
Application
The theorem is useful for analyzing linear time-invariant systems when the inputs and outputs are not square-integrable, so their Fourier transforms do not exist. A corollary is that the Fourier transform of the autocorrelation function of the output of an LTI system is equal to the product of the Fourier transform of the autocorrelation function of the input of the system times the squared magnitude of the Fourier transform of the system impulse response. This works even when the Fourier transforms of the input and output signals do not exist because these signals are not square-integrable, so the system inputs and outputs cannot be directly related by the Fourier transform of the impulse response. Since the Fourier transform of the autocorrelation function of a signal is the power spectrum of the signal, this corollary is equivalent to saying that the power spectrum of the output is equal to the power spectrum of the input times the energy transfer function. This corollary is used in the parametric method for power spectrum estimation.
Discrepancies in terminology
In many textbooks and in much of the technical literature it is tacitly assumed that Fourier inversion of the autocorrelation function and the power spectral density is valid, and the Wiener–Khinchin theorem is stated, very simply, as if it said that the Fourier transform of the autocorrelation function was equal to the power spectral density, ignoring all questions of convergence. But the theorem was applied by Norbert Wiener and Aleksandr Khinchin to the sample functions of wide-sense-stationary random processes, signals whose Fourier transforms do not exist. The whole point of Wiener's contribution was to make sense of the spectral decomposition of the autocorrelation function of a sample function of a wide-sense-stationary random process even when the integrals for the Fourier transform and Fourier inversion do not make sense. Further complicating the issue is that the discrete Fourier transform always exists for digital, finite-length sequences, meaning that the theorem can be blindly applied to calculate auto-correlations of numerical sequences. As mentioned earlier, the relation of this discrete sampled data to a mathematical model is often misleading, and related errors can show up as a divergence when the sequence length is modified. Some authors refer to as the autocovariance function. They then proceed to normalise it, by dividing by, to obtain what they refer to as the autocorrelation function.