Data processing inequality


The Data processing inequality is an information theoretic concept which states that the information content of a signal cannot be increased via a local physical operation. This can be expressed concisely as 'post-processing cannot increase information'.

Definition

Let three random variables form the Markov chain, implying that the conditional distribution of depends only on and is conditionally independent of. Specifically, we have such a Markov chain if the joint probability mass function can be written as
In this setting, no processing of Y, deterministic or random, can increase the information that Y contains about X. Using the mutual information, this can be written as :
With the equality if and only if, i.e. and contain the same information about, and also forms a Markov chain.