Taylor diagram


Taylor diagrams are mathematical diagrams designed to graphically indicate which of several approximate representations of a system, process, or phenomenon is most realistic. This diagram, invented by Karl E. Taylor in 1994 facilitates the comparative assessment of different models. It is used to quantify the degree of correspondence between the modeled and observed behavior in terms of three statistics: the Pearson correlation coefficient, the root-mean-square error error, and the standard deviation.
Although Taylor diagrams have primarily been used to evaluate models designed to study climate and other aspects of Earth's environment, they can be used for purposes unrelated to environmental science.
Taylor diagrams can be constructed with a number of different open source and commercial software packages, including: GrADS, IDL, MATLAB, NCL, Python, R, and UV-CDAT.

Sample diagram

The sample Taylor diagram shown in Figure 1 provides a summary of the relative skill with which several global climate models simulate the spatial pattern of annual mean precipitation. Eight models, each represented by a different letter on the diagram, are compared, and the distance between each model and the point labeled “observed” is a measure of how realistically each model reproduces observations. For each model, three statistics are plotted: the Pearson correlation coefficient is related to the azimuthal angle ; the centered RMS error in the simulated field is proportional to the distance from the point on the x-axis identified as “observed” ; and the standard deviation of the simulated pattern is proportional to the radial distance from the origin. It is evident from this diagram, for example, that for Model F the correlation coefficient is about 0.65, the RMS error is about 2.6 mm/day and the standard deviation is about 3.3 mm/day. Model F's standard deviation is clearly greater than the standard deviation of the observed field.
The relative merits of various models can be inferred from Figure 1. Simulated patterns that agree well with observations will lie nearest the point marked "observed" on the x-axis. These models have relatively high correlation and low RMS errors. Models lying on the dashed arc have the correct standard deviation. In Figure 1 it can be seen that models A and C generally agree best with observations, each with about the same RMS error. Model A, however, has a slightly higher correlation with observations and has the same standard deviation as the observed, whereas model C has too little spatial variability. Of the poorer performing models, model E has a low pattern correlation, while model D has variations that are much larger than observed, in both cases resulting in a relatively large centered RMS error in the precipitation fields. Although models D and B have about the same correlation with observations, model B simulates the amplitude of the variations much better than model D, resulting in a smaller RMS error.

Theoretical basis

Taylor diagrams display statistics useful for assessing the similarity of a variable simulated by a model to its observed counterpart. Mathematically, the three statistics displayed on a Taylor diagram are related by the following formula :
,
where ρ is the correlation coefficient between the test and reference fields, E′ is the centered RMS difference between the fields provides the key to forming the geometrical relationship between the four quantities that underlie the Taylor diagram. The standard deviation of the observed field is side a. The standard deviation of the simulated field is side b, the centered RMS difference between the two fields is side c, and the cosine of the angle between sides a and b is the correlation coefficient.
The means of the fields are subtracted out before computing their second-order statistics, so the diagram does not provide information about overall biases, but solely characterizes the centered pattern error.

Taylor diagram variants

Among the several minor variations on the diagram that have been suggested are :