Fan chart (statistics)


A fan chart is made of a group of dispersion fan diagrams,
which may be positioned according to two categorising dimensions.
A dispersion fan diagram is a circular diagram which
reports the same information about a dispersion as a box plot:
namely median, quartiles, and two extreme values.

Elements

The elements of a dispersion fan diagram
are:
  1. a circular line as scale
  2. a diameter which indicates the median
  3. a fan which indicates the quartiles
  4. two feathers which indicate the extreme values.
The scale on the circular line begins at the left
with the starting value.
The following values are applicated clockwise.
The white tail of diameter indicates the median.
The dark fan indicates the dispersion of the middle half of the observed
values; thus it encompasses the values from the first to the third quartile.
The white feathers indicate the dispersion of the middle 90% of the
observed values.
The length of the white part of the diameter corresponds with the number
of observations.

Application

A fan chart gives a quick summary of observed values which depend from two variables. This is possible thanks of a dense representation
and a constant size which does not depend on the size of the single dispersion fan diagrams.
An essential advantage compared to a sequence of box plots
is the possibility to compare dispersion fan diagrams not only within one direction
but within two directions.

Example

The following example presents data from the data set MathAchieve
which is part of the R package
of José Pinheiro et al.
It contains mathematics achievement scores of 7185 students.
The students are categorised
according to sex and membership of a minority ethnic group.
The graphics show the mathematics achievement scores in dependency
on the socio-economic status of the students
and on the average socio-economic status of all students
at the same school.
The four graphic panels differentiate the students
according to sex and membership of a minority ethnic group.
The fan charts reveals clearly how the median value
is partially following a big main tendency
while the values of the single subgroups scatter largely
what could lead to doubts about a possible correlation.