Bray–Curtis dissimilarity


In ecology and biology, the Bray–Curtis dissimilarity, named after J. Roger Bray and John T. Curtis, is a statistic used to quantify the compositional dissimilarity between two different sites, based on counts at each site. As defined by Bray and Curtis, the index of dissimilarity is:
Where is the sum of the lesser values for only those species in common between both sites. and are the total number of specimens counted at both sites. The index can be simplified to 1-2C/2 = 1-C when the abundances at each site are expressed as proportions, though the two forms of the equation only produce matching results when the total number of specimens counted at both sites are the same. Further treatment can be found in Legendre & Legendre.
The Bray–Curtis dissimilarity is directly related to the quantitative Sørensen similarity index between the same sites:
The Bray–Curtis dissimilarity is bounded between 0 and 1, where 0 means the two sites have the same composition, and 1 means the two sites do not share any species. At sites with where BC is intermediate this index differs from other commonly used indices.
The Bray–Curtis dissimilarity is often erroneously called a distance. It is not a distance since it does not satisfy triangle inequality, and should always be called a dissimilarity to avoid confusion.
Bray–Curtis and Jaccard indices are rank-order similar, but, Jaccard index is metric, and probably should be preferred instead of the default Bray-Curtis which is semimetric.