Abstract

The classic Jaccard and Sørensen indices of compositional similarity (and other indices that depend upon the same variables) are notoriously sensitive to sample size, especially for assemblages with numerous rare species. Further, because these indices are based solely on presence–absence data, accurate estimators for them are unattainable. We provide a probabilistic derivation for the classic, incidence‐based forms of these indices and extend this approach to formulate new Jaccard‐type or Sørensen‐type indices based on species abundance data. We then propose estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence‐ or abundance‐based sample data. In sampling simulations, these new estimators prove to be considerably less biased than classic indices when a substantial proportion of species are missing from samples. Based on species‐rich empirical datasets, we show how incorporating the effect of unseen shared species not only increases accuracy but also can change the interpretation of results.
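As a rough illustration of the two families of indices the abstract contrasts, the sketch below computes the classic incidence-based Jaccard and Sørensen indices, a/(a+b+c) and 2a/(2a+b+c), alongside abundance-based counterparts. The abundance-based forms used here, UV/(U+V-UV) and 2UV/(U+V), where U and V are the total relative abundances of shared species in each sample, are an assumption about the paper's formulation, since the abstract itself does not state them. The function names and the toy counts are hypothetical, and no correction for unseen shared species is attempted, so this is not the estimator the authors propose.

```python
def classic_indices(sample1, sample2):
    """Incidence-based Jaccard and Sorensen indices from two {species: count} dicts."""
    present1 = {sp for sp, n in sample1.items() if n > 0}
    present2 = {sp for sp, n in sample2.items() if n > 0}
    a = len(present1 & present2)   # species observed in both samples
    b = len(present1 - present2)   # species observed only in sample 1
    c = len(present2 - present1)   # species observed only in sample 2
    jaccard = a / (a + b + c) if (a + b + c) else 0.0
    sorensen = 2 * a / (2 * a + b + c) if (2 * a + b + c) else 0.0
    return jaccard, sorensen


def abundance_indices(sample1, sample2):
    """Abundance-based Jaccard-type and Sorensen-type indices (assumed forms).

    U and V are the fractions of individuals in each sample that belong to
    species observed in both samples. Unseen shared species are ignored.
    """
    n1 = sum(sample1.values())
    n2 = sum(sample2.values())
    shared = {sp for sp in sample1
              if sample1[sp] > 0 and sample2.get(sp, 0) > 0}
    u = sum(sample1[sp] for sp in shared) / n1 if n1 else 0.0
    v = sum(sample2[sp] for sp in shared) / n2 if n2 else 0.0
    jaccard = u * v / (u + v - u * v) if (u + v) else 0.0
    sorensen = 2 * u * v / (u + v) if (u + v) else 0.0
    return jaccard, sorensen


# Hypothetical toy counts, for illustration only.
site_a = {"sp1": 10, "sp2": 5, "sp3": 1}
site_b = {"sp1": 8, "sp3": 2, "sp4": 6}

classic_j, classic_s = classic_indices(site_a, site_b)
abund_j, abund_s = abundance_indices(site_a, site_b)
print(f"classic:   Jaccard={classic_j:.3f}  Sorensen={classic_s:.3f}")
print(f"abundance: Jaccard={abund_j:.3f}  Sorensen={abund_s:.3f}")
```

In the toy data the two samples share two of four species, yet the shared species account for most individuals in each sample. The incidence-based indices ignore that information, while the abundance-based forms use it.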

Keywords

Jaccard index, Estimator, Abundance (ecology), Similarity (geometry), Statistics, Sample (material), Relative species abundance, Sampling (signal processing), Probabilistic logic, Sample size determination, Mathematics, Ecology, Econometrics, Computer science, Biology, Artificial intelligence, Cluster analysis

Publication Info

Year: 2004
Type: article
Volume: 8
Issue: 2
Pages: 148-159
Citations: 1877
Access: Closed

Cite This

Anne Chao, Robin L. Chazdon, Robert K. Colwell et al. (2004). A new statistical approach for assessing similarity of species composition with incidence and abundance data. Ecology Letters, 8(2), 148-159. https://doi.org/10.1111/j.1461-0248.2004.00707.x

Identifiers

DOI: 10.1111/j.1461-0248.2004.00707.x