Abstract

Existing image classification datasets used in computer vision tend to have a uniform distribution of images across object categories. In contrast, the natural world is heavily imbalanced, as some species are more abundant and easier to photograph than others. To encourage further progress in challenging real world conditions we present the iNaturalist species classification and detection dataset, consisting of 859,000 images from over 5,000 different species of plants and animals. It features visually similar species, captured in a wide variety of situations, from all over the world. Images were collected with different camera types, have varying image quality, feature a large class imbalance, and have been verified by multiple citizen scientists. We discuss the collection of the dataset and present extensive baseline experiments using state-of-the-art computer vision classification and detection models. Results show that current non-ensemble based methods achieve only 67% top one classification accuracy, illustrating the difficulty of the dataset. Specifically, we observe poor results for classes with small numbers of training examples suggesting more attention is needed in low-shot learning.

Keywords

Computer scienceArtificial intelligenceFeature (linguistics)Variety (cybernetics)Pattern recognition (psychology)Contextual image classificationObject (grammar)Object detectionClass (philosophy)Contrast (vision)Shot (pellet)Feature extractionImage (mathematics)Training setMachine learningComputer vision

Affiliated Institutions

Related Publications

Publication Info

Year
2018
Type
article
Citations
1291
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1291
OpenAlex

Cite This

Grant Van Horn, Oisin Mac Aodha, Yang Song et al. (2018). The iNaturalist Species Classification and Detection Dataset. . https://doi.org/10.1109/cvpr.2018.00914

Identifiers

DOI
10.1109/cvpr.2018.00914