Abstract

Abstract The National Center for Biotechnology Information (NCBI) Taxonomy includes organism names and classifications for every sequence in the nucleotide and protein sequence databases of the International Nucleotide Sequence Database Collaboration. Since the last review of this resource in 2012, it has undergone several improvements. Most notable is the shift from a single SQL database to a series of linked databases tied to a framework of data called NameBank. This means that relations among data elements can be adjusted in more detail, resulting in expanded annotation of synonyms, the ability to flag names with specific nomenclatural properties, enhanced tracking of publications tied to names and improved annotation of scientific authorities and types. Additionally, practices utilized by NCBI Taxonomy curators specific to major taxonomic groups are described, terms peculiar to NCBI Taxonomy are explained, external resources are acknowledged and updates to tools and other resources are documented. Database URL: https://www.ncbi.nlm.nih.gov/taxonomy

Keywords

Taxonomy (biology)Data curationComputer scienceInformation retrievalWorld Wide WebData scienceBiologyEcology

MeSH Terms

AnimalsBacteriaClassificationDatabase Management SystemsDatabasesGeneticHumansNational Library of Medicine (U.S.)PlantsUnited StatesViruses

Affiliated Institutions

Related Publications

Publication Info

Year
2020
Type
review
Volume
2020
Citations
2072
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2072
OpenAlex
79
Influential
1773
CrossRef

Cite This

Conrad L. Schoch, Stacy Ciufo, Michael Domrachev et al. (2020). NCBI Taxonomy: a comprehensive update on curation, resources and tools. Database , 2020 . https://doi.org/10.1093/database/baaa062

Identifiers

DOI
10.1093/database/baaa062
PMID
32761142
PMCID
PMC7408187

Data Quality

Data completeness: 90%