NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

2004 Nucleic Acids Research 1,622 citations

Abstract

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) provides a non-redundant collection of sequences representing genomic data, transcripts and proteins. Although the goal is to provide a comprehensive dataset representing the complete sequence information for any given species, the database pragmatically includes sequence data that are currently publicly available in the archival databases. The database incorporates data from over 2400 organisms and includes over one million proteins representing significant taxonomic diversity spanning prokaryotes, eukaryotes and viruses. Nucleotide and protein sequences are explicitly linked, and the sequences are linked to other resources including the NCBI Map Viewer and Gene. Sequences are annotated to include coding regions, conserved domains, variation, references, names, database cross-references, and other features using a combined approach of collaboration and other input from the scientific community, automated annotation, propagation from GenBank and curation by NCBI staff.

Keywords

RefSeqGenBankBiologySequence databaseEnsemblAnnotationGenomedbSNPDatabaseSequence (biology)Computational biologyReference genomeGenome projectBioinformaticsGeneticsGenomicsGeneComputer scienceSingle-nucleotide polymorphism

Affiliated Institutions

Related Publications

Publication Info

Year
2004
Type
article
Volume
33
Issue
Database issue
Pages
D501-D504
Citations
1622
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1622
OpenAlex

Cite This

Kim D. Pruitt (2004). NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Research , 33 (Database issue) , D501-D504. https://doi.org/10.1093/nar/gki025

Identifiers

DOI
10.1093/nar/gki025