The InterPro protein families and domains database: 20 years on

2020 Nucleic Acids Research 2,201 citations

Abstract

Abstract The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. InterProScan is the underlying software that allows protein and nucleic acid sequences to be searched against InterPro's signatures. Signatures are predictive models which describe protein families, domains or sites, and are provided by multiple databases. InterPro combines signatures representing equivalent families, domains or sites, and provides additional information such as descriptions, literature references and Gene Ontology (GO) terms, to produce a comprehensive resource for protein classification. Founded in 1999, InterPro has become one of the most widely used resources for protein family annotation. Here, we report the status of InterPro (version 81.0) in its 20th year of operation, and its associated software, including updates to database content, the release of a new website and REST API, and performance improvements in InterProScan.

Keywords

BiologyComputational biologyGeneticsBioinformaticsEvolutionary biologyDatabase

Affiliated Institutions

Related Publications

Publication Info

Year
2020
Type
article
Volume
49
Issue
D1
Pages
D344-D354
Citations
2201
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2201
OpenAlex

Cite This

Matthias Blum, Hsin-Yu Chang, Sara Chuguransky et al. (2020). The InterPro protein families and domains database: 20 years on. Nucleic Acids Research , 49 (D1) , D344-D354. https://doi.org/10.1093/nar/gkaa977

Identifiers

DOI
10.1093/nar/gkaa977