InterPro: the integrative protein signature database

2008 Nucleic Acids Research 2,076 citations

Abstract

The InterPro database (http://www.ebi.ac.uk/interpro/) integrates together predictive models or 'signatures' representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Integration is performed manually and approximately half of the total approximately 58,000 signatures available in the source databases belong to an InterPro entry. Recently, we have started to also display the remaining un-integrated signatures via our web interface. Other developments include the provision of non-signature data, such as structural data, in new XML files on our FTP site, as well as the inclusion of matchless UniProtKB proteins in the existing match XML files. The web interface has been extended and now links out to the ADAN predicted protein-protein interaction database and the SPICE and Dasty viewers. The latest public release (v18.0) covers 79.8% of UniProtKB (v14.1) and consists of 16 549 entries. InterPro data may be accessed either via the web address above, via web services, by downloading files by anonymous FTP or by using the InterProScan search software (http://www.ebi.ac.uk/Tools/InterProScan/).

Keywords

UniProtXMLUploadBiologyDatabaseUser interfaceInterface (matter)World Wide WebComputer scienceBioinformaticsGenetics

MeSH Terms

DatabasesProteinProteinsSequence AnalysisProteinSystems Integration

Affiliated Institutions

Related Publications

Publication Info

Year
2008
Type
article
Volume
37
Issue
Database
Pages
D211-D215
Citations
2076
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2076
OpenAlex
185
Influential
1739
CrossRef

Cite This

Sarah Hunter, Rolf Apweiler, Teresa K. Attwood et al. (2008). InterPro: the integrative protein signature database. Nucleic Acids Research , 37 (Database) , D211-D215. https://doi.org/10.1093/nar/gkn785

Identifiers

DOI
10.1093/nar/gkn785
PMID
18940856
PMCID
PMC2686546

Data Quality

Data completeness: 90%