Abstract

Abstract MOTIVATION: To cope with the increasing amount of sequence data, reliable automatic annotation tools are required. The TrEMBL database contains together with SWISS-PROT nearly all publicly available protein sequences, but in contrast to SWISS-PROT only limited functional annotation. To improve this situation, we had to develop a method of automatic annotation that produces highly reliable functional prediction using the language and the syntax of SWISS-PROT. RESULTS: An algorithm was developed and successfully used for the automatic annotation of a testset of unknown proteins. The predicted information included description, function, catalytic activity, cofactors, pathway, subcellular location, quaternary structure, similarity to other protein, active sites, and keywords. The algorithm showed a low coverage (10%), but a high specificity and reliability. AVAILABILITY: The results can be obtained by anonymous ftp from ftp.ebi.ac.uk/pub/databases/sp_tr_nrdb. The source code is available on request from the authors.

Keywords

AnnotationComputer scienceComputational biologyArtificial intelligenceNatural language processingInformation retrievalBiology

Affiliated Institutions

Related Publications

Publication Info

Year
1999
Type
article
Volume
15
Issue
3
Pages
228-233
Citations
90
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

90
OpenAlex

Cite This

Wolfgang Fleischmann, S Möller, A Gateau et al. (1999). A novel method for automatic functional annotation of proteins.. Bioinformatics , 15 (3) , 228-233. https://doi.org/10.1093/bioinformatics/15.3.228

Identifiers

DOI
10.1093/bioinformatics/15.3.228