Abstract

Abstract Motivation: Many multiple sequence alignment tools have been developed in the past, progressing either in speed or alignment accuracy. Given the importance and wide-spread use of alignment tools, progress in both categories is a contribution to the community and has driven research in the field so far. Results: We introduce a graph-based extension to the consistency-based, progressive alignment strategy. We apply the consistency notion to segments instead of single characters. The main problem we solve in this context is to define segments of the sequences in such a way that a graph-based alignment is possible. We implemented the algorithm using the SeqAn library and report results on amino acid and DNA sequences. The benefit of our approach is threefold: (1) sequences with conserved blocks can be rapidly aligned, (2) the implementation is conceptually easy, generic and fast and (3) the consistency idea can be extended to align multiple genomic sequences. Availability: The segment-based multiple sequence alignment tool can be downloaded from http://www.seqan.de/projects/msa.html. A novel version of T-Coffee interfaced with the tool is available from http://www.tcoffee.org. The usage of the tool is described in both documentations. Contact: rausch@inf.fu-berlin.de

Keywords

Multiple sequence alignmentComputer scienceConsistency (knowledge bases)Alignment-free sequence analysisSequence alignmentContext (archaeology)Sequence (biology)Extension (predicate logic)GraphField (mathematics)Data miningTheoretical computer scienceArtificial intelligenceProgramming languageBiologyPeptide sequenceMathematicsGeneticsGene

Related Publications

Publication Info

Year
2008
Type
article
Volume
24
Issue
16
Pages
i187-i192
Citations
57
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

57
OpenAlex

Cite This

Tobias Rausch, Anne‐Katrin Emde, David Weese et al. (2008). Segment-based multiple sequence alignment. Bioinformatics , 24 (16) , i187-i192. https://doi.org/10.1093/bioinformatics/btn281

Identifiers

DOI
10.1093/bioinformatics/btn281