Abstract

To compare entire genomes from different species, biologists increasingly need alignment methods that are efficient enough to handle long sequences, and accurate enough to correctly align the conserved biological features between distant species. We present LAGAN, a system for rapid global alignment of two homologous genomic sequences, and Multi-LAGAN, a system for multiple global alignment of genomic sequences. We tested our systems on a data set consisting of greater than 12 Mb of high-quality sequence from 12 vertebrate species. All the sequence was derived from the genomic region orthologous to an ∼1.5-Mb region on human chromosome 7q31.3. We found that both LAGAN and Multi-LAGAN compare favorably with other leading alignment methods in correctly aligning protein-coding exons, especially between distant homologs such as human and chicken, or human and fugu. Multi-LAGAN produced the most accurate alignments, while requiring just 75 minutes on a personal computer to obtain the multiple alignment of all 12 sequences. Multi-LAGAN is a practical method for generating multiple alignments of long genomic sequences at any evolutionary distance. Our systems are publicly available at http://lagan.stanford.edu .

Keywords

BiologyMultiple sequence alignmentComputational biologyAlignment-free sequence analysisGenomeSequence alignmentGeneticsHuman genomegenomic DNACoding regionSequence (biology)DNAGenePeptide sequence

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
13
Issue
4
Pages
721-731
Citations
1139
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1139
OpenAlex

Cite This

Michael Brudno, Joshua Shulman, Gregory M. Cooper et al. (2003). LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNA. Genome Research , 13 (4) , 721-731. https://doi.org/10.1101/gr.926603

Identifiers

DOI
10.1101/gr.926603