Many-core algorithms for statistical phylogenetics

Abstract

Motivation: Statistical phylogenetics is computationally intensive, so considerable attention has been devoted to techniques for parallelization. Codon-based models allow for independent rates of synonymous and replacement substitutions and have the potential to model the process of protein-coding sequence evolution more adequately, with a resulting increase in phylogenetic accuracy. Unfortunately, because of the high number of codon states, the computational burden has largely thwarted phylogenetic reconstruction under codon models, particularly at the genomic scale. Here, we describe novel algorithms and methods for evaluating phylogenies under arbitrary molecular evolutionary models on graphics processing units (GPUs), making use of the large number of processing cores to efficiently parallelize calculations even for large state-size models.

Results: We implement the approach in an existing Bayesian framework and apply the algorithms to estimating the phylogeny of 62 complete mitochondrial genomes of carnivores under a 60-state codon model. We see a near 90-fold speed increase over an optimized CPU-based computation and a >140-fold increase over the currently available implementation, making this the first practical use of codon models for phylogenetic inference over whole mitochondrial or microorganism genomes.

Availability and implementation: Source code is provided in BEAGLE: Broad-platform Evolutionary Analysis General Likelihood Evaluator, a cross-platform/processor library for phylogenetic likelihood computation (http://beagle-lib.googlecode.com/). We employ a BEAGLE implementation using the Bayesian phylogenetics framework BEAST (http://beast.bio.ed.ac.uk/).

Contact: msuchard@ucla.edu; a.rambaut@ed.ac.uk
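To illustrate why large state-size models are costly and why the work parallelizes well, the sketch below shows the standard pruning recursion for combining two children's partial likelihoods at an internal tree node. It is a minimal pure-Python illustration under assumed names (`update_partials`, `site_likelihoods`) and an assumed list-of-lists data layout; it is not the GPU kernels described in the abstract and not the BEAGLE API.

```python
# Illustrative sketch only: function names and data layout are assumptions,
# not the paper's GPU implementation and not the BEAGLE API.

def update_partials(child1, child2, P1, P2):
    """Combine two children's partial likelihoods into the parent's.

    child1, child2: per-pattern vectors; child[p][j] is the likelihood of the
        data below that child at site pattern p, given the child is in state j.
    P1, P2: branch transition matrices; P[s][j] = Pr(child state j | parent state s).
    """
    n_states = len(P1)
    parent = []
    for p in range(len(child1)):      # site patterns are independent of each other
        row = []
        for s in range(n_states):     # parent states are independent of each other
            sum1 = sum(P1[s][j] * child1[p][j] for j in range(n_states))
            sum2 = sum(P2[s][j] * child2[p][j] for j in range(n_states))
            row.append(sum1 * sum2)
        parent.append(row)
    return parent


def site_likelihoods(root_partials, freqs):
    """Weight root partials by the stationary state frequencies, per pattern."""
    return [sum(f * x for f, x in zip(freqs, row)) for row in root_partials]


if __name__ == "__main__":
    # Toy 2-state example with one site pattern; values are made up.
    P = [[0.9, 0.1], [0.2, 0.8]]
    tip_a = [[1.0, 0.0]]   # tip observed in state 0
    tip_b = [[0.0, 1.0]]   # tip observed in state 1
    root = update_partials(tip_a, tip_b, P, P)
    print(site_likelihoods(root, [0.5, 0.5]))
```

Every (pattern, parent state) entry can be computed independently, and each entry costs a number of multiply-adds proportional to the state count. For a 60-state codon model that is 60 × 60 = 3600 multiply-adds per site pattern per child, compared with 4 × 4 = 16 for nucleotides, which is the kind of fine-grained, regular workload that maps naturally onto many cores.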

Keywords

Phylogenetic tree; Phylogenetics; Computer science; Bayesian probability; Codon usage bias; Inference; Algorithm; Bayesian inference; Computational biology; Genome; Biology; Artificial intelligence; Genetics; Gene

Publication Info

Year: 2009
Type: article
Volume: 25
Issue: 11
Pages: 1370-1376
Citations: 430
Access: Closed

Cite This

APA Style

Suchard, M. A., & Rambaut, A. (2009). Many-core algorithms for statistical phylogenetics. Bioinformatics, 25(11), 1370-1376. https://doi.org/10.1093/bioinformatics/btp244

Identifiers

DOI: 10.1093/bioinformatics/btp244