Abstract

Effective population size is fundamental in population genetics and characterizes genetic diversity. To infer past population dynamics from molecular sequence data, coalescent-based models have been developed for Bayesian nonparametric estimation of effective population size over time. Among the most successful is a Gaussian Markov random field (GMRF) model for a single gene locus. Here, we present a generalization of the GMRF model that allows for the analysis of multilocus sequence data. Using simulated data, we demonstrate the improved performance of our method to recover true population trajectories and the time to the most recent common ancestor (TMRCA). We analyze a multilocus alignment of HIV-1 CRF02_AG gene sequences sampled from Cameroon. Our results are consistent with HIV prevalence data and uncover some aspects of the population history that go undetected in Bayesian parametric estimation. Finally, we recover an older and more reconcilable TMRCA for a classic ancient DNA data set.

Keywords

Coalescent theoryBiologyPopulationMost recent common ancestorBayesian probabilityInferencePopulation geneticsEvolutionary biologyDemographic historyPopulation sizeBayes' theoremEffective population sizeGeneticsArtificial intelligenceComputer scienceGenetic variationGenePhylogenetics

Affiliated Institutions

Related Publications

Publication Info

Year
2012
Type
article
Volume
30
Issue
3
Pages
713-724
Citations
597
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

597
OpenAlex

Cite This

M. S. Gill, Philippe Lemey, Nuno R. Faria et al. (2012). Improving Bayesian Population Dynamics Inference: A Coalescent-Based Model for Multiple Loci. Molecular Biology and Evolution , 30 (3) , 713-724. https://doi.org/10.1093/molbev/mss265

Identifiers

DOI
10.1093/molbev/mss265