Abstract

Abstract Molecular sequences obtained at different sampling times from populations of rapidly evolving pathogens and from ancient subfossil and fossil sources are increasingly available with modern sequencing technology. Here, we present a Bayesian statistical inference approach to the joint estimation of mutation rate and population size that incorporates the uncertainty in the genealogy of such temporally spaced sequences by using Markov chain Monte Carlo (MCMC) integration. The Kingman coalescent model is used to describe the time structure of the ancestral tree. We recover information about the unknown true ancestral coalescent tree, population size, and the overall mutation rate from temporally spaced data, that is, from nucleotide sequences gathered at different times, from different individuals, in an evolving haploid population. We briefly discuss the methodological implications and show what can be inferred, in various practically relevant states of prior knowledge. We develop extensions for exponentially growing population size and joint estimation of substitution model parameters. We illustrate some of the important features of this approach on a genealogy of HIV-1 envelope (env) partial sequences.

Keywords

Coalescent theoryMutation rateBiologyMarkov chain Monte CarloPopulationBayesian probabilityInferenceEvolutionary biologyMutationMarkov chainTree (set theory)GeneticsStatisticsPhylogenetic treeArtificial intelligenceComputer scienceMathematicsDemographyCombinatorics

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
161
Issue
3
Pages
1307-1320
Citations
1077
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1077
OpenAlex

Cite This

Alexei J. Drummond, Geoff K. Nicholls, Allen G. Rodrigo et al. (2002). Estimating Mutation Parameters, Population History and Genealogy Simultaneously From Temporally Spaced Sequence Data. Genetics , 161 (3) , 1307-1320. https://doi.org/10.1093/genetics/161.3.1307

Identifiers

DOI
10.1093/genetics/161.3.1307