Abstract

Abstract We introduce a new Bayesian clustering algorithm for studying population structure using individually geo-referenced multilocus data sets. The algorithm is based on the concept of hidden Markov random field, which models the spatial dependencies at the cluster membership level. We argue that (i) a Markov chain Monte Carlo procedure can implement the algorithm efficiently, (ii) it can detect significant geographical discontinuities in allele frequencies and regulate the number of clusters, (iii) it can check whether the clusters obtained without the use of spatial priors are robust to the hypothesis of discontinuous geographical variation in allele frequencies, and (iv) it can reduce the number of loci required to obtain accurate assignments. We illustrate and discuss the implementation issues with the Scandinavian brown bear and the human CEPH diversity panel data set.

Keywords

Cluster analysisBayesian probabilityMarkov chainMarkov chain Monte CarloPopulationHierarchical clusteringPopulation geneticsComputer scienceBiologyData miningArtificial intelligenceMachine learning

Affiliated Institutions

Related Publications

Publication Info

Year
2006
Type
article
Volume
174
Issue
2
Pages
805-816
Citations
344
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

344
OpenAlex

Cite This

Olivier François, Sophie Ancelet, G. Guillot (2006). Bayesian Clustering Using Hidden Markov Random Fields in Spatial Population Genetics. Genetics , 174 (2) , 805-816. https://doi.org/10.1534/genetics.106.059923

Identifiers

DOI
10.1534/genetics.106.059923