Abstract
Recently, Mulvey and Crowder (Mulvey, J., H. Crowder. 1979. Cluster analysis: an application of Lagrangian relaxation. Management Sci. 25 329–340.) suggested that the p-median problem might be useful for cluster analysis problems (where the goal is to group objects described by a vector of characteristics in such a way that objects in the same group are somehow more alike than objects in different groups). The intent of this paper is to test Mulvey and Crowder's proposal using the mixture model approach; i.e., by applying a number of algorithms (including one for the p-median problem) to a set of objects randomly sampled from a number of known multivariate populations and comparing the ability of each algorithm to detect the original populations. In order to evaluate the results, a generalized partition comparison measure and its distribution are developed. Using this measure, results from various algorithms are compared.
Keywords
Affiliated Institutions
Related Publications
On Some Invariant Criteria for Grouping Data
Abstract This paper deals with methods of "cluster analysis". In particular we attack the problem of exploring the structure of multivariate data in search of "clusters". The ap...
On Some Invariant Criteria for Grouping Data
Abstract This paper deals with methods of "cluster analysis". In particular we attack the problem of exploring the structure of multivariate data in search of "clusters". The ap...
Inference of Population Structure Under a Dirichlet Process Model
Abstract Inferring population structure from genetic data sampled from some number of individuals is a formidable statistical problem. One widely used approach considers the num...
A Bayesian approach to the identification of panmictic populations and the assignment of individuals
We present likelihood-based methods for assigning the individuals in a sample to source populations, on the basis of their genotypes at co-dominant marker loci. The source popul...
Combining Mixture Components for Clustering
Model-based clustering consists of fitting a mixture model to data and identifying each cluster with one of its components. Multivariate normal distributions are typically used....
Publication Info
- Year
- 1985
- Type
- article
- Volume
- 31
- Issue
- 1
- Pages
- 84-95
- Citations
- 62
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1287/mnsc.31.1.84