The <i>p</i>-Median Problem for Cluster Analysis: A Comparative Test Using the Mixture Model Approach

Ted Klastorin

doi:10.1287/mnsc.31.1.84

Abstract

Recently, Mulvey and Crowder (Mulvey, J., H. Crowder. 1979. Cluster analysis: an application of Lagrangian relaxation. Management Sci. 25 329–340.) suggested that the p-median problem might be useful for cluster analysis problems (where the goal is to group objects described by a vector of characteristics in such a way that objects in the same group are somehow more alike than objects in different groups). The intent of this paper is to test Mulvey and Crowder's proposal using the mixture model approach; i.e., by applying a number of algorithms (including one for the p-median problem) to a set of objects randomly sampled from a number of known multivariate populations and comparing the ability of each algorithm to detect the original populations. In order to evaluate the results, a generalized partition comparison measure and its distribution are developed. Using this measure, results from various algorithms are compared.

Keywords

Partition (number theory)Measure (data warehouse)Cluster (spacecraft)Set (abstract data type)Computer scienceMathematicsAlgorithmCombinatoricsData mining

Affiliated Institutions

University of Washington US

Related Publications

On Some Invariant Criteria for Grouping Data

Herman Friedman , Jerrold Rubin

Abstract This paper deals with methods of "cluster analysis". In particular we attack the problem of exploring the structure of multivariate data in search of "clusters". The ap...

1967 Journal of the American Statistical A... 570 citations

On Some Invariant Criteria for Grouping Data

Herman Friedman , Jerrold Rubin

Abstract This paper deals with methods of "cluster analysis". In particular we attack the problem of exploring the structure of multivariate data in search of "clusters". The ap...

1967 Journal of the American Statistical A... 152 citations

Inference of Population Structure Under a Dirichlet Process Model

John P. Huelsenbeck , Peter Andolfatto

Abstract Inferring population structure from genetic data sampled from some number of individuals is a formidable statistical problem. One widely used approach considers the num...

2007 Genetics 293 citations

A Bayesian approach to the identification of panmictic populations and the assignment of individuals

Kevin J. Dawson , Khalid Belkhir

We present likelihood-based methods for assigning the individuals in a sample to source populations, on the basis of their genotypes at co-dominant marker loci. The source popul...

2001 Genetics Research 229 citations

Combining Mixture Components for Clustering

Jean-Patrick Baudry , Adrian E. Raftery , Gilles Celeux +2 more

Model-based clustering consists of fitting a mixture model to data and identifying each cluster with one of its components. Multivariate normal distributions are typically used....

2010 Journal of Computational and Graphica... 332 citations

Publication Info

Year: 1985
Type: article
Volume: 31
Issue: 1
Pages: 84-95
Citations: 62
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

The <i>p</i>-Median Problem for Cluster Analysis: A Comparative Test Using the Mixture Model Approach

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

OpenAlex

Cite This

APA Style

                            
                                    Ted Klastorin
                                
                            (1985). 
                            The <i>p</i>-Median Problem for Cluster Analysis: A Comparative Test Using the Mixture Model Approach. 
                            Management Science
                            , 31
                            (1)
                            , 84-95.
                            https://doi.org/10.1287/mnsc.31.1.84

Identifiers

DOI: 10.1287/mnsc.31.1.84