Variable Selection via Gibbs Sampling

Abstract

Abstract A crucial problem in building a multiple regression model is the selection of predictors to include. The main thrust of this article is to propose and develop a procedure that uses probabilistic considerations for selecting promising subsets. This procedure entails embedding the regression setup in a hierarchical normal mixture model where latent variables are used to identify subset choices. In this framework the promising subsets of predictors can be identified as those with higher posterior probability. The computational burden is then alleviated by using the Gibbs sampler to indirectly sample from this multinomial posterior distribution on the set of possible subset choices. Those subsets with higher probability—the promising ones—can then be identified by their more frequent appearance in the Gibbs sample.

Keywords

Gibbs samplingSelection (genetic algorithm)Multinomial logistic regressionMultinomial distributionStatisticsMathematicsLatent variableProbabilistic logicPosterior probabilityComputer scienceCovariateEconometricsMachine learningBayesian probability

Affiliated Institutions

Related Publications

Estimation of Finite Mixture Distributions Through Bayesian Sampling

Jean Diebolt , Christian P. Robert

SUMMARY A formal Bayesian analysis of a mixture model usually leads to intractable calculations, since the posterior distribution takes into account all the partitions of the sa...

1994 Journal of the Royal Statistical Soci... 904 citations

Bayesian Variable Selection in Linear Regression

Toby J. Mitchell , John J. Beauchamp

Abstract This article is concerned with the selection of subsets of predictor variables in a linear regression model for the prediction of a dependent variable. It is based on a...

1988 Journal of the American Statistical A... 1367 citations

A Split-Merge Markov chain Monte Carlo Procedure for the Dirichlet Process Mixture Model

Sonia Jain , Radford M. Neal

This article proposes a split-merge Markov chain algorithm to address the problem of inefficient sampling for conjugate Dirichlet process mixture models. Traditional Markov chai...

2004 Journal of Computational and Graphica... 464 citations

A multivariate technique for multiply imputing missing values using a sequence of regression models

Trivellore E. Raghunathan , James M. Lepkowski , John Van Hoewyk +1 more

This article describes and evaluates a procedure for imputing missing values for a relatively complex data structure when the data are missing at random. The imputations are obt...

2001 Survey methodology 1994 citations

Gibbs Sampling for Bayesian Non-Conjugate and Hierarchical Models by Using Auxiliary Variables

P. Damlen , Jon Wakefield , Stephen Walker

Summary We demonstrate the use of auxiliary (or latent) variables for sampling non-standard densities which arise in the context of the Bayesian analysis of non-conjugate and hi...

1999 Journal of the Royal Statistical Soci... 356 citations

Publication Info

Year: 1993
Type: article
Volume: 88
Issue: 423
Pages: 881-889
Citations: 2675
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Variable Selection via Gibbs Sampling

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

2675

OpenAlex

Cite This

APA Style

                            
                                    Edward I. George, 
                                
                                    Robert E. McCulloch
                                
                            (1993). 
                            Variable Selection via Gibbs Sampling. 
                            Journal of the American Statistical Association
                            , 88
                            (423)
                            , 881-889.
                            https://doi.org/10.1080/01621459.1993.10476353

Identifiers

DOI: 10.1080/01621459.1993.10476353