A Least Squares Correction for Selectivity Bias

Abstract

WHEN ESTIMATING REGRESSION MODELS it is very nearly always assumed that the sample is random. The recent literature has begun to deal with the problems which arise when estimating a regression model with samples which may not be random. The most general case in which one only has access to a single nonrandom sample has not been addressed since it is a very imposing problem. The case which has been addressed starts with a random sample but considers the problem of missing values for the dependent variable of a regression. If the determination of which values are to be observed is related to the unobservable error term in the regression, then methods such as ordinary least squares are in general inappropriate. By constructing a joint model which represents both the regression model to be estimated and the process determining when the dependent variable is to be observed, some progress can be made towards taking into account nonrandomness for the observed values of the dependent variable. The actual techniques employed fall into two rough groups, full information maximum likelihood models, and limited information methods which are more easily estimated. In the full information category are two methods. One model combines the probit and the normal regression models, and the other combines the Tobit or limited dependent variable model with the normal regression model. The form of the probit regression model is

Keywords

StatisticsEconometricsLeast-squares function approximationEconomicsSelectivityMathematicsChemistryOrganic chemistry

Related Publications

Partial least squares regression and projection on latent structure regression (PLS Regression)

Hervé Abdi

Abstract Partial least squares (PLS) regression ( a.k.a. projection on latent structures) is a recent technique that combines features from and generalizes principal component a...

2010 Wiley Interdisciplinary Reviews Compu... 1363 citations

A Comparison of Least Squares and Latent Root Regression Estimators

Richard F. Gunst , J. T. Webster , Robert Mason

Miilticollinesrity among the columns of regressor variables is known to cause severe distortion of the least squares estimates of the parameters in a multiple linear regression ...

1976 Technometrics 83 citations

pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination

Anna Lobley , Michael I. Sadowski , David T. Jones

Abstract Motivation: Generation of structural models and recognition of homologous relationships for unannotated protein sequences are fundamental problems in bioinformatics. Im...

2009 Bioinformatics 302 citations

Bootstrap Methods: Another Look at the Jackknife

B. Efron

We discuss the following problem: given a random sample $\\mathbf{X} = (X_1, X_2, \\cdots, X_n)$ from an unknown probability distribution $F$, estimate the sampling distribution...

1979 The Annals of Statistics 16966 citations

Regression methods for high dimensional multicollinear data

Lorna Aucott , Paul H. Garthwaite , James Currall

To compare their performance on high dimensional data, several regression methods are applied to data sets in which the number of exploratory variables greatly exceeds the sampl...

2000 Communications in Statistics - Simula... 13 citations

Publication Info

Year: 1980
Type: article
Volume: 48
Issue: 7
Pages: 1815-1815
Citations: 365
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

A Least Squares Correction for Selectivity Bias

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

365

OpenAlex

Cite This

APA Style

                            
                                    Randall J. Olsen
                                
                            (1980). 
                            A Least Squares Correction for Selectivity Bias. 
                            Econometrica
                            , 48
                            (7)
                            , 1815-1815.
                            https://doi.org/10.2307/1911938

Identifiers

DOI: 10.2307/1911938