Abstract
WHEN ESTIMATING REGRESSION MODELS it is very nearly always assumed that the sample is random. The recent literature has begun to deal with the problems which arise when estimating a regression model with samples which may not be random. The most general case in which one only has access to a single nonrandom sample has not been addressed since it is a very imposing problem. The case which has been addressed starts with a random sample but considers the problem of missing values for the dependent variable of a regression. If the determination of which values are to be observed is related to the unobservable error term in the regression, then methods such as ordinary least squares are in general inappropriate. By constructing a joint model which represents both the regression model to be estimated and the process determining when the dependent variable is to be observed, some progress can be made towards taking into account nonrandomness for the observed values of the dependent variable. The actual techniques employed fall into two rough groups, full information maximum likelihood models, and limited information methods which are more easily estimated. In the full information category are two methods. One model combines the probit and the normal regression models, and the other combines the Tobit or limited dependent variable model with the normal regression model. The form of the probit regression model is
Keywords
Related Publications
Partial least squares regression and projection on latent structure regression (PLS Regression)
Abstract Partial least squares (PLS) regression ( a.k.a. projection on latent structures) is a recent technique that combines features from and generalizes principal component a...
A Comparison of Least Squares and Latent Root Regression Estimators
Miilticollinesrity among the columns of regressor variables is known to cause severe distortion of the least squares estimates of the parameters in a multiple linear regression ...
pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination
Abstract Motivation: Generation of structural models and recognition of homologous relationships for unannotated protein sequences are fundamental problems in bioinformatics. Im...
Bootstrap Methods: Another Look at the Jackknife
We discuss the following problem: given a random sample $\\mathbf{X} = (X_1, X_2, \\cdots, X_n)$ from an unknown probability distribution $F$, estimate the sampling distribution...
Regression methods for high dimensional multicollinear data
To compare their performance on high dimensional data, several regression methods are applied to data sets in which the number of exploratory variables greatly exceeds the sampl...
Publication Info
- Year
- 1980
- Type
- article
- Volume
- 48
- Issue
- 7
- Pages
- 1815-1815
- Citations
- 365
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.2307/1911938