Abstract

A common problem in the statistical analysis of clinical studies is selecting, within the framework of a regression model, those variables which might influence the outcome variable. Stepwise methods have been available for a long time, but as with many other possible strategies, there is a lot of criticism of their use. Investigations of the stability of a selected model are often called for, but usually are not carried out in a systematic way. Since analytical approaches are extremely difficult, data‐dependent methods might be a useful alternative. Based on a bootstrap resampling procedure, Chen and George investigated the stability of a stepwise selection procedure in the framework of the Cox proportional hazards regression model. We extend their proposal and develop a bootstrap‐model selection procedure, combining the bootstrap method with existing selection techniques such as stepwise methods. We illustrate the proposed strategy in the process of model building by using data from two cancer clinical trials featuring two different situations commonly arising in clinical research. In a brain tumour study, the adjustment for covariates in an overall treatment comparison is of primary interest, calling for the selection of even ‘mild’ effects. In a prostate cancer study, we concentrate on the analysis of treatment‐covariate interactions, demanding that only ‘strong’ effects should be selected. Both variants of the strategy are demonstrated by analysing the clinical trials with a Cox model, but they can be applied in other types of regression with obvious and straightforward modifications.
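The general idea of combining bootstrap resampling with a selection procedure can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes ordinary least squares in place of the Cox model (relying on the abstract's remark that other regression types are possible), uses a simple backward elimination rule as a stand-in for stepwise selection, and runs on simulated data. The function names, the threshold `z_crit`, and the number of replicates `n_boot` are hypothetical choices made for illustration only.

```python
import numpy as np


def backward_eliminate(X, y, z_crit=1.96):
    """Backward elimination for OLS (assumed stand-in for stepwise selection).

    Repeatedly drops the covariate with the smallest |t|-statistic until
    every remaining covariate has |t| >= z_crit. Returns column indices.
    """
    n, p = X.shape
    active = list(range(p))
    while active:
        # Design matrix: intercept plus the currently active covariates.
        A = np.column_stack([np.ones(n), X[:, active]])
        beta, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ beta
        sigma2 = resid @ resid / (n - A.shape[1])
        cov = sigma2 * np.linalg.inv(A.T @ A)
        # |t|-statistics of the covariates (intercept excluded).
        t = np.abs(beta[1:] / np.sqrt(np.diag(cov)[1:]))
        worst = int(np.argmin(t))
        if t[worst] >= z_crit:
            break
        active.pop(worst)
    return active


def bootstrap_inclusion_frequencies(X, y, n_boot=200, z_crit=1.96, seed=0):
    """Fraction of bootstrap replicates in which each covariate is selected."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    counts = np.zeros(p)
    for _ in range(n_boot):
        # Resample patients (rows) with replacement, then rerun the selection.
        idx = rng.integers(0, n, size=n)
        selected = backward_eliminate(X[idx], y[idx], z_crit)
        if selected:
            counts[selected] += 1
    return counts / n_boot


# Simulated data (hypothetical): two real effects, three noise covariates.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))
y = 1.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=200)

freq = bootstrap_inclusion_frequencies(X, y)
print(np.round(freq, 2))
```

Covariates that are selected in most replicates can be regarded as stable choices, while those selected only occasionally are candidates for exclusion. In this sketch, lowering `z_crit` makes selection more liberal and raising it makes selection stricter, which loosely mirrors the abstract's distinction between admitting ‘mild’ effects and requiring ‘strong’ ones.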

Keywords

Covariate; Resampling; Computer science; Proportional hazards model; Regression; Model selection; Selection (genetic algorithm); Regression analysis; Feature selection; Stability (learning theory); Stepwise regression; Econometrics; Statistics; Data mining; Machine learning; Artificial intelligence; Mathematics

Related Publications

Bootstrap Methods: Another Look at the Jackknife

We discuss the following problem: given a random sample $\mathbf{X} = (X_1, X_2, \cdots, X_n)$ from an unknown probability distribution $F$, estimate the sampling distribution...

1979 The Annals of Statistics 16966 citations

Publication Info

Year: 1992
Type: article
Volume: 11
Issue: 16
Pages: 2093-2109
Citations: 581
Access: Closed

Citation Metrics

581 (OpenAlex)

Cite This

Willi Sauerbrei, Martin Schumacher (1992). A bootstrap resampling procedure for model building: Application to the Cox regression model. Statistics in Medicine, 11(16), 2093-2109. https://doi.org/10.1002/sim.4780111607

Identifiers

DOI: 10.1002/sim.4780111607