Abstract

The spatial sign is a multivariate extension of the concept of sign. Recently multivariate estimators of covariance structures based on spatial signs have been examined by various authors. These new estimators are found to be robust to outlying observations. From a computational point of view, estimators based on spatial sign are very easy to implement as they boil down to a transformation of the data to their spatial signs, from which the classical estimator is then computed. Hence, one can also consider the transformation to spatial signs to be a preprocessing technique, which ensures that the calibration procedure as a whole is robust. In this paper, we examine the special case of spatial sign preprocessing in combination with partial least squares regression as the latter technique is frequently applied in the context of chemical data analysis. In a simulation study, we compare the performance of the spatial sign transformation to nontransformed data as well as to two robust counterparts of partial least squares regression. It turns out that the spatial sign transform is fairly efficient but has some undesirable bias properties. The method is applied to a recently published data set in the field of quantitative structure-activity relationships, where it is seen to perform equally well as the previously described best linear model for these data.

Keywords

Multivariate statisticsRobustness (evolution)EstimatorPreprocessorComputer scienceSimple (philosophy)Sign (mathematics)Artificial intelligenceMultivariate analysisStatisticsData miningPattern recognition (psychology)AlgorithmMathematicsMachine learningChemistry

Affiliated Institutions

Related Publications

Publication Info

Year
2006
Type
article
Volume
46
Issue
3
Pages
1402-1409
Citations
56
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

56
OpenAlex

Cite This

Sven Serneels, Evert De Nolf, Pierre J. Van Espen (2006). Spatial Sign Preprocessing:  A Simple Way To Impart Moderate Robustness to Multivariate Estimators. Journal of Chemical Information and Modeling , 46 (3) , 1402-1409. https://doi.org/10.1021/ci050498u

Identifiers

DOI
10.1021/ci050498u