Abstract

The pls package implements principal component regression (PCR) and partial least squares regression (PLSR) in R (R Development Core Team 2006b), and is freely available from the Comprehensive R Archive Network (CRAN), licensed under the GNU General Public License (GPL). The user interface is modelled after the traditional formula interface, as exemplified by lm. This was done so that people used to R would not have to learn yet another interface, and also because we believe the formula interface is a good way of working interactively with models. It thus has methods for generic functions like predict, update and coef. It also has more specialised functions like scores, loadings and RMSEP, and a exible crossvalidation system. Visual inspection and assessment is important in chemometrics, and the pls package has a number of plot functions for plotting scores, loadings, predictions, coefficients and RMSEP estimates. The package implements PCR and several algorithms for PLSR. The design is modular, so that it should be easy to use the underlying algorithms in other functions. It is our hope that the package will serve well both for interactive data analysis and as a building block for other functions or packages using PLSR or PCR. We will here describe the package and how it is used for data analysis, as well as how it can be used as a part of other packages. Also included is a section about formulas and data frames, for people not used to the R modelling idioms.

Keywords

Partial least squares regressionR packageInterface (matter)Principal component analysisComputer sciencePrincipal component regressionComponent (thermodynamics)Data miningChemometricsRegressionStatisticsArtificial intelligenceMathematicsMachine learningComputational science

Affiliated Institutions

Related Publications

Publication Info

Year
2007
Type
article
Volume
18
Issue
2
Citations
1663
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1663
OpenAlex

Cite This

Bjørn‐Helge Mevik, Ron Wehrens (2007). The<b>pls</b>Package: Principal Component and Partial Least Squares Regression in<i>R</i>. Journal of Statistical Software , 18 (2) . https://doi.org/10.18637/jss.v018.i02

Identifiers

DOI
10.18637/jss.v018.i02