Abstract
Abstract Principal curves are smooth one-dimensional curves that pass through the middle of a p-dimensional data set, providing a nonlinear summary of the data. They are nonparametric, and their shape is suggested by the data. The algorithm for constructing principal curves starts with some prior summary, such as the usual principal-component line. The curve in each successive iteration is a smooth or local average of the p-dimensional points, where the definition of local is based on the distance in arc length of the projections of the points onto the curve found in the previous iteration. In this article principal curves are defined, an algorithm for their construction is given, some theoretical results are presented, and the procedure is compared to other generalizations of principal components. Two applications illustrate the use of principal curves. The first describes how the principal-curve procedure was used to align the magnets of the Stanford linear collider. The collider uses about 950 magnets in a roughly circular arrangement to bend electron and positron beams and bring them to collision. After construction, it was found that some of the magnets had ended up significantly out of place. As a result, the beams had to be bent too sharply and could not be focused. The engineers realized that the magnets did not have to be moved to their originally planned locations, but rather to a sufficiently smooth arc through the middle of the existing positions. This arc was found using the principal-curve procedure. In the second application, two different assays for gold content in several samples of computer-chip waste appear to show some systematic differences that are blurred by measurement error. The classical approach using linear errors in variables regression can detect systematic linear differences but is not able to account for nonlinearities. When the first linear principal component is replaced with a principal curve, a local "bump" is revealed, and bootstrapping is used to verify its presence.
Keywords
Affiliated Institutions
Related Publications
The IntCal20 Northern Hemisphere Radiocarbon Age Calibration Curve (0–55 cal kBP)
ABSTRACT Radiocarbon ( 14 C) ages cannot provide absolutely dated chronologies for archaeological or paleoenvironmental studies directly but must be converted to calendar age eq...
Self-organizing distributed sensor networks
Advances in CMOS IC and micro electrical-mechanical systems (MEMS) technologies are enabling construction of low-cost building blocks each of which incorporates sensing, signal ...
The Bayesian Approach to Radiocarbon Calibration Curve Estimation: The IntCal13, Marine13, and SHCal13 Methodologies
This article outlines the Bayesian models and methods used to facilitate construction of the 2013 internationally agreed radiocarbon calibration curves known as IntCal13, Marine...
Discrete Variational Method for the Energy-Band Problem with General Crystal Potentials
A general variational method for efficiently calculating energy bands and charge densities in solids is presented; the method can be viewed as a weighted local-energy procedure ...
Spline Smoothing: The Equivalent Variable Kernel Method
The spline smoothing approach to nonparametric regression and curve estimation is considered. It is shown that, in a certain sense, spline smoothing corresponds approximately to...
Publication Info
- Year
- 1989
- Type
- article
- Volume
- 84
- Issue
- 406
- Pages
- 502-502
- Citations
- 346
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.2307/2289936