Abstract

We derive new bounds for the generalization error of kernel machines, such as support vector machines and related regularization networks by obtaining new bounds on their covering numbers. The proofs make use of a viewpoint that is apparently novel in the field of statistical learning theory. The hypothesis class is described in terms of a linear operator mapping from a possibly infinite-dimensional unit ball in feature space into a finite-dimensional space. The covering numbers of the class are then determined via the entropy numbers of the operator. These numbers, which characterize the degree of compactness of the operator, can be bounded in terms of the eigenvalues of an integral operator induced by the kernel function used by the machine. As a consequence, we are able to theoretically explain the effect of the choice of kernel function on the generalization performance of support vector machines.

Keywords

MathematicsEntropy (arrow of time)Eigenvalues and eigenvectorsStatistical learning theoryBounded functionSupport vector machinePositive-definite kernelOperator (biology)Discrete mathematicsComputer scienceArtificial intelligenceOperator theoryMathematical analysisFourier integral operator

Affiliated Institutions

Related Publications

Publication Info

Year
2001
Type
article
Volume
47
Issue
6
Pages
2516-2532
Citations
169
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

169
OpenAlex

Cite This

Robert C. Williamson, Alex Smola, Bernhard Schölkopf (2001). Generalization performance of regularization networks and support vector machines via entropy numbers of compact operators. IEEE Transactions on Information Theory , 47 (6) , 2516-2532. https://doi.org/10.1109/18.945262

Identifiers

DOI
10.1109/18.945262