Abstract
The Gram matrix plays a central role in many kernel methods. Knowledge about the distribution of eigenvalues of the Gram matrix is useful for developing appropriate model selection methods for kernel PCA. We use methods adapted from the statistical physics of classical fluids in order to study the averaged spectrum of the Gram matrix. We focus in particular on a variational mean-field theory and related diagrammatic approach. We show that the mean-field theory correctly reproduces previously obtained asymptotic results for standard PCA. Comparison with simulations for data distributed uniformly on the sphere shows that the method provides a good qualitative approximation to the averaged spectrum for kernel PCA with a Gaussian Radial Basis Function kernel. We also develop an analytical approximation to the spectral density that agrees closely with the numerical solution and provides insight into the number of samples required to resolve the corresponding process eigenvalues of a given order.
Original language | English |
---|---|
Title of host publication | Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)|Lect Notes Artif Intell |
Editors | J. Shawe-Taylor, Y. Singer |
Publisher | Springer Nature |
Pages | 579-593 |
Number of pages | 14 |
Volume | 3120 |
Publication status | Published - 2004 |
Event | 17th Annual Conference on Learning Theory, COLT 2004 - Banff Duration: 1 Jul 2004 → … http://dblp.uni-trier.de/db/conf/colt/colt2004.html#HoyleR04http://dblp.uni-trier.de/rec/bibtex/conf/colt/HoyleR04.xmlhttp://dblp.uni-trier.de/rec/bibtex/conf/colt/HoyleR04 |
Publication series
Name | Lecture Notes in Computer Science |
---|
Conference
Conference | 17th Annual Conference on Learning Theory, COLT 2004 |
---|---|
City | Banff |
Period | 1/07/04 → … |
Internet address |