Abstract
Whitening, or sphering, is a common preprocessing step in statistical analysis to transform random variables to orthogonality. However, due to rotational freedom there are infinitely many possible whitening procedures. Consequently, there is a diverse range of sphering methods in use, for example based on principal component analysis (PCA), Cholesky matrix decomposition and zero-phase component analysis (ZCA), among others. Here we provide an overview of the underlying theory and discuss five natural whitening procedures. Subsequently, we demonstrate that investigating the cross-covariance and the cross-correlation matrix between sphered and original variables allows to break the rotational invariance and to identify optimal whitening transformations. As a result we recommend two particular approaches: ZCA-cor whitening to produce sphered variables that are maximally similar to the original variables, and PCA-cor whitening to obtain sphered variables that maximally compress the original variables.
Original language | English |
---|---|
Pages (from-to) | 309-314 |
Number of pages | 6 |
Journal | The American Statistician |
Volume | 72 |
Issue number | 4 |
Early online date | 19 Jan 2017 |
DOIs | |
Publication status | Published - 2 Oct 2018 |
Keywords
- CAR score
- CAT score
- Cholesky decomposition
- Decorrelation
- Principal components analysis
- Whitening
- ZCA-Mahalanobis transformation
Fingerprint
Dive into the research topics of 'Optimal Whitening and Decorrelation'. Together they form a unique fingerprint.Datasets
-
Optimal Whitening and Decorrelation
Kessy, A. (Creator), Lewin, A. (Creator) & Strimmer, K. (Creator), figshare , 19 Jan 2017
DOI: 10.6084/m9.figshare.4568002
Dataset