Optimal Whitening and Decorrelation

被引:246
作者
Kessy, Agnan [1 ]
Lewin, Alex [2 ]
Strimmer, Korbinian [3 ]
机构
[1] Imperial Coll London, Dept Math, Stat Sect, South Kensington Campus, London, England
[2] Brunel Univ London, Dept Math, Kingstone Lane, Uxbridge, Middx, England
[3] Imperial Coll London, Sch Publ Hlth, Epidemiol & Biostat, Norfolk Pl, London W2 1PG, England
关键词
CAR score; CAT score; Cholesky decomposition; Decorrelation; Principal components analysis; Whitening; ZCA-Mahalanobis transformation;
D O I
10.1080/00031305.2016.1277159
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Whitening, or sphering, is a common preprocessing step in statistical analysis to transform random variables to orthogonality. However, due to rotational freedom there are infinitely many possible whitening procedures. Consequently, there is a diverse range of sphering methods in use, for example, based on principal component analysis (PCA), Cholesky matrix decomposition, and zero-phase component analysis (ZCA), among others. Here, we provide an overview of the underlying theory and discuss five natural whitening procedures. Subsequently, we demonstrate that investigating the cross-covariance and the cross-correlation matrix between sphered and original variables allows to break the rotational invariance and to identify optimal whitening transformations. As a result we recommend two particular approaches: ZCA-cor whitening to produce sphered variables that are maximally similar to the original variables, and PCA-cor whitening to obtain sphered variables that maximally compress the original variables.
引用
收藏
页码:309 / 314
页数:6
相关论文
共 16 条
[1]   FEATURE SELECTION IN OMICS PREDICTION PROBLEMS USING CAT SCORES AND FALSE NONDISCOVERY RATE CONTROL [J].
Ahdesmaeki, Miika ;
Strimmer, Korbinian .
ANNALS OF APPLIED STATISTICS, 2010, 4 (01) :503-519
[2]  
[Anonymous], 2002, Principal components analysis
[3]   The ''independent components'' of natural scenes are edge filters [J].
Bell, AJ ;
Sejnowski, TJ .
VISION RESEARCH, 1997, 37 (23) :3327-3338
[4]   MMSE whitening and subspace whitening [J].
Eldar, YC ;
Oppenheim, AV .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2003, 49 (07) :1846-1851
[5]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[6]   EXPLORATORY PROJECTION PURSUIT [J].
FRIEDMAN, JH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (397) :249-266
[7]   Orthogonalization of vectors with minimal adjustment [J].
Garthwaite, Paul H. ;
Critchley, Frank ;
Anaya-Izquierdo, Karim ;
Mubwandarikwa, Emmanuel .
BIOMETRIKA, 2012, 99 (04) :787-798
[8]  
GENIZI A, 1993, STAT SINICA, V3, P407
[9]   Sparsifying the Fisher linear discriminant by rotation [J].
Hao, Ning ;
Dong, Bin ;
Fan, Jianqing .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2015, 77 (04) :827-851
[10]   MINIMAL TRANSFORMATION TO ORTHONORMALITY [J].
JOHNSON, RM .
PSYCHOMETRIKA, 1966, 31 (01) :61-61