A Tutorial on Canonical Correlation Methods

被引:72
作者
Uurtio, Viivi [1 ]
Monteiro, Joao M. [2 ,3 ]
Kandola, Jaz [4 ]
Shawe-Taylor, John [2 ]
Fernandez-Reyes, Delmiro [2 ]
Rousu, Juho [1 ]
机构
[1] Aalto Univ, Helsinki Inst Informat Technol HIIT, Dept Comp Sci, Konemiehentie 2, Espoo 02150, Finland
[2] UCL, Dept Comp Sci, Gower St, London WC1E 6BT, England
[3] UCL, Max Planck Ctr Computat Psychiat Ageing Res, Gower St, London WC1E 6BT, England
[4] Imperial Coll London, Div Brain Sci, DuCane Rd, London WC12 0NN, England
基金
芬兰科学院; 英国工程与自然科学研究理事会;
关键词
Canonical correlation; regularisation; kernel methods; sparsity; GENOME-WIDE ASSOCIATION; DIMENSIONALITY REDUCTION; 2; SETS; STATISTICS; REGRESSION; SUBSPACES; FRAMEWORK;
D O I
10.1145/3136624
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Canonical correlation analysis is a family of multivariate statistical methods for the analysis of paired sets of variables. Since its proposition, canonical correlation analysis has, for instance, been extended to extract relations between two sets of variables when the sample size is insufficient in relation to the data dimensionality, when the relations have been considered to be non-linear, and when the dimensionality is too large for human interpretation. This tutorial explains the theory of canonical correlation analysis, including its regularised, kernel, and sparse variants. Additionally, the deep and Bayesian CCA extensions are briefly reviewed. Together with the numerical examples, this overview provides a coherent compendium on the applicability of the variants of canonical correlation analysis. By bringing together techniques for solving the optimisation problems, evaluating the statistical significance and generalisability of the canonical correlation model, and interpreting the relations, we hope that this article can serve as a hands-on tool for applying canonical correlation methods in data analysis.
引用
收藏
页数:33
相关论文
共 119 条
[1]  
Akaho Shotaro, 2001, P INT M PSYCH SOC IM
[2]  
Alam Ashad, 2008, 2008 11th International Conference on Computer and Information Technology (ICCIT), P399, DOI 10.1109/ICCITECHN.2008.4802966
[3]  
Anderson T., 2003, An introduction to statistical multivariate analysis
[4]  
Andrienko G., 2013, Introduction, P1
[5]  
[Anonymous], 2013, P 30 INT C MACHINE L
[6]  
[Anonymous], 2007, P 24 INT C MACHINE L, DOI DOI 10.1145/1273496.1273550
[7]  
[Anonymous], BMC P
[8]  
[Anonymous], 2005, PROBABILISTIC INTERP
[9]  
[Anonymous], 2010, 27 INT C MACH LEARN
[10]  
[Anonymous], 2004, KERNEL METHODS PATTE