Bayesian Canonical Correlation Analysis

被引:0
作者
Klami, Arto [1 ]
Virtanen, Seppo [1 ]
Kaski, Samuel [1 ,2 ]
机构
[1] Aalto Univ, Dept Informat & Comp Sci, Helsinki Inst Informat Technol HIIT, Aalto 00076, Finland
[2] Univ Helsinki, Dept Comp Sci, Helsinki Inst Informat Technol HIIT, FIN-00014 Helsinki, Finland
基金
芬兰科学院;
关键词
Bayesian modeling; canonical correlation analysis; group-wise sparsity; inter-battery factor analysis; variational Bayesian approximation; DEPENDENCIES; MODELS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Canonical correlation analysis (CCA) is a classical method for seeking correlations between two multivariate data sets. During the last ten years, it has received more and more attention in the machine learning community in the form of novel computational formulations and a plethora of applications. We review recent developments in Bayesian models and inference methods for CCA which are attractive for their potential in hierarchical extensions and for coping with the combination of large dimensionalities and small sample sizes. The existing methods have not been particularly successful in fulfilling the promise yet; we introduce a novel efficient solution that imposes group-wise sparsity to estimate the posterior of an extended model which not only extracts the statistical dependencies (correlations) between data sets but also decomposes the data into shared and data set-specific components. In statistics literature the model is known as inter-battery factor analysis (IBFA), for which we now provide a Bayesian treatment.
引用
收藏
页码:965 / 1003
页数:39
相关论文
共 67 条
[21]  
Guan Y., 2009, J MACHINE LEARNING R, P185
[22]  
Haghighi Aria, 2008, P ANN M ASS COMP LIN, P771
[23]   Canonical correlation analysis: An overview with application to learning methods [J].
Hardoon, DR ;
Szedmak, S ;
Shawe-Taylor, J .
NEURAL COMPUTATION, 2004, 16 (12) :2639-2664
[24]   Relations between two sets of variates [J].
Hotelling, H .
BIOMETRIKA, 1936, 28 :321-377
[25]   Nonlinear canonical correlation analysis by neural networks [J].
Hsieh, WW .
NEURAL NETWORKS, 2000, 13 (10) :1095-1105
[26]   Multivariate multi-way analysis of multi-source data [J].
Huopaniemi, Ilkka ;
Suvitaival, Tommi ;
Nikkila, Janne ;
Oresic, Matej ;
Kaski, Samuel .
BIOINFORMATICS, 2010, 26 (12) :i391-i398
[27]   Two-way analysis of high-dimensional collinear data [J].
Huopaniemi, Ilkka ;
Suvitaival, Tommi ;
Nikkila, Janne ;
Oresic, Matej ;
Kaski, Samuel .
DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 19 (02) :261-276
[28]  
Ilin A, 2010, J MACH LEARN RES, V11, P1957
[29]  
Ji S., 2008, P 14 ACM SIGKDD INT, P381, DOI [DOI 10.1145/1401890.1401939, 10.1145/1401890.1401939]
[30]  
Jia Y., 2010, Adv. Neural Inf. Process. Syst., V23, P982