Adaptive Sparse Multiple Canonical Correlation Analysis With Application to Imaging (Epi)Genomics Study of Schizophrenia

被引:42
|
作者
Hu, Wenxing [1 ]
Lin, Dongdong [2 ,3 ]
Cao, Shaolong [4 ]
Liu, Jingyu [2 ,3 ]
Chen, Jiayu [2 ,3 ]
Calhoun, Vince D. [2 ,3 ]
Wang, Yu-Ping [1 ]
机构
[1] Tulane Univ, Dept Biomed Engn, New Orleans, LA 70118 USA
[2] Univ New Mexico, Mind Res Network, Albuquerque, NM 87131 USA
[3] Univ New Mexico, Dept Elect & Comp Engn, Albuquerque, NM 87131 USA
[4] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Imaging genomics; genomic data analysis; canonical correlation analysis; multi-omics data integration; data fusion; GENE-EXPRESSION; ASSOCIATION; DECOMPOSITION; JOINT;
D O I
10.1109/TBME.2017.2771483
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Finding correlations across multiple data sets in imaging and (epi)genomics is a common challenge. Sparse multiple canonical correlation analysis (SMCCA) is a multivariate model widely used to extract contributing features from each data while maximizing the cross-modality correlation. The model is achieved by using the combination of pairwise covariances between any two data sets. However, the scales of different pairwise covariances could be quite different and the direct combination of pairwise covariances in SMCCA is unfair. The problem of "unfair combination of pairwise covariances" restricts the power of SMCCA for feature selection. In this paper, we propose a novel formulation of SMCCA, called adaptive SMCCA, to overcome the problem by introducing adaptive weights when combining pairwise covariances. Both simulation and real-data analysis show the out-performance of adaptive SMCCA in terms of feature selection over conventional SMCCA and SMCCA with fixed weights. Large-scale numerical experiments show that adaptive SMCCA converges as fast as conventional SMCCA. When applying it to imaging (epi) genetics study of schizophrenia subjects, we can detect significant (epi) genetic variants and brain regions, which are consistent with other existing reports. In addition, several significant brain-development related pathways, e.g., neural tube development, are detected by our model, demonstrating imaging epigenetic association may be overlooked by conventional SMCCA. All these results demonstrate that adaptive SMCCA are well suited for detecting three-way or multiway correlations and thus can find widespread applications in multiple omics and imaging data integration.
引用
收藏
页码:390 / 399
页数:10
相关论文
共 50 条
  • [1] Integration of Imaging (epi)Genomics Data for the Study of Schizophrenia Using Group Sparse Joint Nonnegative Matrix Factorization
    Wang, Min
    Huang, Ting-Zhu
    Fang, Jian
    Calhoun, Vince D.
    Wang, Yu-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (05) : 1671 - 1681
  • [2] IMAGING GENETICS VIA SPARSE CANONICAL CORRELATION ANALYSIS
    Chi, Eric C.
    Allen, Genevera I.
    Zhou, Hua
    Kohannim, Omid
    Lange, Kenneth
    Thompson, Paul M.
    2013 IEEE 10TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2013, : 740 - 743
  • [3] Robust sparse canonical correlation analysis
    Wilms, Ines
    Croux, Christophe
    BMC SYSTEMS BIOLOGY, 2016, 10
  • [4] Sparse canonical correlation analysis
    David R. Hardoon
    John Shawe-Taylor
    Machine Learning, 2011, 83 : 331 - 353
  • [5] Sparse canonical correlation analysis
    Hardoon, David R.
    Shawe-Taylor, John
    MACHINE LEARNING, 2011, 83 (03) : 331 - 353
  • [6] Sparse Canonical Correlation Analysis with Application to Genomic Data Integration
    Parkhomenko, Elena
    Tritchler, David
    Beyene, Joseph
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [7] The group sparse canonical correlation analysis method in the imaging genetics research
    Wu, Jie
    Xu, Jiawei
    Chen, Wei
    Sun, Deyan
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2554 - 2557
  • [8] Sparse additive discriminant canonical correlation analysis for multiple features fusion
    Wang, Zhan
    Wang, Lizhi
    Huang, Hua
    NEUROCOMPUTING, 2021, 463 : 185 - 197
  • [9] Simultaneous Analysis of Multiple Data Types in Pharmacogenomic Studies Using Weighted Sparse Canonical Correlation Analysis
    Chalise, Prabhakar
    Batzler, Anthony
    Abo, Ryan
    Wang, Liewei
    Fridley, Brooke L.
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2012, 16 (7-8) : 363 - 373
  • [10] Sparse canonical correlation analysis from a predictive point of view
    Wilms, Ines
    Croux, Christophe
    BIOMETRICAL JOURNAL, 2015, 57 (05) : 834 - 851