Data integration through canonical correlation analysis and its application to OMICs research

被引:0
作者
Wrobel, Sonia [1 ]
Turek, Cezary [2 ]
Stepien, Ewa [1 ,3 ,4 ]
Piwowar, Monika [2 ]
机构
[1] Jagiellonian Univ, Marian Smoluchowski Inst Phys, Dept Med Phys, Krakow, Poland
[2] Jagiellonian Univ, Med Coll, Dept Bioinformat & Telemed, Krakow, Poland
[3] Jagiellonian Univ, Ctr Theranost, Ul Kopern 40, PL-31034 Krakow, Poland
[4] Jagiellonian Univ, Total Body Jagiellonian PET Lab, Krakow, Poland
关键词
Biomarkers; Canonical correlation analysis; Bioinformatics; Cancer; Omics; ASSOCIATION;
D O I
10.1016/j.jbi.2023.104575
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The subject of the paper is a review of multidimensional data analysis methods, which is the canonical analysis with its various variants and its use in omics data research. The dynamic development of high-throughput methods, and with them the availability of large and constantly growing data resources, forces the development of new analytical approaches that allow the review of the analyzed processes, taking into account data from various levels of the organization of living organisms. The multidimensional perspective allows for the assessment of the analyzed phenomenon in a more realistic way, as it generally takes into account much more data (including OMICs data). Without omitting the complexity of an organism, the method simplifies the multidimensional view, finally giving the result so that the researcher can draw practical conclusions. This is particularly important in medical sciences, where the study of pathological processes is usually aimed at developing treatment regimens. One of the primary methods for studying biomedical processes in a multidimensional approach is the canonical correlation analysis (CCA) with various variants. The use of CCA unique methodologies for simultaneous analysis of multiset biomolecular data opens up new avenues for studying previously undiscovered processes and interdependencies such as e.g. in the tumor microenvironment (TME) connected to intercellular communication. Because of the huge and still untapped potential of canonical correlation, in this review available implementations of CCA techniques are presented. In particular, the possibility of using the technique of canonical correlation analysis for OMICs data is emphasized.
引用
收藏
页数:7
相关论文
共 60 条
  • [1] Abraham G., 2016, FlashPCA: fast sparse canonical correlation analysis of genomic data
  • [2] Computational strategies for single-cell multi-omics integration
    Adossa, Nigatu
    Khan, Sofia
    Rytkonen, Kalle T.
    Elo, Laura L.
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 2588 - 2596
  • [3] Conditional canonical correlation estimation based on covariates with random forests
    Alakus, Cansu
    Larocque, Denis
    Jacquemont, Sebastien
    Barlaam, Fanny
    Martin, Charles-Olivier
    Agbogba, Kristian
    Lippe, Sarah
    Labbe, Aurelie
    [J]. BIOINFORMATICS, 2021, 37 (17) : 2714 - 2721
  • [4] A global reference for human genetic variation
    Altshuler, David M.
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Donnelly, Peter
    Eichler, Evan E.
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Green, Eric D.
    Hurles, Matthew E.
    Knoppers, Bartha M.
    Korbel, Jan O.
    Lander, Eric S.
    Lee, Charles
    Lehrach, Hans
    Mardis, Elaine R.
    Marth, Gabor T.
    McVean, Gil A.
    Nickerson, Deborah A.
    Wang, Jun
    Wilson, Richard K.
    Boerwinkle, Eric
    Doddapaneni, Harsha
    Han, Yi
    Korchina, Viktoriya
    Kovar, Christie
    Lee, Sandra
    Muzny, Donna
    Reid, Jeffrey G.
    Zhu, Yiming
    Chang, Yuqi
    Feng, Qiang
    Fang, Xiaodong
    Guo, Xiaosen
    Jian, Min
    Jiang, Hui
    Jin, Xin
    Lan, Tianming
    Li, Guoqing
    Li, Jingxiang
    Li, Yingrui
    Liu, Shengmao
    Liu, Xiao
    Lu, Yao
    Ma, Xuedi
    Tang, Meifang
    Wang, Bo
    [J]. NATURE, 2015, 526 (7571) : 68 - +
  • [5] Andrew G., 2013, Proc. Mach. Learni. Res., V28
  • [6] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
  • [7] Benton A, 2019, 4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), P1
  • [8] Pyrcca: Regularized Kernel Canonical Correlation Analysis in Python']Python and Its Application to Neuroimaging
    Bilenko, Natalia Y.
    Gallant, Jack L.
    [J]. FRONTIERS IN NEUROINFORMATICS, 2016, 10
  • [9] Sparse canonical correlation analysis applied to -omics studies for integrative analysis and biomarker discovery
    Cao, Dong-Sheng
    Liu, Shao
    Zeng, Wen-Bin
    Liang, Yi-Zeng
    [J]. JOURNAL OF CHEMOMETRICS, 2015, 29 (06) : 371 - 378
  • [10] CC-PROMISE effectively integrates two forms of molecular data with multiple biologically related endpoints
    Cao, Xueyuan
    Crews, Kristine R.
    Downing, James
    Lamba, Jatinder
    Pounds, Stanley B.
    [J]. BMC BIOINFORMATICS, 2016, 17