Multi-omics integration-a comparison of unsupervised clustering methodologies

Cited by: 87
Authors
Tini, Giulia [1, 2]
Marchetti, Luca [3, 4]
Priami, Corrado [5]
Scott-Boyer, Marie-Pier
Affiliations
[1] Univ Trento, Math, Trento, Italy
[2] COSBI, Trento, Italy
[3] Univ Verona, Verona, Italy
[4] COSBI, Computat Biol Team, Trento, Italy
[5] Univ Trento, Comp Sci, Trento, Italy
Keywords
molecular-level interaction; biological systems; unsupervised classification; data preprocessing; JOINT; DISCOVERY; MODULES; BREAST; ONPLS;
DOI
10.1093/bib/bbx167
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
With the recent developments in the field of multi-omics integration, interest in factors such as data preprocessing, the choice of integration method and the number of different omics considered has increased. In this work, the impact of these factors is explored for the problem of sample classification by comparing the performance of five unsupervised algorithms: Multiple Canonical Correlation Analysis, Multiple Co-Inertia Analysis, Multiple Factor Analysis, Joint and Individual Variation Explained and Similarity Network Fusion. These methods were applied to three real data sets taken from the literature and to several ad hoc simulated scenarios to discuss classification performance under different conditions of noise and signal strength across the data types. The impact of experimental design, feature selection and parameter training has also been evaluated to identify conditions that can affect the accuracy of the results.
Pages: 1269-1279
Number of pages: 11
Related papers
50 in total
  • [1] Evaluation and comparison of multi-omics data integration methods for cancer subtyping
    Duan, Ran
    Gao, Lin
    Gao, Yong
    Hu, Yuxuan
    Xu, Han
    Huang, Mingfeng
    Song, Kuo
    Wang, Hongda
    Dong, Yongqiang
    Jiang, Chaoqun
    Zhang, Chenxing
    Jia, Songwei
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (08)
  • [2] A survey on data integration for multi-omics sample clustering
    Lovino, Marta
    Randazzo, Vincenzo
    Ciravegna, Gabriele
    Barbiero, Pietro
    Ficarra, Elisa
    Cirrincione, Giansalvo
    NEUROCOMPUTING, 2022, 488 : 494 - 508
  • [3] Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration
    Pierre-Jean, Morgane
    Deleuze, Jean-Francois
    Le Floch, Edith
    Mauger, Florence
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (06) : 2011 - 2030
  • [4] Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets
    Argelaguet, Ricard
    Velten, Britta
    Arnol, Damien
    Dietrich, Sascha
    Zenz, Thorsten
    Marioni, John C.
    Buettner, Florian
    Huber, Wolfgang
    Stegle, Oliver
    MOLECULAR SYSTEMS BIOLOGY, 2018, 14 (06)
  • [5] Integrative clustering methods for multi-omics data
    Zhang, Xiaoyu
    Zhou, Zhenwei
    Xu, Hanfei
    Liu, Ching-Ti
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (03)
  • [6] Unsupervised Multi-Omics Data Integration Methods: A Comprehensive Review
    Vahabi, Nasim
    Michailidis, George
    FRONTIERS IN GENETICS, 2022, 13
  • [7] Multi-omics Data Integration, Interpretation, and Its Application
    Subramanian, Indhupriya
    Verma, Srikant
    Kumar, Shiva
    Jere, Abhay
    Anamika, Krishanpal
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2020, 14
  • [8] Machine learning for multi-omics data integration in cancer
    Cai, Zhaoxiang
    Poulos, Rebecca C.
    Liu, Jia
    Zhong, Qing
    ISCIENCE, 2022, 25 (02)
  • [9] Evaluation of integrative clustering methods for the analysis of multi-omics data
    Chauvel, Cecile
    Novoloaca, Alexei
    Veyre, Pierre
    Reynier, Frederic
    Becker, Jeremie
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (02) : 541 - 552
  • [10] DeFusion: a denoised network regularization framework for multi-omics integration
    Wang, Weiwen
    Zhang, Xiwen
    Dai, Dao-Qing
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)