Evaluation of integrative clustering methods for the analysis of multi-omics data

被引:55
|
作者
Chauvel, Cecile [1 ]
Novoloaca, Alexei [2 ]
Veyre, Pierre [3 ]
Reynier, Frederic [4 ]
Becker, Jeremie [5 ]
机构
[1] Bioaster, Data Management & Anal Unit, Biostat, Lyon, France
[2] World Hlth Org, Int Agcy Res Canc, Epigenet Grp, Biostat, Lyon, France
[3] Bioaster, Data Management & Anal Unit, Lyon, France
[4] Bioaster, Genom & Transcript, Lyon, France
[5] Bioaster, Genom & Transcript Unit, Biostat, Lyon, France
关键词
benchmark; clustering; data integration; multi-omics; unsupervised analysis; BREAST; JOINT; CLASSIFICATION; EXPRESSION; CRITERIA; MODULES;
D O I
10.1093/bib/bbz015
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in sequencing, mass spectrometry and cytometry technologies have enabled researchers to collect large-scale omics data from the same set of biological samples. The joint analysis of multiple omics offers the opportunity to uncover coordinated cellular processes acting across different omic layers. In this work, we present a thorough comparison of a selection of recent integrative clustering approaches, including Bayesian (BCC and MDI) and matrix factorization approaches (iCluster, moCluster, JIVE and iNMF). Based on simulations, the methods were evaluated on their sensitivity and their ability to recover both the correct number of clusters and the simulated clustering at the common and data-specific levels. Standard non-integrative approaches were also included to quantify the added value of integrative methods. For most matrix factorization methods and one Bayesian approach (BCC), the shared and specific structures were successfully recovered with high and moderate accuracy, respectively. An opposite behavior was observed on non-integrative approaches, i.e. high performances on specific structures only. Finally, we applied the methods on the Cancer Genome Atlas breast cancer data set to check whether results based on experimental data were consistent with those obtained in the simulations.
引用
收藏
页码:541 / 552
页数:12
相关论文
共 50 条
  • [1] Integrative clustering methods for multi-omics data
    Zhang, Xiaoyu
    Zhou, Zhenwei
    Xu, Hanfei
    Liu, Ching-Ti
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (03)
  • [2] Comparative analysis of integrative classification methods for multi-omics data
    Novoloaca, Alexei
    Broc, Camilo
    Beloeil, Laurent
    Yu, Wen-Han
    Becker, Jeremie
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (04)
  • [3] Benchmarking multi-omics integrative clustering methods for subtype identification in colorectal cancer
    Zhang, Shuai
    Lv, Jiali
    Zhang, Jinglan
    Fan, Zhe
    Gu, Bingbing
    Fan, Bingbing
    Li, Chunxia
    Wang, Cheng
    Zhang, Tao
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 261
  • [4] Knowledge-guided learning methods for integrative analysis of multi-omics data
    Li, Wenrui
    Ballard, Jenna
    Zhao, Yize
    Long, Qi
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 1945 - 1950
  • [5] Representation Learning for the Clustering of Multi-Omics Data
    Viaud, Gautier
    Mayilvahanan, Prasanna
    Cournede, Paul-Henry
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 135 - 145
  • [6] Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration
    Pierre-Jean, Morgane
    Deleuze, Jean-Francois
    Le Floch, Edith
    Mauger, Florence
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (06) : 2011 - 2030
  • [7] Sliced inverse regression for integrative multi-omics data analysis
    Jain, Yashita
    Ding, Shanshan
    Qiu, Jing
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2019, 18 (01)
  • [8] Evaluation and comparison of multi-omics data integration methods for cancer subtyping
    Duan, Ran
    Gao, Lin
    Gao, Yong
    Hu, Yuxuan
    Xu, Han
    Huang, Mingfeng
    Song, Kuo
    Wang, Hongda
    Dong, Yongqiang
    Jiang, Chaoqun
    Zhang, Chenxing
    Jia, Songwei
    PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (08)
  • [9] A pan-cancer integrative pathway analysis of multi-omics data
    Linder, Henry
    Zhang, Yuping
    QUANTITATIVE BIOLOGY, 2020, 8 (02) : 130 - 142
  • [10] Methods for the integration of multi-omics data: mathematical aspects
    Bersanelli, Matteo
    Mosca, Ettore
    Remondini, Daniel
    Giampieri, Enrico
    Sala, Claudia
    Castellani, Gastone
    Milanesi, Luciano
    BMC BIOINFORMATICS, 2016, 17