Evaluation of integrative clustering methods for the analysis of multi-omics data

被引:55
|
作者
Chauvel, Cecile [1 ]
Novoloaca, Alexei [2 ]
Veyre, Pierre [3 ]
Reynier, Frederic [4 ]
Becker, Jeremie [5 ]
机构
[1] Bioaster, Data Management & Anal Unit, Biostat, Lyon, France
[2] World Hlth Org, Int Agcy Res Canc, Epigenet Grp, Biostat, Lyon, France
[3] Bioaster, Data Management & Anal Unit, Lyon, France
[4] Bioaster, Genom & Transcript, Lyon, France
[5] Bioaster, Genom & Transcript Unit, Biostat, Lyon, France
关键词
benchmark; clustering; data integration; multi-omics; unsupervised analysis; BREAST; JOINT; CLASSIFICATION; EXPRESSION; CRITERIA; MODULES;
D O I
10.1093/bib/bbz015
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in sequencing, mass spectrometry and cytometry technologies have enabled researchers to collect large-scale omics data from the same set of biological samples. The joint analysis of multiple omics offers the opportunity to uncover coordinated cellular processes acting across different omic layers. In this work, we present a thorough comparison of a selection of recent integrative clustering approaches, including Bayesian (BCC and MDI) and matrix factorization approaches (iCluster, moCluster, JIVE and iNMF). Based on simulations, the methods were evaluated on their sensitivity and their ability to recover both the correct number of clusters and the simulated clustering at the common and data-specific levels. Standard non-integrative approaches were also included to quantify the added value of integrative methods. For most matrix factorization methods and one Bayesian approach (BCC), the shared and specific structures were successfully recovered with high and moderate accuracy, respectively. An opposite behavior was observed on non-integrative approaches, i.e. high performances on specific structures only. Finally, we applied the methods on the Cancer Genome Atlas breast cancer data set to check whether results based on experimental data were consistent with those obtained in the simulations.
引用
收藏
页码:541 / 552
页数:12
相关论文
共 50 条
  • [41] Integrative Hypergraph Regularization Principal Component Analysis for Sample Clustering and Co-Expression Genes Network Analysis on Multi-Omics Data
    Wu, Ming-Juan
    Gao, Ying-Lian
    Liu, Jin-Xing
    Zheng, Chun-Hou
    Wang, Juan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (06) : 1823 - 1834
  • [42] MOPA: An integrative multi-omics pathway analysis method for measuring omics activity
    Jeon, Jaemin
    Han, Eon Yong
    Jung, Inuk
    PLOS ONE, 2023, 18 (03):
  • [43] Multi-Omics Data Fusion for Cancer Molecular Subtyping Using Sparse Canonical Correlation Analysis
    Qi, Lin
    Wang, Wei
    Wu, Tan
    Zhu, Lina
    He, Lingli
    Wang, Xin
    FRONTIERS IN GENETICS, 2021, 12
  • [44] Integrative Analysis of Multi-Omics Identified the Prognostic Biomarkers in Acute Myelogenous Leukemia
    Zheng, Jiafeng
    Zhang, Tongqiang
    Guo, Wei
    Zhou, Caili
    Cui, Xiaojian
    Gao, Long
    Cai, Chunquan
    Xu, Yongsheng
    FRONTIERS IN ONCOLOGY, 2020, 10
  • [45] Integrative Multi-omics Analysis to Characterize Human Brain Ischemia
    Ramiro, Laura
    Garcia-Berrocoso, Teresa
    Brianso, Ferran
    Goicoechea, Leire
    Simats, Alba
    Llombart, Victor
    Gonzalo, Ricardo
    Hainard, Alexandre
    Martinez-Saez, Elena
    Canals, Francesc
    Sanchez, Jean-Charles
    Sanchez-Pla, Alex
    Montaner, Joan
    MOLECULAR NEUROBIOLOGY, 2021, 58 (08) : 4107 - 4121
  • [46] Consistency and overfitting of multi-omics methods on experimental data
    McCabe, Sean D.
    Lin, Dan-Yu
    Love, Michael, I
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1277 - 1284
  • [47] PIntMF: Penalized Integrative Matrix Factorization method for multi-omics data
    Pierre-Jean, Morgane
    Mauger, Florence
    Deleuze, Jean-Francois
    Le Floch, Edith
    BIOINFORMATICS, 2022, 38 (04) : 900 - 907
  • [48] Multi-omics analysis in developmental bone biology
    Matsushita, Yuki
    Noguchi, Azumi
    Ono, Wanida
    Ono, Noriaki
    JAPANESE DENTAL SCIENCE REVIEW, 2023, 59 : 412 - 420
  • [49] Integration of multi-omics data for integrative gene regulatory network inference
    Zarayeneh, Neda
    Ko, Euiseong
    Oh, Jung Hun
    Suh, Sang
    Liu, Chunyu
    Gao, Jean
    Kim, Donghyun
    Kang, Mingon
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 18 (03) : 223 - 239
  • [50] Integrated Multi-Omics Analyses in Oncology: A Review of Machine Learning Methods and Tools
    Nicora, Giovanna
    Vitali, Francesca
    Dagliati, Arianna
    Geifman, Nophar
    Bellazzi, Riccardo
    FRONTIERS IN ONCOLOGY, 2020, 10