Multi-omics integration-a comparison of unsupervised clustering methodologies

被引:87
作者
Tini, Giulia [1 ,2 ]
Marchetti, Luca [3 ,4 ]
Priami, Corrado [5 ]
Scott-Boyer, Marie-Pier
机构
[1] Univ Trento, Math, Trento, Italy
[2] COSBI, Trento, Italy
[3] Univ Verona, Verona, Italy
[4] COSBI, Computat Biol Team, Trento, Italy
[5] Univ Trento, Comp Sci, Trento, Italy
关键词
molecular-level interaction; biological systems; unsupervised classification; data preprocessing; JOINT; DISCOVERY; MODULES; BREAST; ONPLS;
D O I
10.1093/bib/bbx167
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the recent developments in the field of multi-omics integration, the interest in factors such as data preprocessing, choice of the integration method and the number of different omics considered had increased. In this work, the impact of these factors is explored when solving the problem of sample classification, by comparing the performances of five unsupervised algorithms: Multiple Canonical Correlation Analysis, Multiple Co-Inertia Analysis, Multiple Factor Analysis, Joint and Individual Variation Explained and Similarity Network Fusion. These methods were applied to three real data sets taken from literature and several ad hoc simulated scenarios to discuss classification performance in different conditions of noise and signal strength across the data types. The impact of experimental design, feature selection and parameter training has been also evaluated to unravel important conditions that can affect the accuracy of the result.
引用
收藏
页码:1269 / 1279
页数:11
相关论文
共 50 条
  • [21] SUBATOMIC: a SUbgraph BAsed mulTi-OMIcs clustering framework to analyze integrated multi-edge networks
    Loers, Jens Uwe
    Vermeirssen, Vanessa
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [22] MCNF: A Novel Method for Cancer Subtyping by Integrating Multi-Omics and Clinical Data
    Zhao, Lan
    Yan, Hong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (05) : 1682 - 1690
  • [23] Interactive gene identification for cancer subtyping based on multi-omics clustering
    Ye, Xiucai
    Shi, Tianyi
    Cui, Yaxuan
    Sakurai, Tetsuya
    METHODS, 2023, 211 : 61 - 67
  • [24] Network-based integration of multi-omics data for clinical outcome prediction in neuroblastoma
    Wang, Conghao
    Lue, Wu
    Kaalia, Rama
    Kumar, Parvin
    Rajapakse, Jagath C.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [25] MODIMO: Workshop on Multi-Omics Data Integration for Modelling Biological Systems
    Beccuti, Marco
    Bonnici, Vincenzo
    Giugno, Rosalba
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4870 - 4871
  • [26] Biological Random Walks: multi-omics integration for disease gene prioritization
    Gentili, Michele
    Martini, Leonardo
    Sponziello, Marialuisa
    Becchetti, Luca
    BIOINFORMATICS, 2022, 38 (17) : 4145 - 4152
  • [27] An extension of latent unknown clustering integrating multi-omics data (LUCID) incorporating incomplete omics data
    Zhao, Yinqi
    Jia, Qiran
    Goodrich, Jesse
    Darst, Burcu
    Conti, David, V
    BIOINFORMATICS ADVANCES, 2024, 4 (01):
  • [28] A latent unknown clustering integrating multi-omics data (LUCID) with phenotypic traits
    Peng, Cheng
    Wang, Jun
    Asante, Isaac
    Louie, Stan
    Jin, Ran
    Chatzi, Lida
    Casey, Graham
    Thomas, Duncan C.
    Conti, David, V
    BIOINFORMATICS, 2020, 36 (03) : 842 - 850
  • [29] Benchmarking multi-omics integrative clustering methods for subtype identification in colorectal cancer
    Zhang, Shuai
    Lv, Jiali
    Zhang, Jinglan
    Fan, Zhe
    Gu, Bingbing
    Fan, Bingbing
    Li, Chunxia
    Wang, Cheng
    Zhang, Tao
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 261
  • [30] Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data
    Min, Eun Jeong
    Chang, Changgee
    Long, Qi
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 109 - 119