Ten quick tips for avoiding pitfalls in multi-omics data integration analyses

被引:11
|
作者
Chicco, Davide [1 ]
Cumbo, Fabio [2 ]
Angione, Claudio [3 ]
机构
[1] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[2] Cleveland Clin, Genom Med Inst, Lerner Res Inst, Cleveland, OH USA
[3] Teesside Univ, Sch Comp Engn & Digital Technol, Middlesbrough, England
关键词
STANDARDS;
D O I
10.1371/journal.pcbi.1011224
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Data are the most important elements of bioinformatics: Computational analysis of bioinformatics data, in fact, can help researchers infer new knowledge about biology, chemistry, biophysics, and sometimes even medicine, influencing treatments and therapies for patients. Bioinformatics and high-throughput biological data coming from different sources can even be more helpful, because each of these different data chunks can provide alternative, complementary information about a specific biological phenomenon, similar to multiple photos of the same subject taken from different angles. In this context, the integration of bioinformatics and high-throughput biological data gets a pivotal role in running a successful bioinformatics study. In the last decades, data originating from proteomics, metabolomics, metagenomics, phenomics, transcriptomics, and epigenomics have been labelled -omics data, as a unique name to refer to them, and the integration of these omics data has gained importance in all biological areas. Even if this omics data integration is useful and relevant, due to its heterogeneity, it is not uncommon to make mistakes during the integration phases. We therefore decided to present these ten quick tips to perform an omics data integration correctly, avoiding common mistakes we experienced or noticed in published studies in the past. Even if we designed our ten guidelines for beginners, by using a simple language that (we hope) can be understood by anyone, we believe our ten recommendations should be taken into account by all the bioinformaticians performing omics data integration, including experts.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Integration strategies of multi-omics data for machine learning analysis
    Picard, Milan
    Scott-Boyer, Marie -Pier
    Bodein, Antoine
    Perin, Olivier
    Droit, Arnaud
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3735 - 3746
  • [42] Directional integration and pathway enrichment analysis for multi-omics data
    Slobodyanyuk, Mykhaylo
    Bahcheli, Alexander T.
    Klein, Zoe P.
    Bayati, Masroor
    Strug, Lisa J.
    Reimand, Juri
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [43] Multi-omics data integration for the identification of biomarkers for bull fertility
    Costes, Valentin
    Sellem, Eli
    Marthey, Sylvain
    Hoze, Chris
    Bonnet, Aurelie
    Schibler, Laurent
    Kiefer, Helene
    Jaffrezic, Florence
    PLOS ONE, 2024, 19 (02):
  • [44] Statistical Challenges in Multi-omics Integration
    Sebastiani, Paola
    Leshchk, Anastasia
    Song, Zeyuan
    Karagiannis, Tanya T.
    Gurinovich, Anastasia
    Bae, Harold
    Li, Mengze
    Monti, Stefano
    GENETIC EPIDEMIOLOGY, 2022, 46 (07) : 530 - 530
  • [45] Spatial multi-omics: deciphering technological landscape of integration of multi-omics and its applications
    Liu, Xiaojie
    Peng, Ting
    Xu, Miaochun
    Lin, Shitong
    Hu, Bai
    Chu, Tian
    Liu, Binghan
    Xu, Yashi
    Ding, Wencheng
    Li, Li
    Cao, Canhui
    Wu, Peng
    JOURNAL OF HEMATOLOGY & ONCOLOGY, 2024, 17 (01)
  • [46] Multi-omics Data Integration and Network Inference for Biomarker Discovery in Glioma
    Coletti, Roberta
    Lopes, Marta B.
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 247 - 259
  • [47] MiBiOmics: an interactive web application for multi-omics data exploration and integration
    Johanna Zoppi
    Jean-François Guillaume
    Michel Neunlist
    Samuel Chaffron
    BMC Bioinformatics, 22
  • [48] Machine Learning for multi-omics data integration and variant pathogenicity estimation
    Li, Shuang
    van der Velde, K. Joeri
    Swertz, Morris A.
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE 2018), 2018, : 301 - 301
  • [49] Paired single-cell multi-omics data integration with Mowgli
    Geert-Jan Huizing
    Ina Maria Deutschmann
    Gabriel Peyré
    Laura Cantini
    Nature Communications, 14
  • [50] DeepDRA: Drug repurposing using multi-omics data integration with autoencoders
    Mohammadzadeh-Vardin, Taha
    Ghareyazi, Amin
    Gharizadeh, Ali
    Abbasi, Karim
    Rabiee, Hamid R.
    PLOS ONE, 2024, 19 (07):