A comparative study of multi-omics integration tools for cancer driver gene identification and tumour subtyping

被引:50
作者
Sathyanarayanan, Anita [1 ,2 ]
Gupta, Rohit [8 ]
Thompson, Erik W. [2 ,3 ]
Nyholt, Dale R. [4 ,5 ]
Bauer, Denis C. [6 ]
Nagaraj, Shivashankar H. [2 ,5 ,7 ]
机构
[1] Queensland Univ Technol, Inst Hlth & Biomed Innovat, Bioinformat, Brisbane, Qld, Australia
[2] Queensland Univ Technol, Sch Biomed Sci, Brisbane, Qld, Australia
[3] Queensland Univ Technol, Inst Hlth & Biomed Innovat, Translat Res Inst, Brisbane, Qld, Australia
[4] Queensland Univ Technol, Fac Hlth, Sch Biomed Sci, Brisbane, Qld, Australia
[5] Queensland Univ Technol, Inst Hlth & Biomed Innovat, Brisbane, Qld, Australia
[6] CSIRO, Brisbane, Qld, Australia
[7] Translat Res Inst, Brisbane, Qld, Australia
[8] Indian Inst Technol Madras, Dept Biotechnol, Chennai, Tamil Nadu, India
关键词
multi-omics data; cancer; multi-staged integration; meta-dimensional integration; tools evaluation; DNA COPY NUMBER; MESSENGER-RNA EXPRESSION; GENOMIC CHARACTERIZATION; R PACKAGE; CENTRAL DOGMA; CLASSIFICATION; REVEALS; METHYLATION; DISCOVERY; NETWORK;
D O I
10.1093/bib/bbz121
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Oncogenesis and cancer can arise as a consequence of a wide range of genomic aberrations including mutations, copy number alterations, expression changes and epigenetic modifications encompassing multiple omics layers. Integrating genomic, transcriptomic, proteomic and epigenomic datasets via multi-omics analysis provides the opportunity to derive a deeper and holistic understanding of the development and progression of cancer. There are two primary approaches to integrating multi-omics data: multi-staged (focused on identifying genes driving cancer) and meta-dimensional (focused on establishing clinically relevant tumour or sample classifications). A number of ready-to-use bioinformatics tools are available to perform both multi-staged and meta-dimensional integration of multi-omics data. In this study, we compared nine different integration tools using real and simulated cancer datasets. The performance of the multi-staged integration tools were assessed at the gene, function and pathway levels, while meta-dimensional integration tools were assessed based on the sample classification performance. Additionally, we discuss the influence of factors such as data representation, sample size, signal and noise on multi-omics data integration. Our results provide current and much needed guidance regarding selection and use of the most appropriate and best performing multi-omics integration tools.
引用
收藏
页码:1920 / 1936
页数:17
相关论文
共 64 条
  • [1] Integrated Genomic Characterization of Pancreatic Ductal Adenocarcinoma
    Aguirre, Andrew J.
    Hruban, Ralph H.
    Raphael, Benjamin J.
    [J]. CANCER CELL, 2017, 32 (02) : 185 - +
  • [2] Genomic Classification of Cutaneous Melanoma
    Akbani, Rehan
    Akdemir, Kadir C.
    Aksoy, B. Arman
    Albert, Monique
    Ally, Adrian
    Amin, Samirkumar B.
    Arachchi, Harindra
    Arora, Arshi
    Auman, J. Todd
    Ayala, Brenda
    Baboud, Julien
    Balasundaram, Miruna
    Balu, Saianand
    Barnabas, Nandita
    Bartlett, John
    Bartlett, Pam
    Bastian, Boris C.
    Baylin, Stephen B.
    Behera, Madhusmita
    Belyaev, Dmitry
    Benz, Christopher
    Bernard, Brady
    Beroukhim, Rameen
    Bir, Natalie
    Black, Aaron D.
    Bodenheimer, Tom
    Boice, Lori
    Boland, Genevieve M.
    Bono, Riccardo
    Bootwalla, Moiz S.
    Bosenberg, Marcus
    Bowen, Jay
    Bowlby, Reanne
    Bristow, Christopher A.
    Brockway-Lunardi, Laura
    Brooks, Denise
    Brzezinski, Jakub
    Bshara, Wiam
    Buda, Elizabeth
    Burns, William R.
    Butterfield, Yaron S. N.
    Button, Michael
    Calderone, Tiffany
    Cappellini, Giancarlo Antonini
    Carter, Candace
    Carter, Scott L.
    Cherney, Lynn
    Cherniack, Andrew D.
    Chevalier, Aaron
    Chin, Lynda
    [J]. CELL, 2015, 161 (07) : 1681 - 1696
  • [3] Comprehensive and Integrative Genomic Characterization of Hepatocellular Carcinoma
    Ally, Adrian
    Balasundaram, Miruna
    Carlsen, Rebecca
    Chuah, Eric
    Clarke, Amanda
    Dhalla, Noreen
    Holt, Robert A.
    Jones, Steven J. M.
    Lee, Darlene
    Ma, Yussanne
    Marra, Marco A.
    Mayo, Michael
    Moore, Richard A.
    Mungall, Andrew J.
    Schein, Jacqueline E.
    Sipahimalani, Payal
    Tam, Angela
    Thiessen, Nina
    Cheung, Dorothy
    Wong, Tina
    Brooks, Denise
    Robertson, A. Gordon
    Bowlby, Reanne
    Mungall, Karen
    Sadeghi, Sara
    Xi, Liu
    Covington, Kyle
    Shinbrot, Eve
    Wheeler, David A.
    Gibbs, Richard A.
    Donehower, Lawrence A.
    Wang, Linghua
    Bowen, Jay
    Gastier-Foster, Julie M.
    Gerken, Mark
    Helsel, Carmen
    Leraas, Kristen M.
    Lichtenberg, Tara M.
    Ramirez, Nilsa C.
    Wise, Lisa
    Zmuda, Erik
    Gabriel, Stacey B.
    Meyerson, Matthew
    Cibulskis, Carrie
    Murray, Bradley A.
    Shih, Juliann
    Beroukhim, Rameen
    Cherniack, Andrew D.
    Schumacher, Steven E.
    Saksena, Gordon
    [J]. CELL, 2017, 169 (07) : 1327 - +
  • [4] Statistics notes - The cost of dichotomising continuous variables
    Altman, DG
    Royston, P
    [J]. BRITISH MEDICAL JOURNAL, 2006, 332 (7549): : 1080 - 1080
  • [5] DNA methylation of distal regulatory sites characterizes dysregulation of cancer genes
    Aran, Dvir
    Sabato, Sivan
    Hellman, Asaf
    [J]. GENOME BIOLOGY, 2013, 14 (03)
  • [6] DNA methylation and gene silencing in cancer
    Baylin S.B.
    [J]. Nature Clinical Practice Oncology, 2005, 2 (Suppl 1): : S4 - S11
  • [7] Methods for the integration of multi-omics data: mathematical aspects
    Bersanelli, Matteo
    Mosca, Ettore
    Remondini, Daniel
    Giampieri, Enrico
    Sala, Claudia
    Castellani, Gastone
    Milanesi, Luciano
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [8] The Somatic Genomic Landscape of Glioblastoma
    Brennan, Cameron W.
    Verhaak, Roel G. W.
    McKenna, Aaron
    Campos, Benito
    Noushmehr, Houtan
    Salama, Sofie R.
    Zheng, Siyuan
    Chakravarty, Debyani
    Sanborn, J. Zachary
    Berman, Samuel H.
    Beroukhim, Rameen
    Bernard, Brady
    Wu, Chang-Jiun
    Genovese, Giannicola
    Shmulevich, Ilya
    Barnholtz-Sloan, Jill
    Zou, Lihua
    Vegesna, Rahulsimham
    Shukla, Sachet A.
    Ciriello, Giovanni
    Yung, W. K.
    Zhang, Wei
    Sougnez, Carrie
    Mikkelsen, Tom
    Aldape, Kenneth
    Bigner, Darell D.
    Van Meir, Erwin G.
    Prados, Michael
    Sloan, Andrew
    Black, Keith L.
    Eschbacher, Jennifer
    Finocchiaro, Gaetano
    Friedman, William
    Andrews, David W.
    Guha, Abhijit
    Iacocca, Mary
    O'Neill, Brian P.
    Foltz, Greg
    Myers, Jerome
    Weisenberger, Daniel J.
    Penny, Robert
    Kucherlapati, Raju
    Perou, Charles M.
    Hayes, D. Neil
    Gibbs, Richard
    Marra, Marco
    Mills, Gordon B.
    Lander, Eric
    Spellman, Paul
    Wilson, Richard
    [J]. CELL, 2013, 155 (02) : 462 - 477
  • [9] Cancer Genome Atlas Research Network, 2018, Nature, V559, pE12, DOI [10.1038/nature13385, 10.1038/s41586-018-0228-6]
  • [10] The cBio Cancer Genomics Portal: An Open Platform for Exploring Multidimensional Cancer Genomics Data
    Cerami, Ethan
    Gao, Jianjiong
    Dogrusoz, Ugur
    Gross, Benjamin E.
    Sumer, Selcuk Onur
    Aksoy, Buelent Arman
    Jacobsen, Anders
    Byrne, Caitlin J.
    Heuer, Michael L.
    Larsson, Erik
    Antipin, Yevgeniy
    Reva, Boris
    Goldberg, Arthur P.
    Sander, Chris
    Schultz, Nikolaus
    [J]. CANCER DISCOVERY, 2012, 2 (05) : 401 - 404