Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

Cited by: 1
Authors
Ghareyazi, Amin [1]
Kazemi, Amirreza [1,2]
Hamidieh, Kimia [3]
Dashti, Hamed [1]
Tahaei, Maedeh Sadat [1]
Rabiee, Hamid R. [1]
Alinejad-Rokny, Hamid [4,5,6]
Dehzangi, Iman [7,8]
Affiliations
[1] Sharif Univ Technol, Dept Comp Engn, Bioinformat & Computat Biol Lab, Tehran 11365, Iran
[2] Simon Fraser Univ, Dept Comp Engn, Burnaby, BC 1S6, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H2, Canada
[4] UNSW Sydney, BioMed Machine Learning Lab BML, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[5] Univ New South Wales UNSW Sydney, UNSW Data Sci Hub, Sydney, NSW 2052, Australia
[6] Macquarie Univ, AI Enabled Proc AIP Res Ctr, Sydney, NSW 2109, Australia
[7] Rutgers State Univ, Dept Comp Sci, Camden, NJ 08102 USA
[8] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102 USA
Keywords
Pan-cancer; Somatic point mutations; Cancer subtyping; Biomarker discovery; Driver genes; Personalized medicine; Health data analytics; MOLECULAR CLASSIFICATION; GENE; IDENTIFICATION; SUBTYPES; DNA;
DOI
10.1186/s12859-022-04840-6
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Background: The advent of high-throughput sequencing has enabled researchers to systematically evaluate genetic variation in cancer and to identify many cancer-associated genes. Although cancers arising in the same tissue are usually placed in the same group, their mutational profiles differ considerably. Hence, there is no definitive treatment for most cancer types. This underlines the need for new pipelines that accurately identify cancer-associated genes and re-classify patients by mutational profile. Grouping patients with similar mutational profiles may help discover subtypes of patients who might benefit from specific types of treatment.
Results: In this study, we propose a new machine learning pipeline that identifies protein-coding genes mutated in many samples and uses them to identify cancer subtypes. We apply the pipeline to 12,270 samples collected from the International Cancer Genome Consortium (ICGC), covering 19 cancer types, and identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates that the subtypes have distinguishable properties, including unique cancer-related signaling pathways.
Conclusions: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures of the samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and axon guidance pathways, are frequently observed in cancer. Interestingly, we also identified several mutated genes and different mutation rates in multiple cancer subtypes. In addition, our study of "gene-motif" suggests the importance of considering both the context of the mutations and the mutational processes when identifying cancer-associated genes.
The source code for our proposed clustering pipeline and analysis is publicly available at: https://github.com/bcb-sut/Pan-Cancer.
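The abstract describes the pipeline only at a high level: select protein-coding genes mutated in many samples, then group samples with similar mutational profiles. The toy Python sketch below illustrates that general idea and is not the authors' implementation, which is in the repository linked above. The function names, the recurrence threshold, the Jaccard similarity measure, the single-linkage grouping, and the example records are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def recurrent_genes(records, min_fraction):
    """Return genes mutated in at least `min_fraction` of all samples."""
    samples = {sample for sample, _ in records}
    carriers = defaultdict(set)
    for sample, gene in records:
        carriers[gene].add(sample)
    return {
        gene
        for gene, carrying in carriers.items()
        if len(carrying) / len(samples) >= min_fraction
    }


def mutation_profiles(records, genes):
    """Map each sample to the set of selected genes it carries mutations in."""
    profiles = defaultdict(set)
    for sample, gene in records:
        profiles[sample]  # make sure every sample gets an entry
        if gene in genes:
            profiles[sample].add(gene)
    return dict(profiles)


def jaccard(a, b):
    """Jaccard similarity of two gene sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group_samples(profiles, min_similarity):
    """Single-linkage grouping: link samples whose similarity passes the cutoff."""
    parent = {sample: sample for sample in profiles}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for a, b in combinations(sorted(profiles), 2):
        if jaccard(profiles[a], profiles[b]) >= min_similarity:
            parent[find(a)] = find(b)

    groups = defaultdict(list)
    for sample in sorted(profiles):
        groups[find(sample)].append(sample)
    return sorted(groups.values())


# Hypothetical (sample, gene) mutation records, invented for this example.
records = [
    ("S1", "TP53"), ("S1", "KRAS"),
    ("S2", "TP53"), ("S2", "KRAS"),
    ("S3", "BRAF"), ("S3", "NRAS"),
    ("S4", "BRAF"), ("S4", "NRAS"), ("S4", "TTN"),
]

genes = recurrent_genes(records, min_fraction=0.5)
subtypes = group_samples(mutation_profiles(records, genes), min_similarity=0.5)
print(sorted(genes))
print(subtypes)
```

In this toy input, TTN is mutated in only one of four samples and is filtered out before grouping, so the remaining profiles split into two groups of samples. A real analysis at the scale described in the abstract would need a proper clustering method and a principled choice of thresholds.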
Pages: 21