Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

Cited by: 1
Authors
Ghareyazi, Amin [1]
Kazemi, Amirreza [1, 2]
Hamidieh, Kimia [3]
Dashti, Hamed [1]
Tahaei, Maedeh Sadat [1]
Rabiee, Hamid R. [1]
Alinejad-Rokny, Hamid [4, 5, 6]
Dehzangi, Iman [7, 8]
Affiliations
[1] Sharif Univ Technol, Dept Comp Engn, Bioinformat & Computat Biol Lab, Tehran 11365, Iran
[2] Simon Fraser Univ, Dept Comp Engn, Burnaby, BC 1S6, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H2, Canada
[4] UNSW Sydney, BioMed Machine Learning Lab BML, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[5] Univ New South Wales UNSW Sydney, UNSW Data Sci Hub, Sydney, NSW 2052, Australia
[6] Macquarie Univ, AI Enabled Proc AIP Res Ctr, Sydney, NSW 2109, Australia
[7] Rutgers State Univ, Dept Comp Sci, Camden, NJ 08102, USA
[8] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102, USA
Keywords
Pan-cancer; Somatic point mutations; Cancer subtyping; Biomarker discovery; Driver genes; Personalized medicine; Health data analytics; MOLECULAR CLASSIFICATION; GENE; IDENTIFICATION; SUBTYPES; DNA
DOI
10.1186/s12859-022-04840-6
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: The advent of high-throughput sequencing has enabled researchers to systematically evaluate genetic variation in cancer and to identify many cancer-associated genes. Although cancers arising in the same tissue are usually placed in the same group, their mutational profiles differ in many respects. Hence, there is no definitive treatment for most cancer types. This highlights the importance of developing new pipelines that accurately identify cancer-associated genes and re-classify patients with similar mutational profiles. Grouping cancer patients by mutational profile may help discover subtypes of patients who might benefit from specific treatments.
Results: In this study, we propose a new machine learning pipeline that identifies protein-coding genes mutated in many samples and uses them to identify cancer subtypes. We apply the pipeline to 12,270 samples collected from the International Cancer Genome Consortium (ICGC), covering 19 cancer types, and identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates that these subtypes have distinguishable properties, including unique cancer-related signaling pathways.
Conclusions: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures of the samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and axon guidance pathways, are frequently observed in cancer. Interestingly, we also identified several mutated genes and different rates of mutation in multiple cancer subtypes. In addition, our "gene-motif" analysis suggests the importance of considering both the context of the mutations and the mutational processes when identifying cancer-associated genes.
The source code for our proposed clustering pipeline and analysis is publicly available at: https://github.com/bcb-sut/Pan-Cancer.
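To make the idea in the abstract concrete, the following is a minimal sketch of selecting genes mutated in many samples and then grouping samples with similar mutational profiles. It is not the authors' pipeline (see the repository above for that): the function names, the thresholds, the Jaccard similarity measure, the single-linkage grouping, and the toy data are all assumptions chosen for illustration.

```python
from collections import Counter
from itertools import combinations


def recurrent_genes(profiles, min_fraction):
    """Return the genes mutated in at least `min_fraction` of the samples."""
    counts = Counter(gene for genes in profiles.values() for gene in genes)
    cutoff = min_fraction * len(profiles)
    return {gene for gene, n in counts.items() if n >= cutoff}


def jaccard(a, b):
    """Jaccard similarity of two gene sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_samples(profiles, genes, min_similarity):
    """Group samples by single linkage on their recurrent-gene profiles.

    This is a stand-in for a real clustering method, used here only
    because it fits in a few lines (hypothetical choice).
    """
    restricted = {s: g & genes for s, g in profiles.items()}
    parent = {s: s for s in restricted}

    def find(s):
        # Union-find lookup with path halving.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in combinations(restricted, 2):
        if jaccard(restricted[a], restricted[b]) >= min_similarity:
            parent[find(a)] = find(b)

    groups = {}
    for s in restricted:
        groups.setdefault(find(s), set()).add(s)
    return sorted(sorted(g) for g in groups.values())


# Toy sample -> mutated-gene sets, invented for illustration only.
profiles = {
    "s1": {"TP53", "KRAS", "GENE_X"},
    "s2": {"TP53", "KRAS"},
    "s3": {"BRAF", "NRAS"},
    "s4": {"BRAF", "NRAS", "GENE_Y"},
}
genes = recurrent_genes(profiles, min_fraction=0.5)
subtypes = cluster_samples(profiles, genes, min_similarity=0.5)
print(subtypes)
```

On this toy input the recurrence filter drops the two genes seen in only one sample, and the remaining profiles split into two groups. A real analysis would need a proper clustering method and a way to validate the resulting groups.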
Pages: 21
Published in: BMC Bioinformatics, 23