Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

被引:1
|
作者
Ghareyazi, Amin [1 ]
Kazemi, Amirreza [1 ,2 ]
Hamidieh, Kimia [3 ]
Dashti, Hamed [1 ]
Tahaei, Maedeh Sadat [1 ]
Rabiee, Hamid R. [1 ]
Alinejad-Rokny, Hamid [4 ,5 ,6 ]
Dehzangi, Iman [7 ,8 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Bioinformat & Computat Biol Lab, Tehran 11365, Iran
[2] Simon Fraser Univ, Dept Comp Engn, Burnaby, BC 1S6, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H2, Canada
[4] UNSW Sydney, BioMed Machine Learning Lab BML, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[5] Univ New South Wales UNSW Sydney, UNSW Data Sci Hub, Sydney, NSW 2052, Australia
[6] Macquarie Univ, AI Enabled Proc AIP Res Ctr, Sydney, NSW 2109, Australia
[7] Rutgers State Univ, Dept Comp Sci, Camden, NJ 08102 USA
[8] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102 USA
关键词
Pan-cancer; Somatic point mutations; Cancer subtyping; Biomarker discovery; Driver genes; Personalized medicine; Health data analytics; MOLECULAR CLASSIFICATION; GENE; IDENTIFICATION; SUBTYPES; DNA;
D O I
10.1186/s12859-022-04840-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The advent of high throughput sequencing has enabled researchers to systematically evaluate the genetic variations in cancer, identifying many cancer-associated genes. Although cancers in the same tissue are widely categorized in the same group, they demonstrate many differences concerning their mutational profiles. Hence, there is no definitive treatment for most cancer types. This reveals the importance of developing new pipelines to identify cancer-associated genes accurately and re-classify patients with similar mutational profiles. Classification of cancer patients with similar mutational profiles may help discover subtypes of cancer patients who might benefit from specific treatment types. Results: In this study, we propose a new machine learning pipeline to identify protein-coding genes mutated in many samples to identify cancer subtypes. We apply our pipeline to 12,270 samples collected from the international cancer genome consortium, covering 19 cancer types. As a result, we identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates distinguishable properties, including unique cancer-related signaling pathways. Conclusions: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures for samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and the Axon guidance pathways, are frequently observed in cancer disease. Interestingly, we also identified several mutated genes and different rates of mutation in multiple cancer subtypes. In addition, our study on "gene-motif" suggests the importance of considering both the context of the mutations and mutational processes in identifying cancer-associated genes. The source codes for our proposed clustering pipeline and analysis are publicly available at: https://github.com/bcb-sut/Pan-Cancer.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] The use of pan-cancer analysis of ADAMTS9 expression in various cancer types
    Jiang, Shijun
    Jiang, Ying
    Cao, Yingli
    Zhao, Yiyang
    Liu, Hongfeng
    Wang, Xiuli
    He, Zikang
    Song, Zheyao
    Wang, Xingyun
    Liu, Gang
    Cui, Rongjun
    EPIGENOMICS, 2021, 13 (04) : 253 - 256
  • [42] Whole-genome cancer analysis as an approach to deeper understanding of tumour biology
    Strausberg, R. L.
    Simpson, A. J. G.
    BRITISH JOURNAL OF CANCER, 2010, 102 (02) : 243 - 248
  • [43] A Pan-Cancer Analysis Reveals CLEC5A as a Biomarker for Cancer Immunity and Prognosis
    Chen, Rui
    Wu, Wantao
    Chen, Si-Yu
    Liu, Zheng-Zheng
    Wen, Zhi-Peng
    Yu, Jing
    Zhang, Long-Bo
    Liu, Zaoqu
    Zhang, Jian
    Luo, Peng
    Zeng, Wen-Jing
    Cheng, Quan
    FRONTIERS IN IMMUNOLOGY, 2022, 13
  • [44] Single-cell whole-genome sequencing reveals the functional landscape of somatic mutations in B lymphocytes across the human lifespan
    Zhang, Lei
    Dong, Xiao
    Lee, Moonsook
    Maslov, Alexander Y.
    Wang, Tao
    Vijg, Jan
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (18) : 9014 - 9019
  • [45] A Pan-Cancer Analysis Reveals the Abnormal Expression and Drug Sensitivity of CSF1
    Dai, Xiaoshuo
    Chen, Xinhuan
    Chen, Wei
    Chen, Yihuan
    Zhao, Jun
    Zhang, Qiushuang
    Lu, Jing
    ANTI-CANCER AGENTS IN MEDICINAL CHEMISTRY, 2022, 22 (07) : 1296 - 1312
  • [46] Pan-Cancer Analysis Reveals That E1A Binding Protein p300 Mutations Increase Genome Instability and Antitumor Immunity
    Chen, Zuobing
    Chen, Canping
    Li, Lin
    Zhang, Tianfang
    Wang, Xiaosheng
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
  • [47] Molecular Correlates of Metastasis by Systematic Pan-Cancer Analysis Across The Cancer Genome Atlas
    Chen, Fengju
    Zhang, Yiqun
    Varambally, Sooryanarayana
    Creighton, Chad J.
    MOLECULAR CANCER RESEARCH, 2019, 17 (02) : 476 - 487
  • [48] Systematic pan-cancer analysis identifies ZBTB11 as a potential pan-cancer biomarker and immunotherapy target in multiple tumor types
    Xu, Peiyi
    Zhang, Qiuyan
    Zhai, Jing
    Chen, Pu
    Deng, Xueting
    Miao, Lin
    Zhang, Xiuhua
    DISCOVER ONCOLOGY, 2024, 15 (01)
  • [49] Pan-cancer analysis of TCGA data reveals notable signaling pathways
    Richard Neapolitan
    Curt M. Horvath
    Xia Jiang
    BMC Cancer, 15
  • [50] Pan-cancer analysis of TCGA data reveals notable signaling pathways
    Neapolitan, Richard
    Horvath, Curt M.
    Jiang, Xia
    BMC CANCER, 2015, 15