Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

被引:1
|
作者
Ghareyazi, Amin [1 ]
Kazemi, Amirreza [1 ,2 ]
Hamidieh, Kimia [3 ]
Dashti, Hamed [1 ]
Tahaei, Maedeh Sadat [1 ]
Rabiee, Hamid R. [1 ]
Alinejad-Rokny, Hamid [4 ,5 ,6 ]
Dehzangi, Iman [7 ,8 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Bioinformat & Computat Biol Lab, Tehran 11365, Iran
[2] Simon Fraser Univ, Dept Comp Engn, Burnaby, BC 1S6, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H2, Canada
[4] UNSW Sydney, BioMed Machine Learning Lab BML, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[5] Univ New South Wales UNSW Sydney, UNSW Data Sci Hub, Sydney, NSW 2052, Australia
[6] Macquarie Univ, AI Enabled Proc AIP Res Ctr, Sydney, NSW 2109, Australia
[7] Rutgers State Univ, Dept Comp Sci, Camden, NJ 08102 USA
[8] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102 USA
关键词
Pan-cancer; Somatic point mutations; Cancer subtyping; Biomarker discovery; Driver genes; Personalized medicine; Health data analytics; MOLECULAR CLASSIFICATION; GENE; IDENTIFICATION; SUBTYPES; DNA;
D O I
10.1186/s12859-022-04840-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The advent of high throughput sequencing has enabled researchers to systematically evaluate the genetic variations in cancer, identifying many cancer-associated genes. Although cancers in the same tissue are widely categorized in the same group, they demonstrate many differences concerning their mutational profiles. Hence, there is no definitive treatment for most cancer types. This reveals the importance of developing new pipelines to identify cancer-associated genes accurately and re-classify patients with similar mutational profiles. Classification of cancer patients with similar mutational profiles may help discover subtypes of cancer patients who might benefit from specific treatment types. Results: In this study, we propose a new machine learning pipeline to identify protein-coding genes mutated in many samples to identify cancer subtypes. We apply our pipeline to 12,270 samples collected from the international cancer genome consortium, covering 19 cancer types. As a result, we identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates distinguishable properties, including unique cancer-related signaling pathways. Conclusions: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures for samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and the Axon guidance pathways, are frequently observed in cancer disease. Interestingly, we also identified several mutated genes and different rates of mutation in multiple cancer subtypes. In addition, our study on "gene-motif" suggests the importance of considering both the context of the mutations and mutational processes in identifying cancer-associated genes. The source codes for our proposed clustering pipeline and analysis are publicly available at: https://github.com/bcb-sut/Pan-Cancer.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Oncogenic Landscape of Somatic Mutations Perturbing Pan-Cancer lncRNA-ceRNA Regulation
    Zhang, Yuanfu
    Han, Peng
    Guo, Qiuyan
    Hao, Yangyang
    Qi, Yue
    Xin, Mengyu
    Zhang, Yafang
    Cui, Binbin
    Wang, Peng
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
  • [32] Discovery of pan-cancer related genes via integrative network analysis
    Zhu, Yuan
    Zhang, Houwang
    Yang, Yuanhang
    Zhang, Chaoyang
    Le Ou-Yang
    Bai, Litai
    Deng, Minghua
    Ming Yi
    Song Liu
    Chao Wang
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2022, 21 (04) : 325 - 338
  • [33] Pan-cancer analysis of systematic batch effects on somatic sequence variations
    Ji-Hye Choi
    Seong-Eui Hong
    Hyun Goo Woo
    BMC Bioinformatics, 18
  • [34] Pan-cancer analysis reveals DDX21 as a potential biomarker for the prognosis of multiple tumor types
    Hu, Ankang
    Wang, Yonghui
    Tian, Jiahao
    Chen, Zihan
    Chen, Renjin
    Han, Xufeng
    Chen, Yang
    Liu, Tingjun
    Chen, Quangang
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [35] A Pan-Cancer Analysis of Transcriptome Changes Associated with Somatic Mutations in U2AF1 Reveals Commonly Altered Splicing Events
    Brooks, Angela N.
    Choi, Peter S.
    de Waal, Luc
    Sharifnia, Tanaz
    Imielinski, Marcin
    Saksena, Gordon
    Pedamallu, Chandra Sekhar
    Sivachenko, Andrey
    Rosenberg, Mara
    Chmielecki, Juliann
    Lawrence, Michael S.
    DeLuca, David S.
    Getz, Gad
    Meyerson, Matthew
    PLOS ONE, 2014, 9 (01):
  • [36] Pan-cancer analysis of systematic batch effects on somatic sequence variations
    Choi, Ji-Hye
    Hong, Seong-Eui
    Woo, Hyun Goo
    BMC BIOINFORMATICS, 2017, 18
  • [37] Whole-genome analysis reveals the contribution of non-coding de novo transposon insertions to autism spectrum disorder
    Borges-Monroy, Rebeca
    Chu, Chong
    Dias, Caroline
    Choi, Jaejoon
    Lee, Soohyun
    Gao, Yue
    Shin, Taehwan
    Park, Peter J.
    Walsh, Christopher A.
    Lee, Eunjung Alice
    MOBILE DNA, 2021, 12 (01)
  • [38] Integrative pan-cancer analysis reveals a common architecture of dysregulated transcriptional networks characterized by loss of enhancer methylation
    Ankill, Jorgen
    Zhao, Zhi
    Tekpli, Xavier
    Kure, Elin H.
    Kristensen, Vessela N.
    Mathelier, Anthony
    Fleischer, Thomas
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (11)
  • [39] Pan-cancer analysis reveals an immunological role and prognostic potential of PXN in human cancer
    Chen, Yun
    Zhao, Han
    Xiao, Yan
    Shen, Peijun
    Tan, Li
    Zhang, Shaohui
    Liu, Qiong
    Gao, Zhengrong
    Zhao, Jie
    Zhao, Yaqiong
    Guo, Yue
    Feng, Yunzhi
    AGING-US, 2021, 13 (12): : 16248 - 16266
  • [40] Pan-cancer analysis reveals CHRM2 as a biomarker for cancer prognosis and immunotherapy
    Huang, Qingming
    Su, Zhengwei
    Tang, Han
    Ban, Chengjie
    Xie, Huadong
    Liao, Tianling
    Liao, Kangji
    Cheng, Zhi
    Yi, Xian lin
    GENETICS AND MOLECULAR RESEARCH, 2024, 23 (03):