Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

被引:1
|
作者
Ghareyazi, Amin [1 ]
Kazemi, Amirreza [1 ,2 ]
Hamidieh, Kimia [3 ]
Dashti, Hamed [1 ]
Tahaei, Maedeh Sadat [1 ]
Rabiee, Hamid R. [1 ]
Alinejad-Rokny, Hamid [4 ,5 ,6 ]
Dehzangi, Iman [7 ,8 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Bioinformat & Computat Biol Lab, Tehran 11365, Iran
[2] Simon Fraser Univ, Dept Comp Engn, Burnaby, BC 1S6, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H2, Canada
[4] UNSW Sydney, BioMed Machine Learning Lab BML, Grad Sch Biomed Engn, Sydney, NSW 2052, Australia
[5] Univ New South Wales UNSW Sydney, UNSW Data Sci Hub, Sydney, NSW 2052, Australia
[6] Macquarie Univ, AI Enabled Proc AIP Res Ctr, Sydney, NSW 2109, Australia
[7] Rutgers State Univ, Dept Comp Sci, Camden, NJ 08102 USA
[8] Rutgers State Univ, Ctr Computat & Integrat Biol, Camden, NJ 08102 USA
关键词
Pan-cancer; Somatic point mutations; Cancer subtyping; Biomarker discovery; Driver genes; Personalized medicine; Health data analytics; MOLECULAR CLASSIFICATION; GENE; IDENTIFICATION; SUBTYPES; DNA;
D O I
10.1186/s12859-022-04840-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The advent of high throughput sequencing has enabled researchers to systematically evaluate the genetic variations in cancer, identifying many cancer-associated genes. Although cancers in the same tissue are widely categorized in the same group, they demonstrate many differences concerning their mutational profiles. Hence, there is no definitive treatment for most cancer types. This reveals the importance of developing new pipelines to identify cancer-associated genes accurately and re-classify patients with similar mutational profiles. Classification of cancer patients with similar mutational profiles may help discover subtypes of cancer patients who might benefit from specific treatment types. Results: In this study, we propose a new machine learning pipeline to identify protein-coding genes mutated in many samples to identify cancer subtypes. We apply our pipeline to 12,270 samples collected from the international cancer genome consortium, covering 19 cancer types. As a result, we identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates distinguishable properties, including unique cancer-related signaling pathways. Conclusions: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures for samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and the Axon guidance pathways, are frequently observed in cancer disease. Interestingly, we also identified several mutated genes and different rates of mutation in multiple cancer subtypes. In addition, our study on "gene-motif" suggests the importance of considering both the context of the mutations and mutational processes in identifying cancer-associated genes. The source codes for our proposed clustering pipeline and analysis are publicly available at: https://github.com/bcb-sut/Pan-Cancer.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Pan-Cancer Analysis Reveals Differential Susceptibility of Bidirectional Gene Promoters to DNA Methylation, Somatic Mutations, and Copy Number Alterations
    Thompson, Jeffrey A.
    Christensen, Brock C.
    Marsit, Carmen J.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (08)
  • [22] An integrative and comparative study of pan-cancer transcriptomes reveals distinct cancer common and specific signatures
    Cao, Zhen
    Zhang, Shihua
    SCIENTIFIC REPORTS, 2016, 6
  • [23] Pan-Cancer Analysis of the Genomic Alterations and Mutations of the Matrisome
    Izzi, Valerio
    Davis, Martin N.
    Naba, Alexandra
    CANCERS, 2020, 12 (08) : 1 - 21
  • [24] Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer
    Fujimoto, Akihiro
    Wong, Jing Hao
    Yoshii, Yukiko
    Akiyama, Shintaro
    Tanaka, Azusa
    Yagi, Hitomi
    Shigemizu, Daichi
    Nakagawa, Hidewaki
    Mizokami, Masashi
    Shimada, Mihoko
    GENOME MEDICINE, 2021, 13 (01)
  • [25] A Pan-Cancer and Polygenic Bayesian Hierarchical Model for the Effect of Somatic Mutations on Survival
    Samorodnitsky, Sarah
    Hoadley, Katherine A.
    Lock, Eric F.
    CANCER INFORMATICS, 2020, 19
  • [26] Pan-cancer Analysis Reveals Cancer-dependent Expression of SOX17 and Associated Clinical Outcomes
    Xu, Li
    Bai, Youhuang
    Cheng, Yihang
    Sheng, Xiujie
    Sun, Deqiang
    CANCER GENOMICS & PROTEOMICS, 2023, 20 (05) : 433 - 447
  • [27] Integrative analysis of the role of BOLA2B in human pan-cancer
    Liang, Mingxing
    Fei, Yinjiao
    Wang, Yalin
    Chen, Wenquan
    Liu, Zhen
    Xu, Di
    Shen, Hongyu
    Zhou, Honglei
    Tang, Jinhai
    FRONTIERS IN GENETICS, 2023, 14
  • [28] An integrative pan-cancer analysis reveals the oncogenic role of mutS homolog 6 (MSH6) in human tumors
    Zhan, Haibo
    Mo, Fengbo
    Xu, Qiang
    Wang, Song
    Zhang, Bin
    Liu, Xuqiang
    Dai, Min
    Liu, Hucheng
    AGING-US, 2021, 13 (23): : 25271 - 25290
  • [29] An integrative pan-cancer analysis of WWC family genes and functional validation in lung cancer
    Huang, Hongmei
    Gu, Jiaji
    Kuang, Xinjie
    Yu, Yonghui
    Rao, Boqi
    Fang, Shenying
    Lu, Jiachun
    Qiu, Fuman
    CELLULAR SIGNALLING, 2024, 115
  • [30] Somatic retrotransposition in human cancer revealed by whole-genome and exome sequencing
    Helman, Elena
    Lawrence, Michael S.
    Stewart, Chip
    Sougnez, Carrie
    Getz, Gad
    Meyerson, Matthew
    GENOME RESEARCH, 2014, 24 (07) : 1053 - 1063