Comparing variants related to chronic diseases from genome-wide association study (GWAS) and the cancer genome atlas (TCGA)

被引:1
作者
Jeon, Soohyun [1 ]
Park, Chaewon [2 ,3 ]
Kim, Jineui [4 ]
Lee, Jung Hoon [5 ]
Joe, Sung-yune [2 ,3 ]
Ko, Young Kyung [6 ]
Gim, Jeong-An [7 ]
机构
[1] Korea Univ, Dept Brain & Cognit Engn, Seoul 02841, South Korea
[2] Korea Univ, Sch Biomed Engn, Seoul 02841, South Korea
[3] Korea Univ, Interdisciplinary Program Precis Publ Hlth, Seoul 02841, South Korea
[4] Korea Univ, Coll Med, Inst Viral Dis, Dept Microbiol, Seoul 02841, South Korea
[5] Korea Univ, Coll Med, Dept Pharmacol, Seoul 02841, South Korea
[6] Korea Univ, Guro Hosp, Dept Internal Med, Div Pulm Allergy & Crit Care Med, Seoul 08308, South Korea
[7] Soonchunhyang Univ, Dept Med Sci, Asan 31538, South Korea
基金
新加坡国家研究基金会;
关键词
Cancer; Chronic diseases; Variants; Genome-wide association study; 1000 genomes project; RESOURCE;
D O I
10.1186/s12920-023-01758-7
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
BackgroundSeveral genome-wide association studies (GWAS) have been performed to identify variants related to chronic diseases. Somatic variants in cancer tissues are associated with cancer development and prognosis. Expression quantitative trait loci (eQTL) and methylation QTL (mQTL) analyses were performed on chronic disease-related variants in TCGA dataset.MethodsMuTect2 calling variants for 33 cancers from TCGA and 296 GWAS variants provided by LocusZoom were used. At least one mutation was found in TCGA 22 cancers and LocusZoom 23 studies. Differentially expressed genes (DEGs) and differentially methylated regions (DMRs) from the three cancers (TCGA-COAD, TCGA-STAD, and TCGA-UCEC). Variants were mapped to the world map using population locations of the 1000 Genomes Project (1GP) populations. Decision tree analysis was performed on the discovered features and survival analysis was performed according to the cluster.ResultsBased on the DEGs and DMRs with clinical data, the decision tree model classified seven and three nodes in TCGA-COAD and TCGA-STAD, respectively. A total of 11 variants were commonly detected from TCGA and LocusZoom, and eight variants were selected from the 1GP variants, and the distribution patterns were visualized on the world map.ConclusionsVariants related to tumors and chronic diseases were selected, and their geological regional 1GP-based proportions are presented. The variant distribution patterns could provide clues for regional clinical trial designs and personalized medicine.
引用
收藏
页数:15
相关论文
共 25 条
  • [1] Examining How Race, Ethnicity, and Ancestry Data Are Used in Biomedical Research
    Bonham, Vence L.
    Green, Eric D.
    Perez-Stable, Eliseo J.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2018, 320 (15): : 1533 - 1534
  • [2] OPINION Non-genetic heterogeneity - a mutation-independent driving force for the somatic evolution of tumours
    Brock, Amy
    Chang, Hannah
    Huang, Sui
    [J]. NATURE REVIEWS GENETICS, 2009, 10 (05) : 336 - 342
  • [3] A systematic review and meta-analysis of adherence to physical activity interventions among three chronic conditions: cancer, cardiovascular disease, and diabetes
    Bullard, Tiffany
    Ji, Mengmeng
    An, Ruopeng
    Trinh, Linda
    Mackenzie, Michael
    Mullen, Sean P.
    [J]. BMC PUBLIC HEALTH, 2019, 19 (1)
  • [4] Germline Variants That Affect Tumor Progression
    Chatrath, Ajay
    Ratan, Aakrosh
    Dutta, Anindya
    [J]. TRENDS IN GENETICS, 2021, 37 (05) : 433 - 443
  • [5] A Pan-Cancer Analysis of Enhancer Expression in Nearly 9000 Patient Samples
    Chen, Han
    Li, Chunyan
    Peng, Xinxin
    Zhou, Zhicheng
    Weinstein, John N.
    Liang, Han
    [J]. CELL, 2018, 173 (02) : 386 - +
  • [6] The international Genome sample resource (IGSR): A worldwide collection of genome variation incorporating the 1000 Genomes Project data
    Clarke, Laura
    Fairley, Susan
    Zheng-Bradley, Xiangqun
    Streeter, Ian
    Perry, Emily
    Lowy, Ernesto
    Tasse, Anne-Marie
    Flicek, Paul
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) : D854 - D859
  • [7] TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
    Colaprico, Antonio
    Silva, Tiago C.
    Olsen, Catharina
    Garofano, Luciano
    Cava, Claudia
    Garolini, Davide
    Sabedot, Thais S.
    Malta, Tathiane M.
    Pagnotta, Stefano M.
    Castiglioni, Isabella
    Ceccarelli, Michele
    Bontempi, Gianluca
    Noushmehr, Houtan
    [J]. NUCLEIC ACIDS RESEARCH, 2016, 44 (08) : e71
  • [8] A systematic approach to assessing the clinical significance of genetic variants
    Duzkale, H.
    Shen, J.
    McLaughlin, H.
    Alfares, A.
    Kelly, M. A.
    Pugh, T. J.
    Funke, B. H.
    Rehm, H. L.
    Lebo, M. S.
    [J]. CLINICAL GENETICS, 2013, 84 (05) : 453 - 463
  • [9] Expression quantitative trait locus analysis for translational medicine
    Gibson, Greg
    Powell, Joseph E.
    Marigorta, Urko M.
    [J]. GENOME MEDICINE, 2015, 7
  • [10] Complex heatmaps reveal patterns and correlations in multidimensional genomic data
    Gu, Zuguang
    Eils, Roland
    Schlesner, Matthias
    [J]. BIOINFORMATICS, 2016, 32 (18) : 2847 - 2849