Integrative analysis of GWAS and transcriptomics data reveal key genes for non-small lung cancer

Cited by: 0
Author
Xiangxiong Feng
Affiliation
[1] University of California, Davis
Source
Medical Oncology, vol. 40
Keywords
GWAS; Transcriptomics; Lung cancer; Machine learning;
DOI
Not available
Abstract
Lung cancer is one of the world's most common and deadliest cancers. Its two main types are non-small cell lung cancer (NSCLC) and small cell lung cancer (SCLC), and more than 85% of lung cancers are NSCLC. Genetic factors play a significant role in NSCLC risk, and a growing number of studies examine risk factors at the molecular level. The aim of this study is to build a pipeline that integrates genome-wide association study (GWAS) and transcriptomics data with machine learning to identify genetic risk factors for NSCLC effectively. GWAS datasets and GWAS summary data covering lung carcinoma genetic variants in the European population were downloaded from the GWAS Catalog. Functional analysis of the significant SNPs in the GWAS summary data was then performed with the FUMA GWAS web server. Transcriptomics data from people with and without NSCLC were used to build a machine learning model that identifies the key genes that help predict NSCLC. The top up-regulated and down-regulated genes were identified with the BART Cancer web server, and the mechanistic roles of these genes were validated by literature review. Through integrative analysis of GWAS and transcriptomics data using machine learning, we identified multiple SNPs and genes related to NSCLC. The computational pipeline may facilitate biomarker discovery for NSCLC and other diseases.
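The first step of such a pipeline, selecting significant SNPs from GWAS summary data before functional analysis, could be sketched roughly as follows. This is only an illustration and not the author's code: the column names (`variant_id`, `chromosome`, `p_value`), the conventional genome-wide threshold of 5e-8, the helper names, and the toy rows are all assumptions, since the abstract does not state them.

```python
import csv
import io

# Assumption: the conventional genome-wide significance threshold.
# The abstract does not say which cutoff the study used.
GENOME_WIDE_P = 5e-8


def load_summary(text):
    """Parse comma-separated summary statistics into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def significant_snps(rows, p_threshold=GENOME_WIDE_P):
    """Keep rows with p-value below the threshold, sorted by ascending p-value.

    Rows with a missing or non-numeric p-value are skipped.
    """
    hits = []
    for row in rows:
        try:
            p = float(row["p_value"])
        except (KeyError, ValueError):
            continue
        if p < p_threshold:
            hits.append({**row, "p_value": p})
    return sorted(hits, key=lambda r: r["p_value"])


# Toy rows invented for illustration; these are not real variants or results.
TOY_SUMMARY = """variant_id,chromosome,p_value
rs_example_1,5,3e-12
rs_example_2,15,4e-9
rs_example_3,6,2e-4
rs_example_4,3,NA
"""

hits = significant_snps(load_summary(TOY_SUMMARY))
for hit in hits:
    print(hit["variant_id"], hit["chromosome"], hit["p_value"])
```

In practice the filtered variant list would be passed on to a functional annotation tool such as the FUMA GWAS web server; the upload step is not shown here because its input format is not described in the abstract.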