Using clinical and genetic risk factors for risk prediction of 8 cancers in the UK Biobank

被引:8
作者
Hu, Jiaqi [1 ]
Ye, Yixuan [2 ]
Zhou, Geyu [2 ]
Zhao, Hongyu [2 ,3 ,4 ]
机构
[1] Yale Sch Publ Hlth, Dept Chron Dis Epidemiol, New Haven, CT USA
[2] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT USA
[3] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
[4] Yale Sch Publ Hlth, Dept Biostat, POB 208034, 60 Coll St, New Haven, CT 06520 USA
关键词
SUSCEPTIBILITY LOCI; DIABETES-MELLITUS; BREAST-CANCER; ASSOCIATION; LUNG; POPULATION; PROSTATE; ONSET; AGE;
D O I
10.1093/jncics/pkae008
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background Models with polygenic risk scores and clinical factors to predict risk of different cancers have been developed, but these models have been limited by the polygenic risk score-derivation methods and the incomplete selection of clinical variables.Methods We used UK Biobank to train the best polygenic risk scores for 8 cancers (bladder, breast, colorectal, kidney, lung, ovarian, pancreatic, and prostate cancers) and select relevant clinical variables from 733 baseline traits through extreme gradient boosting (XGBoost). Combining polygenic risk scores and clinical variables, we developed Cox proportional hazards models for risk prediction in these cancers.Results Our models achieved high prediction accuracy for 8 cancers, with areas under the curve ranging from 0.618 (95% confidence interval = 0.581 to 0.655) for ovarian cancer to 0.831 (95% confidence interval = 0.817 to 0.845) for lung cancer. Additionally, our models could identify individuals at a high risk for developing cancer. For example, the risk of breast cancer for individuals in the top 5% score quantile was nearly 13 times greater than for individuals in the lowest 10%. Furthermore, we observed a higher proportion of individuals with high polygenic risk scores in the early-onset group but a higher proportion of individuals at high clinical risk in the late-onset group.Conclusion Our models demonstrated the potential to predict cancer risk and identify high-risk individuals with great generalizability to different cancers. Our findings suggested that the polygenic risk score model is more predictive for the cancer risk of early-onset patients than for late-onset patients, while the clinical risk model is more predictive for late-onset patients. Meanwhile, combining polygenic risk scores and clinical risk factors has overall better predictive performance than using polygenic risk scores or clinical risk factors alone.
引用
收藏
页数:12
相关论文
共 40 条
[1]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[2]  
[Anonymous], Tobacco
[3]  
[Anonymous], 2021, Risk factors: Age
[4]  
[Anonymous], 2016, Information and Resources for Cancer: Breast, Colon, Lung, Prostate, Skin
[5]   Obesity and cancer risk: Emerging biological mechanisms and perspectives [J].
Avgerinos, Konstantinos, I ;
Spyrou, Nikolaos ;
Mantzoros, Christos S. ;
Dalamaga, Maria .
METABOLISM-CLINICAL AND EXPERIMENTAL, 2019, 92 :121-135
[6]   Alcohol consumption and site-specific cancer risk: a comprehensive dose-response meta-analysis [J].
Bagnardi, V. ;
Rota, M. ;
Botteri, E. ;
Tramacere, I. ;
Islami, F. ;
Fedirko, V. ;
Scotti, L. ;
Jenab, M. ;
Turati, F. ;
Pasquali, E. ;
Pelucchi, C. ;
Galeone, C. ;
Bellocco, R. ;
Negri, E. ;
Corrao, G. ;
Boffetta, P. ;
La Vecchia, C. .
BRITISH JOURNAL OF CANCER, 2015, 112 (03) :580-593
[7]   Breast Cancer Risk in Relation to the Interval Between Menopause and Starting Hormone Therapy [J].
Beral, Valerie ;
Reeves, Gillian ;
Bull, Diana ;
Green, Jane .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2011, 103 (04) :296-305
[8]   Family history and risk of breast cancer: an analysis accounting for family structure [J].
Brewer, Hannah R. ;
Jones, Michael E. ;
Schoemaker, Minouk J. ;
Ashworth, Alan ;
Swerdlow, Anthony J. .
BREAST CANCER RESEARCH AND TREATMENT, 2017, 165 (01) :193-200
[9]   The UK Biobank resource with deep phenotyping and genomic data [J].
Bycroft, Clare ;
Freeman, Colin ;
Petkova, Desislava ;
Band, Gavin ;
Elliott, Lloyd T. ;
Sharp, Kevin ;
Motyer, Allan ;
Vukcevic, Damjan ;
Delaneau, Olivier ;
O'Connell, Jared ;
Cortes, Adrian ;
Welsh, Samantha ;
Young, Alan ;
Effingham, Mark ;
McVean, Gil ;
Leslie, Stephen ;
Allen, Naomi ;
Donnelly, Peter ;
Marchini, Jonathan .
NATURE, 2018, 562 (7726) :203-+
[10]   Second-generation PLINK: rising to the challenge of larger and richer datasets [J].
Chang, Christopher C. ;
Chow, Carson C. ;
Tellier, Laurent C. A. M. ;
Vattikuti, Shashaank ;
Purcell, Shaun M. ;
Lee, James J. .
GIGASCIENCE, 2015, 4