Integrating genome-wide polygenic risk scores and non-genetic risk to predict colorectal cancer diagnosis using UK Biobank data: population based cohort study

被引:36
作者
Briggs, Sarah E. W. [1 ]
Law, Philip [2 ]
East, James E. [3 ,4 ]
Wordsworth, Sarah [4 ,5 ]
Dunlop, Malcolm [6 ]
Houlston, Richard [2 ,7 ]
Hippisley-Cox, Julia [8 ]
Tomlinson, Ian [9 ]
机构
[1] Univ Oxford, Nuffield Dept Med, Oxford, England
[2] Inst Canc Res, Div Genet & Epidemiol, London, England
[3] Univ Oxford, Nuffield Dept Med, Translat Gastroenterol Unit, Oxford, England
[4] Univ Oxford, NIHR Oxford Biomed Res Ctr, Oxford, England
[5] Univ Oxford, Hlth Econ Res Ctr, Nuffield Dept Populat Hlth, Oxford, England
[6] Univ Edinburgh, Inst Genet & Canc, Med Res Council Human Genet Unit, Colon Canc Genet Grp, Edinburgh, Scotland
[7] Inst Canc Res, Div Genet & Epidemiol, London, England
[8] Univ Oxford, Nuffield Dept Primary Care Hlth Sci, Oxford, England
[9] Univ Edinburgh, Inst Genet & Canc, Canc Res, Edinburgh, Scotland
来源
BMJ-BRITISH MEDICAL JOURNAL | 2022年 / 379卷
基金
英国惠康基金; 英国医学研究理事会;
关键词
EXTERNAL VALIDATION; MODELS; METAANALYSIS; ASSOCIATION; TRAITS;
D O I
10.1136/bmj-2022-071707
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
OBJECTIVE To evaluate the benefit of combining polygenic risk scores with the QCancer-10 (colorectal cancer) prediction model for non-genetic risk to identify people at highest risk of colorectal cancer.DESIGN Population based cohort study.SETTING Data from the UK Biobank study, collected between March 2006 and July 2010.PARTICIPANTS 434 587 individuals with complete data for genetics and QCancer-10 predictions were included in the QCancer-10 plus polygenic risk score modelling and validation cohorts.MAIN OUTCOME MEASURES Prediction of colorectal cancer diagnosis by genetic, non-genetic, and combined risk models. Using data from UK Biobank, six different polygenic risk scores for colorectal cancer were developed using LDpred2 polygenic risk score software, clumping, and thresholding approaches, and a model based on genome-wide significant polymorphisms. The top performing genome-wide polygenic risk score and the score containing genome-wide significant polymorphisms were combined with QCancer-10 and performance was compared with QCancer-10 alone. Case-control (logistic regression) and time-to-event (Cox proportional hazards) analyses were used to evaluate risk model performance in men and women.RESULTS Polygenic risk scores derived using the LDpred2 program performed best, with an odds ratio per standard deviation of 1.584 (95% confidence interval 1.536 to 1.633), and top age and sex adjusted C statistic of 0.733 (95% confidence interval 0.710 to 0.753) in logistic regression models in the validation cohort. Integrated QCancer-10 plus polygenic risk score models out-performed QCancer-10 alone. In men, the integrated LDpred2 model produced a C statistic of 0.730 (0.720 to 0.741) and explained variation of 28.2% (26.3 to 30.1), compared with 0.693 (0.682 to 0.704) and 21.0% (18.9 to 23.1) for QCancer-10 alone. In women, the C statistic for the integrated LDpred2 model was 0.687 (0.673 to 0.702) and explained variation was 21.0% (18.7 to 23.7), compared with 0.645 (0.631 to 0.659) and 12.4% (10.3 to 14.6) for QCancer-10 alone. In the top 20% of individuals at highest absolute risk, the sensitivity and specificity of the integrated LDpred2 models for predicting colorectal cancer diagnosis was 47.8% and 80.3% respectively in men, and 42.7% and 80.1% respectively in women, with increases in absolute risk in the top 5% of risk in men of 3.47-fold and in women of 2.77-fold compared with the median. Illustrative decision curve analysis indicated a small incremental improvement in net benefit with QCancer-10 plus polygenic risk score models compared with QCancer-10 alone.CONCLUSIONS Integrating polygenic risk scores with QCancer-10 modestly improves risk prediction over use of QCancer-10 alone. Given that QCancer-10 data can be obtained relatively easily from health records, use of polygenic risk score in risk stratified population screening for colorectal cancer currently has no clear justification. The added benefit, cost effectiveness, and acceptability of polygenic risk scores should be carefully evaluated in a real life screening setting before implementation in the general population.
引用
收藏
页数:15
相关论文
共 42 条
[1]  
[Anonymous], 2018, R LANG ENV STAT COMP
[2]   How to interpret a small increase in AUC with an additional risk prediction marker: decision analysis comes through [J].
Baker, Stuart G. ;
Schuit, Ewoud ;
Steyerberg, Ewout W. ;
Pencina, Michael J. ;
Vickers, Andew ;
Moons, Karel G. M. ;
Mol, Ben W. J. ;
Lindeman, Karen S. .
STATISTICS IN MEDICINE, 2014, 33 (22) :3946-3959
[3]   A simple yet accurate correction for winner's curse can predict signals discovered in much larger genome scans [J].
Bigdeli, T. Bernard ;
Lee, Donghyung ;
Webb, Bradley Todd ;
Riley, Brien P. ;
Vladimirov, Vladimir I. ;
Fanous, Ayman H. ;
Kendler, Kenneth S. ;
Bacanu, Silviu-Alin .
BIOINFORMATICS, 2016, 32 (17) :2598-2603
[4]   The UK Biobank resource with deep phenotyping and genomic data [J].
Bycroft, Clare ;
Freeman, Colin ;
Petkova, Desislava ;
Band, Gavin ;
Elliott, Lloyd T. ;
Sharp, Kevin ;
Motyer, Allan ;
Vukcevic, Damjan ;
Delaneau, Olivier ;
O'Connell, Jared ;
Cortes, Adrian ;
Welsh, Samantha ;
Young, Alan ;
Effingham, Mark ;
McVean, Gil ;
Leslie, Stephen ;
Allen, Naomi ;
Donnelly, Peter ;
Marchini, Jonathan .
NATURE, 2018, 562 (7726) :203-+
[5]   Are there ethnic and religious variations in uptake of bowel cancer screening? A retrospective cohort study among 1.7 million people in Scotland [J].
Campbell, Christine ;
Douglas, Anne ;
Williams, Linda ;
Cezard, Genevieve ;
Brewster, David H. ;
Buchanan, Duncan ;
Robb, Kathryn ;
Stanners, Greig ;
Weller, David ;
Steele, Robert J. C. ;
Steiner, Markus ;
Bhopal, Raj .
BMJ OPEN, 2020, 10 (10)
[6]   Identifying patients with undetected colorectal cancer: an independent validation of QCancer (Colorectal) [J].
Collins, G. S. ;
Altman, D. G. .
BRITISH JOURNAL OF CANCER, 2012, 107 (02) :260-265
[7]  
Collins GS, 2015, ANN INTERN MED, V162, P735, DOI [10.7326/L15-5093-2, 10.7326/L15-5093]
[8]  
Frank E., 2001, REGRESSION MODELING, DOI [DOI 10.1007/978-3-319-19425-7, DOI 10.1007/978-1-4757-3462-1]
[9]   Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks [J].
Fritsche, Lars G. ;
Patil, Snehal ;
Beesley, Lauren J. ;
VandeHaar, Peter ;
Salvatore, Maxwell ;
Ma, Ying ;
Peng, Robert B. ;
Taliun, Daniel ;
Zhou, Xiang ;
Mukherjee, Bhramar .
AMERICAN JOURNAL OF HUMAN GENETICS, 2020, 107 (05) :815-836
[10]   Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population [J].
Fry, Anna ;
Littlejohns, Thomas J. ;
Sudlow, Cathie ;
Doherty, Nicola ;
Adamska, Ligia ;
Sprosen, Tim ;
Collins, Rory ;
Allen, Naomi E. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2017, 186 (09) :1026-1034