Prediction of Intracranial Aneurysm Risk using Machine Learning

被引:30
作者
Heo, Jaehyuk [1 ,2 ]
Park, Sang Jun [3 ,4 ]
Kang, Si-Hyuck [3 ,5 ]
Oh, Chang Wan [1 ]
Bang, Jae Seung [1 ]
Kim, Tackeun [1 ,3 ]
机构
[1] Seoul Natl Univ, Coll Med, Bundang Hosp, Dept Neurosurg, Seongnam Si, South Korea
[2] Univ Suwon, Dept Appl Stat, Hwaseong Si, South Korea
[3] Seoul Natl Univ, Bundang Hosp, Big Data Ctr, Dept Future Innovat Res, Seongnam Si, South Korea
[4] Seoul Natl Univ, Coll Med, Bundang Hosp, Dept Ophthalmol, Seongnam Si, South Korea
[5] Seoul Natl Univ, Coll Med, Bundang Hosp, Div Cardiol,Dept Internal Med, Seongnam Si, South Korea
关键词
PREVALENCE; COHORT; KOREA;
D O I
10.1038/s41598-020-63906-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
An efficient method for identifying subjects at high risk of an intracranial aneurysm (IA) is warranted to provide adequate radiological screening guidelines and effectively allocate medical resources. We developed a model for pre-diagnosis IA prediction using a national claims database and health examination records. Data from the National Health Screening Program in Korea were utilized as input for several machine learning algorithms: logistic regression (LR), random forest (RF), scalable tree boosting system (XGB), and deep neural networks (DNN). Algorithm performance was evaluated through the area under the receiver operating characteristic curve (AUROC) using different test data from that employed for model training. Five risk groups were classified in ascending order of risk using model prediction probabilities. Incidence rate ratios between the lowest- and highest-risk groups were then compared. The XGB model produced the best IA risk prediction (AUROC of 0.765) and predicted the lowest IA incidence (3.20) in the lowest-risk group, whereas the RF model predicted the highest IA incidence (161.34) in the highest-risk group. The incidence rate ratios between the lowest- and highest-risk groups were 49.85, 35.85, 34.90, and 30.26 for the XGB, LR, DNN, and RF models, respectively. The developed prediction model can aid future IA screening strategies.
引用
收藏
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 2014, PREPRINT
[2]  
[Anonymous], ASSESSMENT METRICS I
[3]   NATURAL-HISTORY AND RISK-FACTORS OF UNRUPTURED CEREBRAL ANEURYSMS [J].
ASARI, S ;
OHMOTO, T .
CLINICAL NEUROLOGY AND NEUROSURGERY, 1993, 95 (03) :205-214
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Chen T., XGBoost: A Scalable Tree Boosting System|Semantic ScholarEB/OL, P785
[6]   CT Radiation Dose: Current Controversies and Dose Reduction Strategies [J].
Costello, Justin E. ;
Cecava, Nathan D. ;
Tucker, Jonathan E. ;
Bau, Jennifer L. .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2013, 201 (06) :1283-1290
[7]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[8]   A data-driven approach to predicting diabetes and cardiovascular disease with machine learning [J].
Dinh, An ;
Miertschin, Stacey ;
Young, Amber ;
Mohanty, Somya D. .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
[9]   A multi-parameterized artificial neural network for lung cancer risk prediction [J].
Hart, Gregory R. ;
Roffman, David A. ;
Decker, Roy ;
Deng, Jun .
PLOS ONE, 2018, 13 (10)
[10]   Estimating the Global Incidence of Aneurysmal Subarachnoid Hemorrhage: A Systematic Review for Central Nervous System Vascular Lesions and Meta-Analysis of Ruptured Aneurysms [J].
Hughes, Joshua D. ;
Bond, Kamila M. ;
Mekary, Rania A. ;
Dewan, Michael C. ;
Rattani, Abbas ;
Baticulon, Ronnie ;
Kato, Yoko ;
Azevedo-Filho, Hildo ;
Morcos, Jacques J. ;
Park, Kee B. .
WORLD NEUROSURGERY, 2018, 115 :430-+