Predicting CoVID-19 community mortality risk using machine learning and development of an online prognostic tool

被引:51
作者
Das, Ashis Kumar [1 ]
Mishra, Shiba [2 ]
Gopalan, Saji Saraswathy [1 ]
机构
[1] World Bank, 1818 H St NW, Washington, DC 20433 USA
[2] Credit Suisse Private Ltd, Pune, Maharashtra, India
关键词
CoVID-19; Machine learning; Modelling; Mortality risk prediction; Decision support; CLINICAL CHARACTERISTICS;
D O I
10.7717/peerj.10083
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The recent pandemic of CoVID-19 has emerged as a threat to global health security. There are very few prognostic models on CoVID-19 using machine learning. Objectives: To predict mortality among confirmed CoVID-19 patients in South Korea using machine learning and deploy the best performing algorithm as an open-source online prediction tool for decision-making. Materials and Methods: Mortality for confirmed CoVID-19 patients (n = 3,524) between January 20, 2020 and May 30, 2020 was predicted using five machine learning algorithms (logistic regression, support vector machine, K nearest neighbor, random forest and gradient boosting). The performance of the algorithms was compared, and the best performing algorithm was deployed as an online prediction tool. Results: The logistic regression algorithm was the best performer in terms of discrimination (area under ROC curve = 0.830), calibration (Matthews Correlation Coefficient = 0.433; Brier Score = 0.036) and. The best performing algorithm (logistic regression) was deployed as the online CoVID-19 Community Mortality Risk Prediction tool named CoCoMoRP (https://ashis-das.shinyapps.io/CoCoMoRP/). Conclusions: We describe the development and deployment of an open-source machine learning tool to predict mortality risk among CoVID-19 confirmed patients using publicly available surveillance data. This tool can be utilized by potential stakeholders such as health providers and policymakers to triage patients at the community level in addition to other approaches.
引用
收藏
页数:12
相关论文
共 30 条
[1]  
[Anonymous], 2016, STAT ANAL DATA MIN, DOI DOI 10.1111/VRU.12350
[2]   Artificial Intelligence and Big Data in Public Health [J].
Benke, Kurt ;
Benke, Geza .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2018, 15 (12)
[3]   Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study [J].
Brinati, Davide ;
Campagner, Andrea ;
Ferrari, Davide ;
Locatelli, Massimo ;
Banfi, Giuseppe ;
Cabitza, Federico .
JOURNAL OF MEDICAL SYSTEMS, 2020, 44 (08)
[4]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[5]   Clinical characteristics and intrauterine vertical transmission potential of COVID-19 infection in nine pregnant women: a retrospective review of medical records [J].
Chen, Huijun ;
Guo, Juanjuan ;
Wang, Chen ;
Luo, Fan ;
Yu, Xuechen ;
Zhang, Wei ;
Li, Jiafu ;
Zhao, Dongchi ;
Xu, Dan ;
Gong, Qing ;
Liao, Jing ;
Yang, Huixia ;
Hou, Wei ;
Zhang, Yuanzhen .
LANCET, 2020, 395 (10226) :809-815
[6]   Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations [J].
Chen, Jonathan H. ;
Asch, Steven M. .
NEW ENGLAND JOURNAL OF MEDICINE, 2017, 376 (26) :2507-2509
[7]   The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation [J].
Chicco, Davide ;
Jurman, Giuseppe .
BMC GENOMICS, 2020, 21 (01)
[8]  
Coronavirus Resource Center, 2020, COVID 19 DASHB CTR S
[9]   Machine Learning in Medicine [J].
Deo, Rahul C. .
CIRCULATION, 2015, 132 (20) :1920-1930
[10]  
Feng C, 2020, The Lancet, DOI [10.1101/2020.03.19.20039099, DOI 10.2139/SSRN.3551355, 10.2139/ ssrn.3551355, DOI 10.1101/2020.03.19.20039099]