Machine Learning Implementations for Multi-class Cardiovascular Risk Prediction in Family Health Units

被引:0
作者
Sozen, Mert Erkan [1 ]
Sariyer, Gorkem [2 ]
Sozen, Mustafa Yigit [3 ]
Badhotiya, Gaurav Kumar [4 ]
Vijavargy, Lokesh [5 ]
机构
[1] Izmir Metro Co, Izmir, Turkiye
[2] Yasar Univ, Business Adm, Izmir, Turkiye
[3] Ayvalik 2 Family Hlth Unit, Balikesir, Turkiye
[4] Indian Inst Management Ahmedabad IIMA, Operat & Decis Sci, Ahmadabad, Gujarat, India
[5] Jaipuria Inst Management Jaipur, Jaipur, Rajasthan, India
关键词
Cardiovascular diseases; Machine learning; Risk prediction; Family health units; SCORE-Turkey; ARTIFICIAL-INTELLIGENCE; PRIMARY-CARE; BIG DATA; DISEASE; VALIDATION; FRAMINGHAM; REGRESSION; DERIVATION; TURKEY; SCORE;
D O I
10.33889/IJMEMS.2023.8.6.066
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Cardiovascular disease (CVD) risk prediction plays a significant role in clinical research since it is the key to primary prevention. As family health units follow up on a specific group of patients, particularly in the middle-aged and elderly groups, CVD risk prediction has additional importance for them. In a retrospectively collected data set from a family health unit in Turkey in 2018, we evaluated the CVD risk levels of patients based on SCORE-Turkey. By identifying additional CVD risk factors for SCORE-Turkey and grouping the study patients into 3-classes "low risk," "moderate risk," and "high risk" patients, we proposed a machine learning implemented early warning system for CVD risk prediction in family health units. Body mass index, diastolic blood pressures, serum glucose, creatinine, urea, uric acid levels, and HbA1c were significant additional CVD risk factors to SCORE-Turkey. All of the five implemented algorithms, k-nearest neighbour (KNN), random forest (RF), decision tree (DT), logistic regression (LR), and support vector machines (SVM), had high prediction performances for both the K4 and K5 partitioning protocols. With 89.7% and 92.1% accuracies for K4 and K5 protocols, KNN outperformed the other algorithms. For the five ML algorithms, while for the " low risk" category, precision and recall measures varied between 95% to 100%, "moderate risk," and "high risk" categories, these measures varied between 60% to 92%. Machine learning-based algorithms can be used in CVD risk prediction by enhancing prediction performances and combining various risk factors having complex relationships.
引用
收藏
页码:1171 / 1187
页数:17
相关论文
共 55 条
[1]   Assessment of Risk Factors and Biomarkers Associated With Risk of Cardiovascular Disease Among Women Consuming a Mediterranean Diet [J].
Ahmad, Shafqat ;
Moorthy, M. Vinayaga ;
Demler, Olga, V ;
Hu, Frank B. ;
Ridker, Paul M. ;
Chasman, Daniel, I ;
Mora, Samia .
JAMA NETWORK OPEN, 2018, 1 (08)
[2]   Changes in primary care provision in Turkey: A comparison of 1993 and 2012 [J].
Akman, Mehmet ;
Sakarya, Sibel ;
Sargin, Mehmet ;
Unluoglu, Ilhami ;
Egici, Memet Taskin ;
Boerma, Wienke G. W. ;
Schafer, Willemijn L. A. .
HEALTH POLICY, 2017, 121 (02) :197-206
[3]  
[Anonymous], 2019, Series in BioEngineering, DOI DOI 10.1007/978-981-10-5092-314
[4]   Factors Relating to Decision Delay in the Emergency Department: Effects of Diagnostic Tests and Consultations [J].
Ataman, Mustafa Gokalp ;
Sariyer, Gorkem ;
Saglam, Caner ;
Karagoz, Arif ;
Unluer, Erden Erol .
OPEN ACCESS EMERGENCY MEDICINE, 2023, 15 :119-131
[5]   Predicting waiting and treatment times in emergency departments using ordinal logistic regression models [J].
Ataman, Mustafa Gokalp ;
Sariyer, Gorkem .
AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2021, 46 :45-50
[6]   Revolutionizing cardiovascular risk prediction in patients with chronic kidney disease: machine learning and large-scale proteomic risk prediction model lead the way [J].
Avram, Robert .
EUROPEAN HEART JOURNAL, 2023, 44 (23) :2111-2113
[7]   A study of factors related to patients' length of stay using data mining techniques in a general hospital in southern Iran [J].
Ayyoubzadeh, Seyed Mohammad ;
Ghazisaeedi, Marjan ;
Kalhori, Sharareh Rostam Niakan ;
Hassaniazad, Mehdi ;
Baniasadi, Tayebeh ;
Maghooli, Keivan ;
Kahnouji, Kobra .
HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
[8]   Data mining for censored time-to-event data: a Bayesian network model for predicting cardiovascular risk from electronic health record data [J].
Bandyopadhyay, Sunayan ;
Wolfson, Julian ;
Vock, David M. ;
Vazquez-Benitez, Gabriela ;
Adomavicius, Gediminas ;
Elidrisi, Mohamed ;
Johnson, Paul E. ;
O'Connor, Patrick J. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (04) :1033-1069
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]  
Cho SH, 2021, SCI REP-UK, V11, DOI [10.1038/s41598-021-83585-3, 10.1038/s41598-021-85813-2, 10.1038/s41598-021-88257-w]