Enhancing severe hypoglycemia prediction in type 2 diabetes mellitus through multi-view co-training machine learning model for imbalanced dataset

被引:2
作者
Agraz, Melih [1 ,2 ,6 ]
Deng, Yixiang [4 ,5 ]
Karniadakis, George Em [1 ,3 ]
Mantzoros, Christos Socrates [6 ]
机构
[1] Brown Univ, Div Appl Math, Providence, RI 02912 USA
[2] Giresun Univ, Dept Stat, TR-28200 Giresun, Turkiye
[3] Brown Univ, Sch Engn, Providence, RI 02912 USA
[4] Univ Delaware, Coll Engn, Dept Comp & Informat Sci, Newark, DE 19716 USA
[5] MIT & Harvard, Ragon Inst Mass Gen, Cambridge, MA 02142 USA
[6] Harvard Med Sch, Beth Israel Deaconess Med Ctr, Dept Endocrinol, Boston, MA 02215 USA
关键词
FEATURE-SELECTION; MORTALITY; HEALTH; RISK;
D O I
10.1038/s41598-024-69844-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Patients with type 2 diabetes mellitus (T2DM) who have severe hypoglycemia (SH) poses a considerable risk of long-term death, especially among the elderly, demanding urgent medical attention. Accurate prediction of SH remains challenging due to its multifaced nature, contributed from factors such as medications, lifestyle choices, and metabolic measurements. In this study, we propose a systematic approach to improve the robustness and accuracy of SH predictions using machine learning models, guided by clinical feature selection. Our focus is on developing long-term SH prediction models using both semi-supervised learning and supervised learning algorithms. Using the action to control cardiovascular risk in diabetes trial, which includes electronic health records for over 10,000 individuals, we focus on studying adults with T2DM. Our results indicate that the application of a multi-view co-training method, incorporating the random forest algorithm, improves the specificity of SH prediction, while the same setup with Naive Bayes replacing random forest demonstrates better sensitivity. Our framework also provides interpretability of machine learning models by identifying key predictors for hypoglycemia, including fasting plasma glucose, hemoglobin A1c, general diabetes education, and NPH or L insulins. The integration of data routinely available in electronic health records significantly enhances our model's capability to predict SH events, showcasing its potential to transform clinical practice by facilitating early interventions and optimizing patient management. By enhancing prediction accuracy and identifying crucial predictive features, our study contributes to advancing the understanding and management of hypoglycemia in this population.
引用
收藏
页数:12
相关论文
共 66 条
[31]   Development of early prediction model for pregnancy-associated hypertension with graph-based semi-supervised learning [J].
Lee, Seung Mi ;
Nam, Yonghyun ;
Choi, Eun Saem ;
Jung, Young Mi ;
Sriram, Vivek ;
Leiby, Jacob S. ;
Koo, Ja Nam ;
Oh, Ig Hwan ;
Kim, Byoung Jae ;
Kim, Sun Min ;
Kim, Sang Youn ;
Kim, Gyoung Min ;
Joo, Sae Kyung ;
Shin, Sue ;
Norwitz, Errol R. ;
Park, Chan-Wook ;
Jun, Jong Kwan ;
Kim, Won ;
Kim, Dokyoon ;
Park, Joong Shin .
SCIENTIFIC REPORTS, 2022, 12 (01)
[32]   Electronic health records based reinforcement learning for treatment optimizing [J].
Li, Tianhao ;
Wang, Zhishun ;
Lu, Wei ;
Zhang, Qian ;
Li, Dengfeng .
INFORMATION SYSTEMS, 2022, 104
[33]   Overfitting in linear feature extraction for classification of high-dimensional image data [J].
Liu, Raymond ;
Gillies, Duncan F. .
PATTERN RECOGNITION, 2016, 53 :73-86
[34]  
Long C., 2022, Front. Endocrinol., V1870
[35]   Explainable machine-learning predictions for the prevention of hypoxaemia during surgery [J].
Lundberg, Scott M. ;
Nair, Bala ;
Vavilala, Monica S. ;
Horibe, Mayumi ;
Eisses, Michael J. ;
Adams, Trevor ;
Liston, David E. ;
Low, Daniel King-Wai ;
Newman, Shu-Fang ;
Kim, Jerry ;
Lee, Su-In .
NATURE BIOMEDICAL ENGINEERING, 2018, 2 (10) :749-760
[36]   Multiple predictively equivalent risk models for handling missing data at time of prediction: With an application in severe hypoglycemia risk prediction for type 2 diabetes [J].
Ma, Sisi ;
Schreiner, Pamela J. ;
Seaquist, Elizabeth R. ;
Ugurbil, Mehmet ;
Zmora, Rachel ;
Chow, Lisa S. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
[37]  
Madhavan Dharanija, 2013, Frontiers in Genetics, V4, P116, DOI 10.3389/fgene.2013.00116
[38]   Association between glycated haemoglobin levels and cardiovascular outcomes in patients with type 2 diabetes and cardiovascular disease: a secondary analysis of theTECOSrandomized clinical trial [J].
McAlister, Finlay A. ;
Zheng, Yinggan ;
Westerhout, Cynthia M. ;
Buse, John B. ;
Standl, Eberhard ;
McGuire, Darren K. ;
van de Werf, Frans ;
Green, Jennifer B. ;
Armstrong, Paul W. ;
Holman, Rury R. .
EUROPEAN JOURNAL OF HEART FAILURE, 2020, 22 (11) :2026-2034
[39]   Increased Mortality of Patients With Diabetes Reporting Severe Hypoglycemia [J].
McCoy, Rozalina G. ;
Van Houten, Holly K. ;
Ziegenfuss, Jeanette Y. ;
Shah, Nilay D. ;
Wermers, Robert A. ;
Smith, Steven A. .
DIABETES CARE, 2012, 35 (09) :1897-1901
[40]   HYPOGLYCEMIA: MINIMIZING ITS IMPACT IN TYPE 2 DIABETES [J].
Moghissi, Etie ;
Ismail-Beigi, Faramarz ;
Devine, Robin Cammarota .
ENDOCRINE PRACTICE, 2013, 19 (03) :526-535