Prediction of gestational diabetes mellitus in Asian women using machine learning algorithms

被引:6
作者
Kang, Byung Soo [1 ]
Lee, Seon Ui [2 ]
Hong, Subeen [1 ]
Choi, Sae Kyung [3 ]
Shin, Jae Eun [4 ]
Wie, Jeong Ha [5 ]
Jo, Yun Sung [2 ]
Kim, Yeon Hee [6 ]
Kil, Kicheol [7 ]
Chung, Yoo Hyun [8 ]
Jung, Kyunghoon [9 ]
Hong, Hanul [9 ]
Park, In Yang [1 ]
Ko, Hyun Sun [1 ]
机构
[1] Catholic Univ Korea, Seoul St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[2] Catholic Univ Korea, St Vincents Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[3] Catholic Univ Korea, Incheon St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[4] Catholic Univ Korea, Bucheon St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[5] Catholic Univ Korea, Eunpyeong St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[6] Catholic Univ Korea, Uijeongbu St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[7] Catholic Univ Korea, Yeouido St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[8] Catholic Univ Korea, Daejeon St Marys Hosp, Coll Med, Dept Obstet & Gynecol, Seoul, South Korea
[9] Innerwave Co Ltd, Seoul, South Korea
关键词
PREGNANCY; DIAGNOSIS; MODEL; RISK;
D O I
10.1038/s41598-023-39680-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study developed a machine learning algorithm to predict gestational diabetes mellitus (GDM) using retrospective data from 34,387 pregnancies in multi-centers of South Korea. Variables were collected at baseline, E0 (until 10 weeks' gestation), E1 (11-13 weeks' gestation) and M1 (14-24 weeks' gestation). The data set was randomly divided into training and test sets (7:3 ratio) to compare the performances of light gradient boosting machine (LGBM) and extreme gradient boosting (XGBoost) algorithms, with a full set of variables (original). A prediction model with the whole cohort achieved area under the receiver operating characteristics curve (AUC) and area under the precision-recall curve (AUPR) values of 0.711 and 0.246 at baseline, 0.720 and 0.256 at E0, 0.721 and 0.262 at E1, and 0.804 and 0.442 at M1, respectively. Then comparison of three models with different variable sets were performed: [a] variables from clinical guidelines; [b] selected variables from Shapley additive explanations (SHAP) values; and [c] Boruta algorithms. Based on model [c] with the least variables and similar or better performance than the other models, simple questionnaires were developed. The combined use of maternal factors and laboratory data could effectively predict individual risk of GDM using a machine learning model.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] An early prediction model for gestational diabetes mellitus created using machine learning algorithms
    Yang, Zhifen
    Shi, Xiaoyue
    Wang, Shengpu
    Du, Lijia
    Zhang, Xiaoying
    Zhang, Kun
    Zhang, Yongqiang
    Ma, Jinlong
    Zheng, Rui
    INTERNATIONAL JOURNAL OF GYNECOLOGY & OBSTETRICS, 2025,
  • [2] Prediction of postpartum prediabetes by machine learning methods in women with gestational diabetes mellitus
    Parkhi, Durga
    Periyathambi, Nishanthi
    Ghebremichael-Weldeselassie, Yonas
    Patel, Vinod
    Sukumar, Nithya
    Siddharthan, Rahul
    Narlikar, Leelavati
    Saravanan, Ponnusamy
    ISCIENCE, 2023, 26 (10)
  • [3] Prediction model for gestational diabetes mellitus using the XG Boost machine learning algorithm
    Hu, Xiaoqi
    Hu, Xiaolin
    Yu, Ya
    Wang, Jia
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [4] Application of machine learning algorithm incorporating dietary intake in prediction of gestational diabetes mellitus
    Ding, Tianze
    Liu, Peijie
    Jia, Jie
    Wu, Hui
    Zhu, Jie
    Yang, Kefeng
    ENDOCRINE CONNECTIONS, 2024, 13 (12)
  • [5] Prediction of postpartum diabetes in women with gestational diabetes mellitus
    Ekelund, M.
    Shaat, N.
    Almgren, P.
    Groop, L.
    Berntorp, K.
    DIABETOLOGIA, 2010, 53 (03) : 452 - 457
  • [6] Machine Learning Prediction Models for Gestational Diabetes Mellitus: Meta-analysis
    Zhang, Zheqing
    Yang, Luqian
    Han, Wentao
    Wu, Yaoyu
    Zhang, Linhui
    Gao, Chun
    Jiang, Kui
    Liu, Yun
    Wu, Huiqun
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (03)
  • [7] Nonalcoholic fatty liver disease and early prediction of gestational diabetes mellitus using machine learning methods
    Lee, Seung Mi
    Hwangbo, Suhyun
    Norwitz, Errol R.
    Koo, Ja Nam
    Oh, Ig Hwan
    Choi, Eun Saem
    Jung, Young Mi
    Kim, Sun Min
    Kim, Byoung Jae
    Kim, Sang Youn
    Kim, Gyoung Min
    Kim, Won
    Joo, Sae Kyung
    Shin, Sue
    Park, Chan-Wook
    Park, Taesung
    Park, Joong Shin
    CLINICAL AND MOLECULAR HEPATOLOGY, 2022, 28 (01) : 105 - 116
  • [8] A stepwise prediction and interpretation of gestational diabetes mellitus: Foster the practical application of machine learning in clinical decision
    Zhou, Fang
    Ran, Xiao
    Song, Fangliang
    Wu, Qinglan
    Jia, Yuan
    Liang, Ying
    Chen, Suichen
    Zhang, Guojun
    Dong, Jie
    Wang, Yukun
    HELIYON, 2024, 10 (12)
  • [9] Early Prediction of Gestational Diabetes Mellitus in the Chinese Population via Advanced Machine Learning
    Wu, Yan-Ting
    Zhang, Chen-Jie
    Mol, Ben Willem
    Kawai, Andrew
    Li, Cheng
    Chen, Lei
    Wang, Yu
    Sheng, Jian-Zhong
    Fan, Jian-Xia
    Shi, Yi
    Huang, He-Feng
    JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2021, 106 (03) : E1191 - E1205
  • [10] PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS
    Yilmaz, Atinc
    HEALTH PROBLEMS OF CIVILIZATION, 2022, 16 (02) : 128 - 139