Predicting preterm birth using explainable machine learning in a prospective cohort of nulliparous and multiparous pregnant women

被引:5
作者
Khan, Wasif [1 ,2 ]
Zaki, Nazar [1 ,2 ]
Ghenimi, Nadirah [3 ]
Ahmad, Amir [1 ,4 ]
Bian, Jiang [5 ]
Masud, Mohammad M. [1 ,2 ,4 ]
Ali, Nasloon [6 ]
Govender, Romona [3 ]
Ahmed, Luai A. [4 ,6 ]
机构
[1] United Arab Emirates Univ, Coll Informat Technol, Dept Comp Sci & Software Engn, Al Ain, U Arab Emirates
[2] United Arab Emirates Univ, Coll Informat Technol, Dept Informat Syst & Secur, Al Ain, U Arab Emirates
[3] United Arab Emirates Univ, Coll Med & Hlth Sci, Dept Family Med, Al Ain, U Arab Emirates
[4] United Arab Emirates Univ, Zayed Ctr Hlth Sci, Al Ain, U Arab Emirates
[5] Univ Florida, Coll Med, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL USA
[6] United Arab Emirates Univ, Inst Publ Hlth, Coll Med & Hlth Sci, Al Ain, U Arab Emirates
来源
PLOS ONE | 2023年 / 18卷 / 12期
关键词
RISK;
D O I
10.1371/journal.pone.0293925
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Preterm birth (PTB) presents a complex challenge in pregnancy, often leading to significant perinatal and long-term morbidities. "While machine learning (ML) algorithms have shown promise in PTB prediction, the lack of interpretability in existing models hinders their clinical utility. This study aimed to predict PTB in a pregnant population using ML models, identify the key risk factors associated with PTB through the SHapley Additive exPlanations (SHAP) algorithm, and provide comprehensive explanations for these predictions to assist clinicians in providing appropriate care. This study analyzed a dataset of 3509 pregnant women in the United Arab Emirates and selected 35 risk factors associated with PTB based on the existing medical and artificial intelligence literature. Six ML algorithms were tested, wherein the XGBoost model exhibited the best performance, with an area under the operator receiving curves of 0.735 and 0.723 for parous and nulliparous women, respectively. The SHAP feature attribution framework was employed to identify the most significant risk factors linked to PTB. Additionally, individual patient analysis was performed using the SHAP and the local interpretable model-agnostic explanation algorithms (LIME). The overall incidence of PTB was 11.23% (11 and 12.1% in parous and nulliparous women, respectively). The main risk factors associated with PTB in parous women are previous PTB, previous cesarean section, preeclampsia during pregnancy, and maternal age. In nulliparous women, body mass index at delivery, maternal age, and the presence of amniotic infection were the most relevant risk factors. The trained ML prediction model developed in this study holds promise as a valuable screening tool for predicting PTB within this specific population. Furthermore, SHAP and LIME analyses can assist clinicians in understanding the individualized impact of each risk factor on their patients and provide appropriate care to reduce morbidity and mortality related to PTB.
引用
收藏
页数:17
相关论文
共 52 条
  • [31] Effect of airborne particulate matter of 2.5 μm or less on preterm birth: A national birth cohort study in China
    Li, Qin
    Wang, Yuan-yuan
    Guo, Yuming
    Zhou, Hong
    Wang, Xiaobin
    Wang, Qiaomei
    Shen, Haiping
    Zhang, Yiping
    Yan, Donghai
    Zhang, Ya
    Zhang, Hongguang
    Li, Shanshan
    Chen, Gongbo
    Lin, Lizi
    Zhao, Jun
    He, Yuan
    Yang, Ying
    Xu, Jihong
    Wang, Yan
    Peng, Zuoqi
    Wang, Hai-Jun
    Ma, Xu
    [J]. ENVIRONMENT INTERNATIONAL, 2018, 121 : 1128 - 1136
  • [32] Li W., 2021, BMJ Open, V11
  • [33] Lipton Zachary C, 2017, ARXIV
  • [34] Global, regional, and national causes of under-5 mortality in 2000-15: an updated systematic analysis with implications for the Sustainable Development Goals
    Liu, Li
    Oza, Shefali
    Hogan, Dan
    Chu, Yue
    Perin, Jamie
    Zhu, Jun
    Lawn, Joy E.
    Cousens, Simon
    Mathers, Colin
    Black, Robert E.
    [J]. LANCET, 2016, 388 (10063) : 3027 - 3035
  • [35] Lundberg SM, 2017, ADV NEUR IN, V30
  • [36] Prediction models for the risk of spontaneous preterm birth based on maternal characteristics: a systematic review and independent external validation
    Meertens, Linda J. E.
    van Montfort, Pim
    Scheepers, Hubertina C. J.
    van Kuijk, Sander M. J.
    Aardenburg, Robert
    Langenveld, Josje
    van Dooren, Ivo M. A.
    Zwaan, Iris M.
    Spaanderman, Marc E. A.
    Smits, Luc J. M.
    [J]. ACTA OBSTETRICIA ET GYNECOLOGICA SCANDINAVICA, 2018, 97 (08) : 907 - 920
  • [37] The preterm prediction study: A clinical risk assessment system
    Mercer, BM
    Goldenberg, RL
    Das, A
    Moawad, AH
    Iams, JD
    Meis, PJ
    Copper, RL
    Johnson, F
    Thom, E
    McNellis, D
    Miodovnik, M
    Menard, MK
    Caritis, SN
    Thurnau, GR
    Bottoms, SF
    Roberts, J
    [J]. AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 1996, 174 (06) : 1885 - 1893
  • [38] Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review
    Payrovnaziri, Seyedeh Neelufar
    Chen, Zhaoyi
    Rengifo-Moreno, Pablo
    Miller, Tim
    Bian, Jiang
    Chen, Jonathan H.
    Liu, Xiuwen
    He, Zhe
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (07) : 1173 - 1185
  • [39] Phung, 2016, ARXIV
  • [40] Machine Learning Approach for Preterm Birth Prediction Based on Maternal Chronic Conditions
    Prema, N. S.
    Pushpalatha, M. P.
    [J]. EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY, ICERECT 2018, 2019, 545 : 581 - 588