Predicting preterm birth using explainable machine learning in a prospective cohort of nulliparous and multiparous pregnant women

被引:5
作者
Khan, Wasif [1 ,2 ]
Zaki, Nazar [1 ,2 ]
Ghenimi, Nadirah [3 ]
Ahmad, Amir [1 ,4 ]
Bian, Jiang [5 ]
Masud, Mohammad M. [1 ,2 ,4 ]
Ali, Nasloon [6 ]
Govender, Romona [3 ]
Ahmed, Luai A. [4 ,6 ]
机构
[1] United Arab Emirates Univ, Coll Informat Technol, Dept Comp Sci & Software Engn, Al Ain, U Arab Emirates
[2] United Arab Emirates Univ, Coll Informat Technol, Dept Informat Syst & Secur, Al Ain, U Arab Emirates
[3] United Arab Emirates Univ, Coll Med & Hlth Sci, Dept Family Med, Al Ain, U Arab Emirates
[4] United Arab Emirates Univ, Zayed Ctr Hlth Sci, Al Ain, U Arab Emirates
[5] Univ Florida, Coll Med, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL USA
[6] United Arab Emirates Univ, Inst Publ Hlth, Coll Med & Hlth Sci, Al Ain, U Arab Emirates
来源
PLOS ONE | 2023年 / 18卷 / 12期
关键词
RISK;
D O I
10.1371/journal.pone.0293925
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Preterm birth (PTB) presents a complex challenge in pregnancy, often leading to significant perinatal and long-term morbidities. "While machine learning (ML) algorithms have shown promise in PTB prediction, the lack of interpretability in existing models hinders their clinical utility. This study aimed to predict PTB in a pregnant population using ML models, identify the key risk factors associated with PTB through the SHapley Additive exPlanations (SHAP) algorithm, and provide comprehensive explanations for these predictions to assist clinicians in providing appropriate care. This study analyzed a dataset of 3509 pregnant women in the United Arab Emirates and selected 35 risk factors associated with PTB based on the existing medical and artificial intelligence literature. Six ML algorithms were tested, wherein the XGBoost model exhibited the best performance, with an area under the operator receiving curves of 0.735 and 0.723 for parous and nulliparous women, respectively. The SHAP feature attribution framework was employed to identify the most significant risk factors linked to PTB. Additionally, individual patient analysis was performed using the SHAP and the local interpretable model-agnostic explanation algorithms (LIME). The overall incidence of PTB was 11.23% (11 and 12.1% in parous and nulliparous women, respectively). The main risk factors associated with PTB in parous women are previous PTB, previous cesarean section, preeclampsia during pregnancy, and maternal age. In nulliparous women, body mass index at delivery, maternal age, and the presence of amniotic infection were the most relevant risk factors. The trained ML prediction model developed in this study holds promise as a valuable screening tool for predicting PTB within this specific population. Furthermore, SHAP and LIME analyses can assist clinicians in understanding the individualized impact of each risk factor on their patients and provide appropriate care to reduce morbidity and mortality related to PTB.
引用
收藏
页数:17
相关论文
共 52 条
  • [1] Mutaba'ah-Mother and Child Health Study: protocol for a prospective cohort study investigating the maternal and early life determinants of infant, child, adolescent and maternal health in the United Arab Emirates
    Al Haddad, Amal
    Ali, Nasloon
    Elbarazi, Iffat
    Elabadlah, Haba
    Al-Maskari, Fatima
    Narchi, Hassib
    Brabon, Christel
    Ghazal-Aswad, Saad
    AlShalabi, Fatima M.
    Zampelas, Antonis
    Loney, Tom
    Blair, Iain
    Ahmed, Luai A.
    [J]. BMJ OPEN, 2019, 9 (08):
  • [2] [Anonymous], A supervised machine learning-based methodology for analyzing dysregulation in splicing machinery: An application in cancer diagnosis
  • [3] [Anonymous], EXPLAINABLE ARTIFICI
  • [4] Prediction and associations of preterm birth and its subtypes with eicosanoid enzymatic pathways and inflammatory markers
    Aung, Max T.
    Yu, Youfei
    Ferguson, Kelly K.
    Cantonwine, David E.
    Zeng, Lixia
    McElrath, Thomas F.
    Pennathur, Subramaniam
    Mukherjee, Bhramar
    Meeker, John D.
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [5] Prediction of preterm birth in nulliparous women using logistic regression and machine learning
    Belaghi, Reza Arabi
    Beyene, Joseph
    McDonald, Sarah D.
    [J]. PLOS ONE, 2021, 16 (06):
  • [6] Clinical risk models for preterm birth less than 28 weeks and less than 32 weeks of gestation using a large retrospective cohort
    Belaghi, Reza Arabi
    Beyene, Joseph
    McDonald, Sarah D.
    [J]. JOURNAL OF PERINATOLOGY, 2021, 41 (09) : 2173 - 2181
  • [7] Born Too Soon: The global epidemiology of 15 million preterm births
    Blencowe, Hannah
    Cousens, Simon
    Chou, Doris
    Oestergaard, Mikkel
    Say, Lale
    Moller, Ann-Beth
    Kinney, Mary
    Lawn, Joy
    [J]. REPRODUCTIVE HEALTH, 2013, 10
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] Predicting high-risk preterm birth using artificial neural networks
    Catley, Christina
    Frize, Monique
    Walker, C. Robin
    Petriu, Dorina C.
    [J]. IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2006, 10 (03): : 540 - 549
  • [10] Global, regional, and national estimates of levels of preterm birth in 2014: a systematic review and modelling analysis
    Chawanpaiboon, Saifon
    Vogel, Joshua P.
    Moller, Ann-Beth
    Lumbiganon, Pisake
    Petzold, Max
    Hogan, Daniel
    Landoulsi, Sihem
    Jampathong, Nampet
    Kongwattanakul, Kiattisak
    Laopaiboon, Malinee
    Lewis, Cameron
    Rattanakanokchai, Siwanon
    Teng, Ditza N.
    Thinkhamrop, Jadsada
    Watananirun, Kanokwaroon
    Zhang, Jun
    Zhou, Wei
    Gulmezoglu, A. Metin
    [J]. LANCET GLOBAL HEALTH, 2019, 7 (01): : E37 - E46