Dense phenotyping from electronic health records enables machine learning-based prediction of preterm birth

被引:9
|
作者
Abraham, Abin [1 ,2 ]
Le, Brian [3 ]
Kosti, Idit [3 ,4 ]
Straub, Peter [1 ,5 ]
Velez-Edwards, Digna R. [1 ,6 ,7 ]
Davis, Lea K. [1 ,8 ,9 ]
Newton, J. M. [7 ]
Muglia, Louis J. [10 ]
Rokas, Antonis [6 ,11 ]
Bejan, Cosmin A. [6 ]
Sirota, Marina [3 ,4 ]
Capra, John A. [1 ,3 ,6 ,11 ]
机构
[1] Vanderbilt Univ, Vanderbilt Genet Inst, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Med Ctr, Nashville, TN 37232 USA
[3] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA 94143 USA
[4] Univ Calif San Francisco, Dept Pediat, San Francisco, CA USA
[5] Vanderbilt Univ, Med Ctr, Dept Med, Div Genet Med, Nashville, TN USA
[6] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN 37232 USA
[7] Vanderbilt Univ, Med Ctr, Dept Obstet & Gynecol, Nashville, TN 37232 USA
[8] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN USA
[9] Vanderbilt Univ, Dept Psychiat & Behav Sci, Div Genet Med, Med Ctr, Nashville, TN USA
[10] Burroughs Wellcome Fund, Res Triangle Pk, NC USA
[11] Vanderbilt Univ, Dept Biol Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
基金
美国国家卫生研究院;
关键词
Preterm birth; Machine learning; Electronic health records; Artificial intelligence; SYSTEMATIC ANALYSIS; PERINATAL OUTCOMES; GESTATIONAL-AGE; RISK-FACTORS; PREGNANCY; EPIDEMIOLOGY; MORTALITY; WOMEN; MANAGEMENT; MEDICINE;
D O I
10.1186/s12916-022-02522-x
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Identifying pregnancies at risk for preterm birth, one of the leading causes of worldwide infant mortality, has the potential to improve prenatal care. However, we lack broadly applicable methods to accurately predict preterm birth risk. The dense longitudinal information present in electronic health records (EHRs) is enabling scalable and cost-efficient risk modeling of many diseases, but EHR resources have been largely untapped in the study of pregnancy. Methods Here, we apply machine learning to diverse data from EHRs with 35,282 deliveries to predict singleton preterm birth. Results We find that machine learning models based on billing codes alone can predict preterm birth risk at various gestational ages (e.g., ROC-AUC = 0.75, PR-AUC = 0.40 at 28 weeks of gestation) and outperform comparable models trained using known risk factors (e.g., ROC-AUC = 0.65, PR-AUC = 0.25 at 28 weeks). Examining the patterns learned by the model reveals it stratifies deliveries into interpretable groups, including high-risk preterm birth subtypes enriched for distinct comorbidities. Our machine learning approach also predicts preterm birth subtypes (spontaneous vs. indicated), mode of delivery, and recurrent preterm birth. Finally, we demonstrate the portability of our approach by showing that the prediction models maintain their accuracy on a large, independent cohort (5978 deliveries) from a different healthcare system. Conclusions By leveraging rich phenotypic and genetic features derived from EHRs, we suggest that machine learning algorithms have great potential to improve medical care during pregnancy. However, further work is needed before these models can be applied in clinical settings.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Dense phenotyping from electronic health records enables machine learning-based prediction of preterm birth
    Abin Abraham
    Brian Le
    Idit Kosti
    Peter Straub
    Digna R. Velez-Edwards
    Lea K. Davis
    J. M. Newton
    Louis J. Muglia
    Antonis Rokas
    Cosmin A. Bejan
    Marina Sirota
    John A. Capra
    BMC Medicine, 20
  • [2] Machine Learning-Based Prediction Model of Preterm Birth Using Electronic Health Record
    Sun, Qi
    Zou, Xiaoxuan
    Yan, Yousheng
    Zhang, Hongguang
    Wang, Shuo
    Gao, Yongmei
    Liu, Haiyan
    Liu, Shuyu
    Lu, Jianbo
    Yang, Ying
    Ma, Xu
    JOURNAL OF HEALTHCARE ENGINEERING, 2022, 2022
  • [3] A Machine Learning-Based Prediction Model for Preterm Birth in Rural India
    Raja, Rakesh
    Mukherjee, Indrajit
    Sarkar, Bikash Kanti
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [4] Developing a Machine Learning-Based Prediction Model for Diabetes Duration Using Information from Electronic Health Records
    Guan, Dawei
    Li, Piaopiao
    Fonseca, Vivian
    Shi, Lizheng
    Ali, Mohammed K.
    Varghese, Jithin Sam
    Carrillo-Larco, Rodrigo M.
    Rouhizadeh, Masoud
    Winterstein, Almut G.
    Jiao, Tianze
    Shao, Hui
    DIABETES, 2023, 72
  • [5] Machine Learning-Based Early Prediction of Sepsis Using Electronic Health Records: A Systematic Review
    Islam, Khandaker Reajul
    Prithula, Johayra
    Kumar, Jaya
    Tan, Toh Leong
    Reaz, Mamun Bin Ibne
    Sumon, Md. Shaheenur Islam
    Chowdhury, Muhammad E. H.
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (17)
  • [6] Machine Learning Approach for Preterm Birth Prediction Using Health Records: Systematic Review
    Sharifi-Heris, Zahra
    Laitala, Juho
    Airola, Antti
    Rahmani, Amir M.
    Bender, Miriam
    JMIR MEDICAL INFORMATICS, 2022, 10 (04) : 18 - 35
  • [7] Machine Learning-Based Identification of Obesity from Positive and Unlabelled Electronic Health Records
    Blanes-Selva, Vicent
    Tortajada, Salvador
    Vilar, Ruth
    Valdivieso, Bernardo
    Garcia-Gomez, Juan M.
    DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 864 - 868
  • [8] Deep learning predicts extreme preterm birth from electronic health records
    Gao, Cheng
    Osmundson, Sarah
    Edwards, Digna R. Velez
    Jackson, Gretchen Purcell
    Malin, Bradley A.
    Chen, You
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 100
  • [9] Machine learning approaches for electronic health records phenotyping: a methodical review
    Yang, Siyue
    Varghese, Paul
    Stephenson, Ellen
    Tu, Karen
    Gronsbell, Jessica
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (02) : 367 - 381
  • [10] A machine learning-based prediction model for postoperative delirium in cardiac valve surgery using electronic health records
    Li, Qiuying
    Li, Jiaxin
    Chen, Jiansong
    Zhao, Xu
    Zhuang, Jian
    Zhong, Guoping
    Song, Yamin
    Lei, Liming
    BMC CARDIOVASCULAR DISORDERS, 2024, 24 (01):