Improving mixed-integer temporal modeling by generating synthetic data using conditional generative adversarial networks: A case study of fluid overload prediction in the intensive care unit

被引:4
作者
Rafiei, Alireza [1 ]
Rad, Milad Ghiasi [2 ]
Sikora, Andrea [3 ]
Kamaleswaran, Rishikesan [4 ,5 ]
机构
[1] Emory Univ, Dept Comp Sci & Informat, Ste W302,400 Dowman Dr, Atlanta, GA 30322 USA
[2] Georgia Inst Technol, Dept Elect & Comp Engn, Atlanta, GA USA
[3] Univ Georgia, Coll Pharm, Dept Clin & Adm Pharm, Augusta, GA USA
[4] Emory Univ, Dept Biomed Informat, Sch Med, Atlanta, GA USA
[5] Georgia Inst Technol, Dept Biomed Engn, Atlanta, GA USA
基金
美国国家卫生研究院;
关键词
Critical care; Fluid overload; GAN; Machine learning; Mixed-integer temporal modeling; Synthetic data; MACHINE; SMOTE;
D O I
10.1016/j.compbiomed.2023.107749
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective: The challenge of mixed-integer temporal data, which is particularly prominent for medication use in the critically ill, limits the performance of predictive models. The purpose of this evaluation was to pilot test integrating synthetic data within an existing dataset of complex medication data to improve machine learning model prediction of fluid overload.Materials and methods: This retrospective cohort study evaluated patients admitted to an ICU >= 72 h. Four machine learning algorithms to predict fluid overload after 48-72 h of ICU admission were developed using the original dataset. Then, two distinct synthetic data generation methodologies (synthetic minority over-sampling technique (SMOTE) and conditional tabular generative adversarial network (CTGAN)) were used to create synthetic data. Finally, a stacking ensemble technique designed to train a meta-learner was established. Models underwent training in three scenarios of varying qualities and quantities of datasets.Results: Training machine learning algorithms on the combined synthetic and original dataset overall increased the performance of the predictive models compared to training on the original dataset. The highest performing model was the meta-model trained on the combined dataset with 0.83 AUROC while it managed to significantly enhance the sensitivity across different training scenarios. Discussion: The integration of synthetically generated data is the first time such methods have been applied to ICU medication data and offers a promising solution to enhance the performance of machine learning models for fluid overload, which may be translated to other ICU outcomes. A meta-learner was able to make a trade-off between different performance metrics and improve the ability to identify the minority class.
引用
收藏
页数:10
相关论文
共 47 条
  • [1] Development of Machine Learning Models to Validate a Medication Regimen Complexity Scoring Tool for Critically Ill Patients
    Al-Mamun, Mohammad A.
    Brothers, Todd
    Newsome, Andrea Sikora
    [J]. ANNALS OF PHARMACOTHERAPY, 2021, 55 (04) : 421 - 429
  • [2] World Medical Association Declaration of Helsinki Ethical Principles for Medical Research Involving Human Subjects
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2013, 310 (20): : 2191 - 2194
  • [3] Improving Sepsis Prediction Performance Using Conditional Recurrent Adversarial Networks
    Apalak, Merve
    Kiasaleh, Kamran
    [J]. IEEE ACCESS, 2022, 10 : 134466 - 134476
  • [4] Multiple imputation by chained equations: what is it and how does it work?
    Azur, Melissa J.
    Stuart, Elizabeth A.
    Frangakis, Constantine
    Leaf, Philip J.
    [J]. INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2011, 20 (01) : 40 - 49
  • [5] Beery S, 2020, IEEE WINT CONF APPL, P852, DOI [10.1109/wacv45572.2020.9093570, 10.1109/WACV45572.2020.9093570]
  • [6] A narrative review of pharmacologic de-resuscitation in the critically ill
    Bissell, Brittany D.
    Donaldson, J. Chris
    Morris, Peter E.
    Neyra, Javier A.
    [J]. JOURNAL OF CRITICAL CARE, 2020, 59 : 156 - 162
  • [7] Impact of protocolized diuresis for de-resuscitation in the intensive care unit
    Bissell, Brittany D.
    Laine, Melanie E.
    Thompson Bastin, Melissa L.
    Flannery, Alexander H.
    Kelly, Andrew
    Riser, Jeremy
    Neyra, Javier A.
    Potter, Jordan
    Morris, Peter E.
    [J]. CRITICAL CARE, 2020, 24 (01):
  • [8] Fluid Stewardship of Maintenance Intravenous Fluids
    Carr, John R.
    Hawkins, W. Anthony
    Newsome, Andrea Sikora
    Smith, Susan E.
    Amber, Clemmons B.
    Bland, Christopher M.
    Branan, Trisha N.
    [J]. JOURNAL OF PHARMACY PRACTICE, 2022, 35 (05) : 769 - 782
  • [9] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [10] Synthetic data in machine learning for medicine and healthcare
    Chen, Richard J.
    Lu, Ming Y.
    Chen, Tiffany Y.
    Williamson, Drew F. K.
    Mahmood, Faisal
    [J]. NATURE BIOMEDICAL ENGINEERING, 2021, 5 (06) : 493 - 497