Using Domain Adaptation and Inductive Transfer Learning to Improve Patient Outcome Prediction in the Intensive Care Unit: Retrospective Observational Study

被引:0
作者
Mutnuri, Maruthi Kumar [1 ,2 ]
Stelfox, Henry Thomas [3 ,4 ]
Forkert, Nils Daniel [5 ,6 ]
Lee, Joon [1 ,7 ,8 ,9 ]
机构
[1] Univ Calgary, Cumming Sch Med, Data Intelligence Hlth Lab, CWPH 5E17,3280 Hosp Dr Northwest, Calgary, AB T2N 4Z6, Canada
[2] Univ Calgary, Schulich Sch Engn, Dept Biomed Engn, Calgary, AB, Canada
[3] Univ Calgary, Cumming Sch Med, Dept Crit Care Med, Calgary, AB, Canada
[4] Univ Calgary, OBrien Inst Publ Hlth, Cumming Sch Med, Calgary, AB, Canada
[5] Univ Calgary, Cumming Sch Med, Dept Radiol, Calgary, AB, Canada
[6] Univ Calgary, Alberta Childrens Hosp Res Inst, Cumming Sch Med, Calgary, AB, Canada
[7] Univ Calgary, Cumming Sch Med, Dept Cardiac Sci, Calgary, AB, Canada
[8] Univ Calgary, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada
[9] Kyung Hee Univ, Sch Med, Dept Prevent Med, Seoul, South Korea
基金
加拿大自然科学与工程研究理事会;
关键词
transfer learning; patient outcome prediction; intensive care; deep learning; electronic health record;
D O I
10.2196/52730
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Accurate patient outcome prediction in the intensive care unit (ICU) can potentially lead to more effective and efficient patient care. Deep learning models are capable of learning from data to accurately predict patient outcomes, but they typically require large amounts of data and computational resources. Transfer learning (TL) can help in scenarios where data and computational resources are scarce by leveraging pretrained models. While TL has been widely used in medical imaging and natural language processing, it has been rare in electronic health record (EHR) analysis. Furthermore, domain adaptation (DA)has been the most common TL method in general, whereas inductive transfer learning (ITL) has been rare. To the best of our knowledge, DA and ITL have never been studied in-depth in the context of EHR-based ICU patient outcome prediction. Objective: This study investigated DA, as well as rarely researched ITL, in EHR-based ICU patient outcome prediction undersimulated, varying levels of data scarcity. Methods: Two patient cohorts were used in this study: (1) eCritical, a multicenter ICU data from 55,689 unique admission records from 48,672 unique patients admitted to 15 medical-surgical ICUs in Alberta, Canada, between March 2013 and December2019, and (2) Medical Information Mart for Intensive Care III, a single-center, publicly available ICU data set from Boston, Massachusetts, acquired between 2001 and 2012 containing 61,532 admission records from 46,476 patients. We compared DA and ITL models with baseline models (without TL) of fully connected neural networks, logistic regression, and lasso regression in the prediction of 30-day mortality, acute kidney injury, ICU length of stay, and hospital length of stay. Random subsets of training data, ranging from 1% to 75%, as well as the full data set, were used to compare the performances of DA and ITL with the baseline models at various levels of data scarcity. Results: Overall, the ITL models outperformed the baseline models in 55 of 56 comparisons (all Pvalues <.001). The DA models outperformed the baseline models in 45 of 56 comparisons (all Pvalues <.001). ITL resulted in better performance than DA in terms of the number of times and the margin with which it outperformed the baseline models. In 11 of 16 cases (8 of 8 forITL and 3 of 8 for DA), TL models outperformed baseline models when trained using 1% data subset. Conclusions: TL-based ICU patient outcome prediction models are useful in data-scarce scenarios. The results of this studycan be used to estimate ICU outcome prediction performance at different levels of data scarcity, with and without TL. The publiclyavailable pretrained models from this study can serve as building blocks in further research for the development and validationof models in other ICU cohorts and outcomes.
引用
收藏
页数:19
相关论文
共 33 条
[1]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[2]  
Borowka S., 2024, arXiv, DOI [10.31222/osf.io/x6aut, DOI 10.1111/CGF.13815]
[3]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Diamant I, 2017, ELS MIC SOC BOOK SER, P299, DOI 10.1016/B978-0-12-810408-8.00018-3
[6]   Target-Dependent Sentiment Classification With BERT [J].
Gao, Zhengjie ;
Feng, Ao ;
Song, Xinyu ;
Wu, Xi .
IEEE ACCESS, 2019, 7 :154290-154299
[7]  
Ge Wendong, 2018, AMIA Annu Symp Proc, V2018, P460
[8]  
Ghafoorian Mohsen, 2017, Medical Image Computing and Computer Assisted Intervention MICCAI 2017. 20th International Conference. Proceedings: LNCS 10435, P516, DOI 10.1007/978-3-319-66179-7_59
[9]   Lung nodule detection and classification from Thorax CT-scan using RetinaNet with transfer learning [J].
Harsono, Ivan William ;
Liawatimena, Suryadiputra ;
Cenggoro, Tjeng Wawan .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 34 (03) :567-577
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778