Cross-Hospital Sepsis Early Detection via Semi-Supervised Optimal Transport With Self-Paced Ensemble

被引:7
作者
Ding, Ruiqing [1 ,2 ]
Zhou, Yu [3 ]
Xu, Jie [4 ]
Xie, Yan [5 ]
Liang, Qiqiang [3 ]
Ren, He [5 ]
Wang, Yixuan [1 ,6 ]
Chen, Yanlin [7 ]
Wang, Leye [2 ]
Huang, Man [3 ]
机构
[1] Peking Univ, Key Lab High Confidence Software Technol, Minist Educ, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[3] Zhejiang Univ, Gen Intens Care Unit, Sch Med, Gen Intens Care Unit, Hangzhou 310009, Peoples R China
[4] Zhejiang Univ, Affiliated Hosp 2, IT Ctr, Sch Med, Hangzhou 310009, Peoples R China
[5] Beijing HealSci Technol Co Ltd, Beijing 100176, Peoples R China
[6] Peking Univ, Dept Comp Sci & Technol, Beijing 100871, Peoples R China
[7] Shanghai Univ Finance & Econ, Sch Informat Management & Engn, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal transport theory; semi-supervised transfer learning; sepsis early detection; INTERNATIONAL CONSENSUS DEFINITIONS; PREDICTION;
D O I
10.1109/JBHI.2023.3253208
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Leveraging machine learning techniques for Sepsis early detection and diagnosis has attracted increasing interest in recent years. However, most existing methods require a large amount of labeled training data, which may not be available for a target hospital that deploys a new Sepsis detection system. More seriously, as treated patients are diversified between hospitals, directly applying a model trained on other hospitals may not achieve good performance for the target hospital. To address this issue, we propose a novel semi-supervised transfer learning framework based on optimal transport theory and self-paced ensemble for Sepsis early detection, called SPSSOT, which can efficiently transfer knowledge from the source hospital (with rich labeled data) to the target hospital (with scarce labeled data). Specifically, SPSSOT incorporates a new optimal transport-based semi-supervised domain adaptation component that can effectively exploit all the unlabeled data in the target hospital. Moreover, self-paced ensemble is adapted in SPSSOT to alleviate the class imbalance issue during transfer learning. In a nutshell, SPSSOT is an end-to-end transfer learning method that automatically selects suitable samples from two domains (hospitals) respectively and aligns their feature spaces. Extensive experiments on two open clinical datasets, MIMIC-III and Challenge, demonstrate that SPSSOT outperforms state-of-the-art transfer learning methods by improving 1-3% of AUC.
引用
收藏
页码:3049 / 3060
页数:12
相关论文
共 62 条
  • [1] Minimizing flows for the Monge-Kantorovich problem
    Angenent, S
    Haker, S
    Tannenbaum, A
    [J]. SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2003, 35 (01) : 61 - 97
  • [2] Azizzadenesheli K., 2019, PROC INT C LEARN REP
  • [3] A systematic study of the class imbalance problem in convolutional neural networks
    Buda, Mateusz
    Maki, Atsuto
    Mazurowski, Maciej A.
    [J]. NEURAL NETWORKS, 2018, 106 : 249 - 259
  • [4] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [5] SMOTEBoost: Improving prediction of the minority class in boosting
    Chawla, NV
    Lazarevic, A
    Hall, LO
    Bowyer, KW
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 : 107 - 119
  • [6] Choi Edward, 2016, JMLR Workshop Conf Proc, V56, P301
  • [7] Chung J., 2014, NIPS WORKSH DEEP LEA
  • [8] CORTES C., 2010, ADV NEURAL INFORM PR, V23, P442
  • [9] Courty Nicolas, 2014, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2014. Proceedings: LNCS 8724, P274, DOI 10.1007/978-3-662-44848-9_18
  • [10] Courty N, 2017, ADV NEUR IN, V30