Frozen Language Model Helps ECG Zero-Shot Learning

Cited by: 0
Authors
Li, Jun [1]
Liu, Che [2, 3]
Cheng, Sibo [3]
Arcucci, Rossella [2, 3]
Hong, Shenda [4, 5]
Affiliations
[1] Jilin Univ, Coll Elect Sci & Engn, Changchun, Peoples R China
[2] Imperial Coll London, Dept Earth Sci & Engn, London SW7 2AZ, England
[3] Imperial Coll London, Data Sci Inst, Dept Comp, London, England
[4] Peking Univ, Natl Inst Hlth Data Sci, Beijing, Peoples R China
[5] Peking Univ, Inst Med Technol, Hlth Sci Ctr, Beijing, Peoples R China
Source
MEDICAL IMAGING WITH DEEP LEARNING, VOL 227 | 2023 / Vol. 227
Funding
National Natural Science Foundation of China;
Keywords
Multimodal self-supervised learning; Zero-shot learning; Language model; ECG; Signal processing; MYOCARDIAL-INFARCTION; SIGNALS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The electrocardiogram (ECG) is one of the most commonly used non-invasive, convenient medical monitoring tools for assisting the clinical diagnosis of heart disease. Recently, deep learning (DL) techniques, particularly self-supervised learning (SSL), have demonstrated great potential in ECG classification. After fine-tuning, SSL pre-training achieves competitive performance with only a small amount of annotated data. However, current SSL methods rely on the availability of annotated data and cannot predict labels that do not exist in the fine-tuning datasets. To address this challenge, we propose Multimodal ECG-Text Self-supervised pre-training (METS), the first work to use auto-generated clinical reports to guide ECG SSL pre-training. We use a trainable ECG encoder and a frozen language model to embed paired ECGs and auto-generated clinical reports separately. The SSL objective maximizes the similarity between each ECG and its paired auto-generated report while minimizing the similarity between that ECG and the other reports. In downstream classification tasks, METS achieves around a 10% improvement in performance via zero-shot classification, without using any annotated data, compared to supervised and SSL baselines that rely on annotated data. Furthermore, METS achieves the highest recall and F1 scores on the MIT-BIH dataset, even though MIT-BIH contains different ECG classes from the pre-training dataset. Extensive experiments demonstrate the advantages of ECG-Text multimodal self-supervised learning in terms of generalizability, effectiveness, and efficiency.
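The objective and the zero-shot step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the exact loss, so an InfoNCE-style contrastive loss over cosine similarities is assumed here, and the function names, the `temperature` value, the toy two-dimensional embeddings, and the label names are all hypothetical. In the method described above, ECG embeddings would come from the trainable ECG encoder and text embeddings from the frozen language model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(ecg_embs, text_embs, temperature=0.1):
    """InfoNCE-style loss (assumed form): ecg_embs[i] is paired with text_embs[i].

    The loss is small when each ECG is more similar to its own report
    than to the other reports in the batch.
    """
    total = 0.0
    for i, ecg in enumerate(ecg_embs):
        logits = [cosine(ecg, text) / temperature for text in text_embs]
        # Numerically stable log-sum-exp over all reports in the batch.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_denominator - logits[i]
    return total / len(ecg_embs)


def zero_shot_predict(ecg_emb, label_embs):
    """Pick the label whose text embedding is most similar to the ECG embedding."""
    return max(label_embs, key=lambda name: cosine(ecg_emb, label_embs[name]))


# Toy, hand-made embeddings standing in for encoder outputs (hypothetical values).
ecg_embs = [[1.0, 0.0], [0.0, 1.0]]
matched_reports = [[0.9, 0.1], [0.1, 0.9]]
swapped_reports = [matched_reports[1], matched_reports[0]]

loss_matched = contrastive_loss(ecg_embs, matched_reports)
loss_swapped = contrastive_loss(ecg_embs, swapped_reports)

# Hypothetical class names; their embeddings would come from the frozen language model.
label_embs = {"normal": [1.0, 0.2], "abnormal": [0.1, 1.0]}
prediction = zero_shot_predict([0.8, 0.1], label_embs)
```

Because prediction only compares an ECG embedding against text embeddings of class descriptions, new classes can be added by embedding new text, which is why no annotated ECG data is needed at this step.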
Pages: 402-415
Number of pages: 14