Applying masked autoencoder-based self-supervised learning for high-capability vision transformers of electrocardiographies

被引:0
|
作者
Sawano, Shinnosuke [1 ]
Kodera, Satoshi [1 ]
Setoguchi, Naoto [2 ]
Tanabe, Kengo [2 ]
Kushida, Shunichi [3 ]
Kanda, Junji [3 ]
Saji, Mike [4 ]
Nanasato, Mamoru [4 ]
Maki, Hisataka [5 ]
Fujita, Hideo [5 ]
Kato, Nahoko [6 ]
Watanabe, Hiroyuki [6 ]
Suzuki, Minami [7 ]
Takahashi, Masao [7 ]
Sawada, Naoko [8 ]
Yamasaki, Masao [8 ]
Sato, Masataka [1 ]
Katsushika, Susumu [1 ]
Shinohara, Hiroki [1 ]
Takeda, Norifumi [1 ]
Fujiu, Katsuhito [1 ,9 ]
Daimon, Masao [1 ,10 ]
Akazawa, Hiroshi [1 ]
Morita, Hiroyuki [1 ]
Komuro, Issei [1 ]
机构
[1] Univ Tokyo Hosp, Dept Cardiovasc Med, Tokyo, Japan
[2] Mitsui Mem Hosp, Div Cardiol, Tokyo, Japan
[3] Asahi Gen Hosp, Dept Cardiovasc Med, Chiba, Japan
[4] Sakakibara Heart Inst, Dept Cardiol, Tokyo, Japan
[5] Jichi Med Univ, Saitama Med Ctr, Div Cardiovasc Med, Omiya, Japan
[6] Tokyo Bay Med Ctr, Dept Cardiol, Urayasu, Japan
[7] JR Gen Hosp, Dept Cardiol, Tokyo, Japan
[8] NTT Med Ctr Tokyo, Dept Cardiol, Tokyo, Japan
[9] Univ Tokyo, Dept Adv Cardiol, Tokyo, Japan
[10] Univ Tokyo Hosp, Dept Clin Lab, Tokyo, Japan
来源
PLOS ONE | 2024年 / 19卷 / 08期
关键词
VENTRICULAR SYSTOLIC DYSFUNCTION; ECG;
D O I
10.1371/journal.pone.0307978
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The generalization of deep neural network algorithms to a broader population is an important challenge in the medical field. We aimed to apply self-supervised learning using masked autoencoders (MAEs) to improve the performance of the 12-lead electrocardiography (ECG) analysis model using limited ECG data. We pretrained Vision Transformer (ViT) models by reconstructing the masked ECG data with MAE. We fine-tuned this MAE-based ECG pretrained model on ECG-echocardiography data from The University of Tokyo Hospital (UTokyo) for the detection of left ventricular systolic dysfunction (LVSD), and then evaluated it using multi-center external validation data from seven institutions, employing the area under the receiver operating characteristic curve (AUROC) for assessment. We included 38,245 ECG-echocardiography pairs from UTokyo and 229,439 pairs from all institutions. The performances of MAE-based ECG models pretrained using ECG data from UTokyo were significantly higher than that of other Deep Neural Network models across all external validation cohorts (AUROC, 0.913-0.962 for LVSD, p < 0.001). Moreover, we also found improvements for the MAE-based ECG analysis model depending on the model capacity and the amount of training data. Additionally, the MAE-based ECG analysis model maintained high performance even on the ECG benchmark dataset (PTB-XL). Our proposed method developed high performance MAE-based ECG analysis models using limited ECG data.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Masked cosine similarity prediction for self-supervised skeleton-based action representation learning
    Ziliang Ren
    Ronggui Liu
    Yong Qin
    Xiangyang Gao
    Qieshi Zhang
    Pattern Analysis and Applications, 2025, 28 (2)
  • [42] Monocular depth estimation for vision-based vehicles based on a self-supervised learning method
    Tektonidis, Marco
    Monnin, David
    AUTONOMOUS SYSTEMS: SENSORS, PROCESSING, AND SECURITY FOR VEHICLES AND INFRASTRUCTURE 2020, 2020, 11415
  • [43] Vision Transformer-Based Self-supervised Learning for Ulcerative Colitis Grading in Colonoscopy
    Pyatha, Ajay
    Xu, Ziang
    Ali, Sharib
    DATA ENGINEERING IN MEDICAL IMAGING, DEMI 2023, 2023, 14314 : 102 - 110
  • [44] Self-supervised Pre-training for Dealing with Small Datasets in Deep Learning for Medical Imaging Evaluation of Contrastive and Masked Autoencoder Methods
    Wolf, Daniel
    Payer, Tristan
    Lisson, Catharina S.
    Lisson, Christoph G.
    Beer, Meinrad
    Goetz, Michael
    Ropinski, Timo
    BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 157 - 157
  • [45] EslaXDET: A new X-ray baggage security detection framework based on self-supervised vision transformers
    Wu, Jiajie
    Xu, Xianghua
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [46] EslaXDET: A new X-ray baggage security detection framework based on self-supervised vision transformers
    Wu, Jiajie
    Xu, Xianghua
    Engineering Applications of Artificial Intelligence, 2024, 127
  • [47] Self-supervised representation learning of metro interior noise based on variational autoencoder and deep embedding clustering
    Wang, Yang
    Xiao, Hong
    Zhang, Zhihai
    Guo, Xiaoxuan
    Liu, Qiang
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2025, 40 (04) : 503 - 522
  • [48] ATTENTION-GUIDED CONTRASTIVE MASKED IMAGE MODELING FOR TRANSFORMER-BASED SELF-SUPERVISED LEARNING
    Zhan, Yucheng
    Zhao, Yucheng
    Luo, Chong
    Zhang, Yueyi
    Sun, Xiaoyan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2490 - 2494
  • [49] A Novel Driver Distraction Behavior Detection Method Based on Self-Supervised Learning With Masked Image Modeling
    Zhang, Yingzhi
    Li, Taiguo
    Li, Chao
    Zhou, Xinghong
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (04): : 6056 - 6071
  • [50] Masked Feature Modeling for Generative Self-Supervised Representation Learning of High-Resolution Remote Sensing Images
    Pang, Shiyan
    Hu, Hanchun
    Zuo, Zhiqi
    Chen, Jia
    Hu, Xiangyun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8434 - 8449