Applying masked autoencoder-based self-supervised learning for high-capability vision transformers of electrocardiographies

被引:0
|
作者
Sawano, Shinnosuke [1 ]
Kodera, Satoshi [1 ]
Setoguchi, Naoto [2 ]
Tanabe, Kengo [2 ]
Kushida, Shunichi [3 ]
Kanda, Junji [3 ]
Saji, Mike [4 ]
Nanasato, Mamoru [4 ]
Maki, Hisataka [5 ]
Fujita, Hideo [5 ]
Kato, Nahoko [6 ]
Watanabe, Hiroyuki [6 ]
Suzuki, Minami [7 ]
Takahashi, Masao [7 ]
Sawada, Naoko [8 ]
Yamasaki, Masao [8 ]
Sato, Masataka [1 ]
Katsushika, Susumu [1 ]
Shinohara, Hiroki [1 ]
Takeda, Norifumi [1 ]
Fujiu, Katsuhito [1 ,9 ]
Daimon, Masao [1 ,10 ]
Akazawa, Hiroshi [1 ]
Morita, Hiroyuki [1 ]
Komuro, Issei [1 ]
机构
[1] Univ Tokyo Hosp, Dept Cardiovasc Med, Tokyo, Japan
[2] Mitsui Mem Hosp, Div Cardiol, Tokyo, Japan
[3] Asahi Gen Hosp, Dept Cardiovasc Med, Chiba, Japan
[4] Sakakibara Heart Inst, Dept Cardiol, Tokyo, Japan
[5] Jichi Med Univ, Saitama Med Ctr, Div Cardiovasc Med, Omiya, Japan
[6] Tokyo Bay Med Ctr, Dept Cardiol, Urayasu, Japan
[7] JR Gen Hosp, Dept Cardiol, Tokyo, Japan
[8] NTT Med Ctr Tokyo, Dept Cardiol, Tokyo, Japan
[9] Univ Tokyo, Dept Adv Cardiol, Tokyo, Japan
[10] Univ Tokyo Hosp, Dept Clin Lab, Tokyo, Japan
来源
PLOS ONE | 2024年 / 19卷 / 08期
关键词
VENTRICULAR SYSTOLIC DYSFUNCTION; ECG;
D O I
10.1371/journal.pone.0307978
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The generalization of deep neural network algorithms to a broader population is an important challenge in the medical field. We aimed to apply self-supervised learning using masked autoencoders (MAEs) to improve the performance of the 12-lead electrocardiography (ECG) analysis model using limited ECG data. We pretrained Vision Transformer (ViT) models by reconstructing the masked ECG data with MAE. We fine-tuned this MAE-based ECG pretrained model on ECG-echocardiography data from The University of Tokyo Hospital (UTokyo) for the detection of left ventricular systolic dysfunction (LVSD), and then evaluated it using multi-center external validation data from seven institutions, employing the area under the receiver operating characteristic curve (AUROC) for assessment. We included 38,245 ECG-echocardiography pairs from UTokyo and 229,439 pairs from all institutions. The performances of MAE-based ECG models pretrained using ECG data from UTokyo were significantly higher than that of other Deep Neural Network models across all external validation cohorts (AUROC, 0.913-0.962 for LVSD, p < 0.001). Moreover, we also found improvements for the MAE-based ECG analysis model depending on the model capacity and the amount of training data. Additionally, the MAE-based ECG analysis model maintained high performance even on the ECG benchmark dataset (PTB-XL). Our proposed method developed high performance MAE-based ECG analysis models using limited ECG data.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
    Chen, Richard J.
    Chen, Chengkuan
    Li, Yicong
    Chen, Tiffany Y.
    Trister, Andrew D.
    Krishnan, Rahul G.
    Mahmood, Faisal
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16123 - 16134
  • [22] SELF-SUPERVISED VISION TRANSFORMERS FOR JOINT SAR-OPTICAL REPRESENTATION LEARNING
    Wang, Yi
    Albrecht, Conrad M.
    Zhu, Xiao Xiang
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 139 - 142
  • [23] MS-DINO: Masked Self-Supervised Distributed Learning Using Vision Transformer
    Park, Sangjoon
    Lee, Ik Jae
    Kim, Jun Won
    Ye, Jong Chul
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (10) : 6180 - 6192
  • [24] Self-Supervised Learning-Based Time Series Classification via Hierarchical Sparse Convolutional Masked-Autoencoder
    Yu, Ting
    Xu, Kele
    Wang, Xu
    Ding, Bo
    Feng, Dawei
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 964 - 975
  • [25] Self-supervised learning of Vision Transformers for digital soil mapping using visual data
    Tresson, Paul
    Dumont, Maxime
    Jaeger, Marc
    Borne, Frederic
    Boivin, Stephane
    Marie-Louise, Loic
    Francois, Jeremie
    Boukcim, Hassan
    Goeau, Herve
    GEODERMA, 2024, 450
  • [26] PointUR-RL: Unified Self-Supervised Learning Method Based on Variable Masked Autoencoder for Point Cloud Reconstruction and Representation Learning
    Li, Kang
    Zhu, Qiuquan
    Wang, Haoyu
    Wang, Shibo
    Tian, He
    Zhou, Ping
    Cao, Xin
    REMOTE SENSING, 2024, 16 (16)
  • [27] A Multi-view Spectral-Spatial-Temporal Masked Autoencoder for Decoding Emotions with Self-supervised Learning
    Li, Rui
    Wang, Yiting
    Zheng, Wei-Long
    Lu, Bao-Liang
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [28] A Self-Supervised Contrastive Denoising Autoencoder-Based Noise Suppression Method for Micro Thrust Measurement Signals Processing
    Chen, Xingyu
    Zhao, Liye
    Xu, Jiawen
    Liu, Zhikang
    Zhang, Chengxin
    Xu, Luxiang
    Guo, Ning
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 17
  • [29] A Cross-Domain Threat Screening and Localization Framework Using Vision Transformers and Self-supervised Learning
    Nasim, Ammara
    Akram, Muhammad Usman
    Khan, Asad Mansoor
    Khan, Muhammad Belal Afsar
    Hassan, Taimur
    2024 14TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS, 2024,
  • [30] Masked Modeling-Based Ultrasound Image Classification via Self-Supervised Learning
    Xu, Kele
    You, Kang
    Zhu, Boqing
    Feng, Ming
    Feng, Dawei
    Yang, Cheng
    IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2024, 5 : 226 - 237