Which Augmentation Should I Use? An Empirical Investigation of Augmentations for Self-Supervised Phonocardiogram Representation Learning

被引:0
作者
Ballas, Aristotelis [1 ]
Papapanagiotou, Vasileios [2 ]
Diou, Christos [1 ]
机构
[1] Harokopio Univ, Dept Informat & Telemat, Athens 17778, Greece
[2] Karolinska Inst, Dept Med, S-14152 Stockholm, Sweden
关键词
Brain modeling; Phonocardiography; Biological system modeling; Training; Data models; Electrocardiography; Electroencephalography; Feature extraction; Robustness; Representation learning; Contrastive learning; deep learning; OOD representation learning; phonocardiogram classification; self-supervised learning; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2024.3519297
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite recent advancements in deep learning, its application in real-world medical settings, such as phonocardiogram (PCG) classification, remains limited. A significant barrier is the lack of high-quality annotated datasets, which hampers the development of robust, generalizable models that can perform well on newly collected, out-of-distribution (OOD) data. Self-Supervised Learning (SSL), particularly contrastive learning, has shown promise in mitigating the issue of data scarcity by leveraging unlabeled data to enhance model robustness and effectiveness. Even though SSL methods have been proposed and researched in other domains, works focusing on the impact of data augmentations on model robustness for PCG classification is limited. In particular, while augmentations are a key component in SSL, selecting the most suitable transformations during the training process is highly challenging and time-consuming. Improper augmentations can lead to substantial performance degradation, even hindering the network's ability to learn meaningful representations. Addressing this gap, our research aims to explore and evaluate a wide range of audio-based augmentations and uncover combinations that enhance SSL model performance in PCG classification. We conduct a comprehensive comparative analysis across multiple datasets and downstream tasks, assessing the impact of various augmentations on model performance and generalization. Our findings reveal that depending on the training distribution, augmentation choice significantly influences model robustness, with fully-supervised models experiencing up to a 32% drop in effectiveness when applied to unseen data, while SSL models demonstrate greater resilience, losing only 10% or even improving in some cases. This study also sheds light on the most promising and appropriate augmentations for robust PCG signal processing, by calculating their effect size on model training. These insights equip researchers and practitioners with valuable guidelines for building more robust, reliable models in PCG signal processing.
引用
收藏
页码:193459 / 193472
页数:14
相关论文
共 79 条
[1]  
Abadi Martin, 2016, Proceedings of OSDI '16: 12th USENIX Symposium on Operating Systems Design and Implementation. OSDI '16, P265
[2]  
Alkhodari Mohanad, 2022, 2022 Computing in Cardiology (CinC), P1, DOI 10.22489/CinC.2022.035
[3]  
[Anonymous], 2005, Bioelectrical Signal Processing in Cardiac and Neurological Applications (Biomedical Engineering)
[4]  
[Anonymous], The PASCAL classifying heart sounds challenge 2011 (CHSC2011) results
[5]  
Araujo Matheus, 2022, 2022 Computing in Cardiology (CinC), P1, DOI 10.22489/CinC.2022.249
[6]   RECOMMENDATIONS FOR STANDARDIZATION AND SPECIFICATIONS IN AUTOMATED ELECTROCARDIOGRAPHY - BANDWIDTH AND DIGITAL SIGNAL-PROCESSING - A REPORT FOR HEALTH-PROFESSIONALS BY AN AD HOC WRITING GROUP OF THE COMMITTEE ON ELECTROCARDIOGRAPHY AND CARDIAC ELECTROPHYSIOLOGY OF THE COUNCIL-ON-CLINICAL-CARDIOLOGY, AMERICAN-HEART-ASSOCIATION [J].
BAILEY, JJ ;
BERSON, AS ;
GARSON, A ;
HORAN, LG ;
MACFARLANE, PW ;
MORTARA, DW ;
ZYWIETZ, C .
CIRCULATION, 1990, 81 (02) :730-739
[7]  
Ballas Aristotelis, 2022, 2022 Computing in Cardiology (CinC), P1, DOI 10.22489/CinC.2022.298
[8]  
Ballas A., 2023, IEEE Trans. Artif. Intell., V5, P6253
[9]   Towards Domain Generalization for ECG and EEG Classification: Algorithms and Benchmarks [J].
Ballas, Aristotelis ;
Diou, Christos .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01) :44-54
[10]   A Domain Generalization Approach for Out-Of-Distribution 12-lead ECG Classification with Convolutional Neural Networks [J].
Ballas, Aristotelis ;
Diou, Christos .
2022 IEEE EIGHTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS, BIGDATASERVICE, 2022, :9-13