Generalising electrocardiogram detection and delineation: training convolutional neural networks with synthetic data augmentation

被引:1
作者
Jimenez-Perez, Guillermo [1 ,2 ,3 ]
Acosta, Juan [2 ]
Alcaine, Alejandro [4 ]
Camara, Oscar [1 ]
机构
[1] BCN MedTech, Dept Informat & Commun Technol, PhySense Res Grp, Barcelona, Spain
[2] Virgen Del Rocio Univ Hosp, Arrhythmia Unit, Dept Cardiol, Seville, Spain
[3] Inst Invest Biomed August Pi & Sunyer, Barcelona, Spain
[4] Univ San Jorge, Fac Ciencias Salud, Comp Med & Biol Applicat CoMBA Grp, Zaragoza, Spain
来源
FRONTIERS IN CARDIOVASCULAR MEDICINE | 2024年 / 11卷
关键词
digital health; electrocardiogram; convolutional neural network; artificial intelligence; delineation; multi-centre study; data augmentation; segmentation; ECG SIGNALS; CLASSIFICATION; VALIDATION;
D O I
10.3389/fcvm.2024.1341786
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Introduction Extracting beat-by-beat information from electrocardiograms (ECGs) is crucial for various downstream diagnostic tasks that rely on ECG-based measurements. However, these measurements can be expensive and time-consuming to produce, especially for long-term recordings. Traditional ECG detection and delineation methods, relying on classical signal processing algorithms such as those based on wavelet transforms, produce high-quality delineations but struggle to generalise to diverse ECG patterns. Machine learning (ML) techniques based on deep learning algorithms have emerged as promising alternatives, capable of achieving similar performance without handcrafted features or thresholds. However, supervised ML techniques require large annotated datasets for training, and existing datasets for ECG detection/delineation are limited in size and the range of pathological conditions they represent.Methods This article addresses this challenge by introducing two key innovations. First, we develop a synthetic data generation scheme that probabilistically constructs unseen ECG traces from "pools" of fundamental segments extracted from existing databases. A set of rules guides the arrangement of these segments into coherent synthetic traces, while expert domain knowledge ensures the realism of the generated traces, increasing the input variability for training the model. Second, we propose two novel segmentation-based loss functions that encourage the accurate prediction of the number of independent ECG structures and promote tighter segmentation boundaries by focusing on a reduced number of samples.Results The proposed approach achieves remarkable performance, with a F1-score of 99.38% and delineation errors of 2.19 +/- 17.73 ms and 4.45 +/- 18.32 ms for ECG segment onsets and offsets across the P, QRS, and T waves. These results, aggregated from three diverse freely available databases (QT, LU, and Zhejiang), surpass current state-of-the-art detection and delineation approaches.Discussion Notably, the model demonstrated exceptional performance despite variations in lead configurations, sampling frequencies, and represented pathophysiology mechanisms, underscoring its robust generalisation capabilities. Real-world examples, featuring clinical data with various pathologies, illustrate the potential of our approach to streamline ECG analysis across different medical settings, fostered by releasing the codes as open source.
引用
收藏
页数:15
相关论文
共 60 条
  • [1] Abrishami H., 2018, 2018 Int'l Conf. Bioinformatics and Computational Biology (BIOCOMP'18), P71
  • [2] A QRS axis-based algorithm to identify the origin of scar-related ventricular tachycardia in the 17-segment American Heart Association model
    Andreu, David
    Fernandez-Armenta, Juan
    Acosta, Juan
    Penela, Diego
    Jauregui, Beatriz
    Soto-Iglesias, David
    Syrovnev, Vladimir
    Arbelo, Elena
    Maria Tolosana, Jose
    Berruezo, Antonio
    [J]. HEART RHYTHM, 2018, 15 (10) : 1491 - 1497
  • [3] Avula Vennela, 2023, JACC Adv, V2, DOI 10.1016/j.jacadv.2023.100686
  • [4] Delineation of ECG characteristic features using multiresolution wavelet analysis method
    Banerjee, S.
    Gupta, R.
    Mitra, M.
    [J]. MEASUREMENT, 2012, 45 (03) : 474 - 487
  • [5] Bokeh Development Team, 2018, Bokeh: Python library for interactive visualization
  • [6] A Crucial Wave Detection and Delineation Method for Twelve-Lead ECG Signals
    Chen, Genlang
    Chen, Maolin
    Zhang, Jiajian
    Zhang, Liang
    Pang, Chaoyi
    [J]. IEEE ACCESS, 2020, 8 : 10707 - 10717
  • [7] Cheng B., 2021, PROC CVPR IEEE
  • [8] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [9] Darmawahyuni A., 2023, Improved delineation model of a standard 12-lead electrocardiogram based on a deep learning algorithm, V23, P139, DOI [10.1186/s12911-023-02233-0, DOI 10.1186/S12911-023-02233-0]
  • [10] Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias
    Doste, Ruben
    Lozano, Miguel
    Jimenez-Perez, Guillermo
    Mont, Lluis
    Berruezo, Antonio
    Penela, Diego
    Camara, Oscar
    Sebastian, Rafael
    [J]. FRONTIERS IN PHYSIOLOGY, 2022, 13