Multimodal multi-instance learning for long-term ECG classification

Cited by: 34
Authors
Han, Haozhan [1]
Lian, Cheng [1]
Zeng, Zhigang [2]
Xu, Bingrong [1]
Zang, Junbin [3]
Xue, Chenyang [3]
Affiliations
[1] Wuhan Univ Technol, Sch Automat, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[3] North Univ China, Sch Instrument & Elect, Taiyuan 038507, Shanxi, Peoples R China
Keywords
Long-term ECG; Multimodal learning; Multi-instance learning; Attention mechanism; Heartbeat classification; CNN
DOI
10.1016/j.knosys.2023.110555
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, deep learning-based models have been widely used for electrocardiogram (ECG) classification tasks. Most ECG signals are long-term time series that contain a large number of sample points. However, existing deep learning-based models resize or crop the original long-term ECG signal due to the limitation of input size and hardware, which results in information loss. To address this issue, a multimodal multi-instance learning neural network (MAMIL) is proposed for long-term ECG classification. The proposed MAMIL has three major components. First, the original ECG signal and Gramian Angular Field (GAF) image converted from the ECG signal are utilized as multimodal inputs, which enables the model to learn complementary information between different modalities. Second, multi-instance learning (MIL) is introduced to avoid information loss. Each long-term ECG signal and GAF image are treated as bags, and each heartbeat from a long-term ECG signal and each patch from a GAF image are treated as instances. Convolutional neural networks (CNNs) are utilized to extract instance features from different modalities. Third, a novel attention mechanism-based feature fusion method is proposed to aggregate the instance features from multiple modalities to obtain the bag feature for final classification. Our feature fusion method adopts pooling to obtain positive instances, which can effectively eliminate redundant information and achieve low computational complexity. The proposed MAMIL is evaluated on both intrapatient and interpatient patterns of two commonly used ECG datasets. Experimental results show that our model not only outperforms common deep learning-based models, but also outperforms previous MIL-based models. (c) 2023 Elsevier B.V. All rights reserved.
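The abstract describes converting the ECG signal into a Gramian Angular Field image and treating patches of that image as instances in a bag. The following is a minimal sketch of that idea, not the authors' implementation. It assumes the summation variant of the GAF (the abstract does not say which variant is used), and the function names, the patch size, and the sine wave standing in for an ECG segment are all hypothetical.

```python
import numpy as np


def gramian_angular_summation_field(signal):
    """Return the GASF image of a 1-D signal (assumed variant, see lead-in)."""
    x = np.asarray(signal, dtype=float)
    lo, hi = x.min(), x.max()
    if hi == lo:
        raise ValueError("signal must not be constant")
    # Rescale to [-1, 1] so that arccos is defined for every sample.
    scaled = 2.0 * (x - lo) / (hi - lo) - 1.0
    scaled = np.clip(scaled, -1.0, 1.0)  # guard against rounding error
    phi = np.arccos(scaled)
    # Entry (i, j) is cos(phi_i + phi_j).
    return np.cos(phi[:, None] + phi[None, :])


def split_into_patches(image, patch_size):
    """Cut a square image into non-overlapping square patches (the instances)."""
    n = image.shape[0]
    usable = n - n % patch_size  # drop any remainder rows and columns
    k = usable // patch_size
    trimmed = image[:usable, :usable]
    patches = trimmed.reshape(k, patch_size, k, patch_size).swapaxes(1, 2)
    return patches.reshape(k * k, patch_size, patch_size)


# Hypothetical stand-in for an ECG segment: two periods of a sine wave.
signal = np.sin(np.linspace(0.0, 4.0 * np.pi, 64))
gaf = gramian_angular_summation_field(signal)
bag = split_into_patches(gaf, 16)  # one bag of patch instances
print(gaf.shape, bag.shape)
```

In a multi-instance setting, each patch in `bag` would then be passed through a feature extractor such as a CNN, and the resulting instance features would be aggregated into a single bag feature for classification.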
Pages: 12
Related papers
41 in total
[1]   ECG Heartbeat Classification Using Multimodal Fusion [J].
Ahmad, Zeeshan ;
Tabassum, Anika ;
Guan, Ling ;
Khan, Naimul Mefraz .
IEEE ACCESS, 2021, 9 :100615-100626
[2]   Classification of myocardial infarction with multi-lead ECG signals and deep CNN [J].
Baloglu, Ulas Baran ;
Talo, Muhammed ;
Yildirim, Ozal ;
Tan, Ru San ;
Acharya, U. Rajendra .
PATTERN RECOGNITION LETTERS, 2019, 122 :23-30
[3]   ECG analysis: A new approach in human identification [J].
Biel, L ;
Pettersson, O ;
Philipson, L ;
Wide, P .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2001, 50 (03) :808-812
[4]  
Bulkova Veronika, 2021, Vnitr Lek, V67, P16, DOI 10.36290/vnl.2021.002
[5]  
Cho K., 2014, LEARNING PHRASE REPR
[6]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[7]   Multiscaled Fusion of Deep Convolutional Neural Networks for Screening Atrial Fibrillation From Single Lead Short ECG Recordings [J].
Fan, Xiaomao ;
Yao, Qihang ;
Cai, Yunpeng ;
Miao, Fen ;
Sun, Fangmin ;
Li, Ye .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (06) :1744-1753
[8]   InceptionTime: Finding AlexNet for time series classification [J].
Fawaz, Hassan Ismail ;
Lucas, Benjamin ;
Forestier, Germain ;
Pelletier, Charlotte ;
Schmidt, Daniel F. ;
Weber, Jonathan ;
Webb, Geoffrey, I ;
Idoumghar, Lhassane ;
Muller, Pierre-Alain ;
Petitjean, Francois .
DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (06) :1936-1962
[9]   SenCAPTCHA: A Mobile-First CAPTCHA Using Orientation Sensors [J].
Feng, Yunhe ;
Cao, Qing ;
Qi, Hairong ;
Ruoti, Scott .
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (02)
[10]   PhysioBank, PhysioToolkit, and PhysioNet - Components of a new research resource for complex physiologic signals [J].
Goldberger, AL ;
Amaral, LAN ;
Glass, L ;
Hausdorff, JM ;
Ivanov, PC ;
Mark, RG ;
Mietus, JE ;
Moody, GB ;
Peng, CK ;
Stanley, HE .
CIRCULATION, 2000, 101 (23) :E215-E220