Superiority of Classification Tree versus Cluster, Fuzzy and Discriminant Models in a Heartbeat Classification System

被引:30
作者
Krasteva, Vessela [1 ]
Jekova, Irena [1 ]
Leber, Remo [2 ]
Schmid, Ramun [2 ,3 ]
Abaecherli, Roger [2 ,3 ,4 ]
机构
[1] Bulgarian Acad Sci, Inst Biophys & Biomed Engn, Sofia, Bulgaria
[2] Schiller AG, Biomed Res & Signal Proc, Baar, Switzerland
[3] Bern Univ Appl Sci, Med Technol Ctr, Bern, Switzerland
[4] Univ Basel Hosp, Cardiovasc Res Inst Basel, Bazel, Switzerland
关键词
ECG BEAT CLASSIFICATION; VENTRICULAR CONTRACTION CLASSIFICATION; DECISION TREE; RECOGNITION; FEATURES; SELECTION; CRITERIA; DATABASE; RULE;
D O I
10.1371/journal.pone.0140123
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This study presents a 2-stage heartbeat classifier of supraventricular (SVB) and ventricular (VB) beats. Stage 1 makes computationally-efficient classification of SVB-beats, using simple correlation threshold criterion for finding close match with a predominant normal (reference) beat template. The non-matched beats are next subjected to measurement of 20 basic features, tracking the beat and reference template morphology and RR-variability for subsequent refined classification in SVB or VB-class by Stage 2. Four linear classifiers are compared: cluster, fuzzy, linear discriminant analysis (LDA) and classification tree (CT), all subjected to iterative training for selection of the optimal feature space among extended 210-sized set, embodying interactive second-order effects between 20 independent features. The optimization process minimizes at equal weight the false positives in SVB-class and false negatives in VB-class. The training with European ST-T, AHA, MIT-BIH Supraventricular Arrhythmia databases found the best performance settings of all classification models: Cluster (30 features), Fuzzy (72 features), LDA (142 coefficients), CT (221 decision nodes) with top-3 best scored features: normalized current RR-interval, higher/lower frequency content ratio, beat-to-template correlation. Unbiased test-validation with MIT-BIH Arrhythmia database rates the classifiers in descending order of their specificity for SVBclass: CT (99.9%), LDA (99.6%), Cluster (99.5%), Fuzzy (99.4%); sensitivity for ventricular ectopic beats as part from VB-class (commonly reported in published beat-classification studies): CT (96.7%), Fuzzy (94.4%), LDA (94.2%), Cluster (92.4%); positive predictivity: CT (99.2%), Cluster (93.6%), LDA (93.0%), Fuzzy (92.4%). CT has superior accuracy by 0.3-6.8% points, with the advantage for easy model complexity configuration by pruning the tree consisted of easy interpretable 'if-then' rules.
引用
收藏
页数:29
相关论文
共 49 条
[31]  
Malhotra R, 2010, REV LIT ARTS AM, V4333, P183
[32]  
Mardia K, 1980, Multivariate analysis
[33]   ECG beat classification using PCA, LDA, ICA and Discrete Wavelet Transform [J].
Martis, Roshan Joy ;
Acharya, U. Rajendra ;
Min, Lim Choo .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2013, 8 (05) :437-448
[34]  
Mingers J., 1989, Machine Learning, V3, P319, DOI 10.1007/BF00116837
[35]   The impact of the MIT-BIH arrhythmia database [J].
Moody, GA ;
Mark, RG .
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2001, 20 (03) :45-50
[36]  
Pantelopoulos A, 2010, P 10 IEEE INT C INF, P1, DOI [10.1109/ITAB.2010.5687731, DOI 10.1109/ITAB.2010.5687731]
[37]   Intelligent Classification of Heartbeats for Automated Real-Time ECG Monitoring [J].
Park, Juyoung ;
Kang, Kyungtae .
TELEMEDICINE AND E-HEALTH, 2014, 20 (12) :1069-1077
[38]   PcHD: Personalized classification of heartbeat types using a decision tree [J].
Park, Juyoung ;
Kang, Kyungtae .
COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 54 :79-88
[39]  
Patra D, 2010, INT J COMPUTER SCI I, V36, P3
[40]  
Ripley BD, 1996, Pattern Recognition and Neural Networks, DOI DOI 10.1017/CBO9780511812651