Signal quality indices and data fusion for determining clinical acceptability of electrocardiograms

被引:206
作者
Clifford, G. D. [1 ]
Behar, J. [1 ]
Li, Q. [1 ,2 ]
Rezek, I. [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, Inst Biomed Engn, Oxford OX1 2JD, England
[2] Shandong Univ, Sch Med, Inst Biomed Engn, Jinan 250012, Shandong, Peoples R China
关键词
ECG; machine learning; mHealth; neural networks; signal quality;
D O I
10.1088/0967-3334/33/9/1419
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
A completely automated algorithm to detect poor-quality electrocardiograms (ECGs) is described. The algorithm is based on both novel and previously published signal quality metrics, originally designed for intensive care monitoring. The algorithms have been adapted for use on short (5-10 s) single- and multi-lead ECGs. The metrics quantify spectral energy distribution, higher order moments and inter-channel and inter-algorithm agreement. Seven metrics were calculated for each channel (84 features in all) and presented to either a multi-layer perceptron artificial neural network or a support vector machine (SVM) for training on a multiple-annotator labelled and adjudicated training dataset. A single- lead version of the algorithm was also developed in a similar manner. Data were drawn from the PhysioNet Challenge 2011 dataset where binary labels were available, on 1500 12-lead ECGs indicating whether the entire recording was acceptable or unacceptable for clinical interpretation. We re-annotated all the leads in both the training set (1000 labelled ECGs) and test dataset (500 12-lead ECGs where labels were not publicly available) using two independent annotators, and a third for adjudication of differences. We found that low-quality data accounted for only 16% of the ECG leads. To balance the classes (between high and low quality), we created extra noisy data samples by adding noise from PhysioNet's noise stress test database to some of the clean 12-lead ECGs. No data were shared between training and test sets. A classification accuracy of 98% on the training data and 97% on the test data were achieved. Upon inspection, incorrectly classified data were found to be borderline cases which could be classified either way. If these cases were more consistently labelled, we expect our approach to achieve an accuracy closer to 100%.
引用
收藏
页码:1419 / 1433
页数:15
相关论文
共 26 条
  • [1] Reducing false alarm rates for critical arrhythmias using the arterial blood pressure waveform.
    Aboukhalil, Anton
    Nielsen, Larry
    Saeed, Mohammed
    Mark, Roger G.
    Clifford, Gari D.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (03) : 442 - 451
  • [2] [Anonymous], ORAL HLTH STATUS ORA
  • [3] [Anonymous], 2006, PATTERN RECOGN
  • [4] [Anonymous], 2006, ARTECH HOUSE BOSTON
  • [5] [Anonymous], ORAL HLTH STATUS ORA
  • [6] [Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199
  • [7] [Anonymous], 2011, COMPUT CARDIOL CONF
  • [8] Celi Leo Anthony, 2009, J Health Inform Dev Ctries, V3, P17
  • [9] Clifford GD, 2011, COMPUT CARDIOL CONF, V38, P285
  • [10] DOWER G E, 1980, Clinical Cardiology, V3, P87