Predicting cardiac autonomic neuropathy category for diabetic data with missing values

被引:19
作者
Abawajy, Jemal [1 ,2 ]
Kelarev, Andrei [1 ,2 ]
Chowdhury, Morshed [1 ,2 ]
Stranieri, Andrew [3 ]
Jelinek, Herbert F. [4 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic 3125, Australia
[2] Deakin Univ, Parallel & Distributing Comp Lab, Geelong, Vic 3125, Australia
[3] Univ Ballarat, Sch Sci Informat Technol & Engn, Ballarat, Vic 3353, Australia
[4] Khalifa Univ, Dept Biomed Engn, Abu Dhabi, U Arab Emirates
关键词
Cardiac autonomic neuropathy; Diabetes; Missing value imputation; Regression learners; Meta-regression techniques; Ewing formula; HEART-RATE-VARIABILITY; MODEL TREES; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.compbiomed.2013.07.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cardiovascular autonomic neuropathy (CAN) is a serious and well known complication of diabetes. Previous articles circumvented the problem of missing values in CAN data by deleting all records and fields with missing values and applying classifiers trained on different sets of features that were complete. Most of them also added alternative features to compensate for the deleted ones. Here we introduce and investigate a new method for classifying CAN data with missing values. In contrast to all previous papers, our new method does not delete attributes with missing values, does not use classifiers, and does not add features. Instead it is based on regression and meta-regression combined with the Ewing formula for identifying the classes of CAN. This is the first article using the Ewing formula and regression to classify CAN. We carried out extensive experiments to determine the best combination of regression and meta-regression techniques for classifying CAN data with missing values. The best outcomes have been obtained by the additive regression meta-learner based on M5Rules and combined with the Ewing formula. It has achieved the best accuracy of 99.78% for two classes of CAN, and 98.98% for three classes of CAN. These outcomes are substantially better than previous results obtained in the literature by deleting all missing attributes and applying traditional classifiers to different sets of features without regression. Another advantage of our method is that it does not require practitioners to perform more tests collecting additional alternative features. (c) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1328 / 1333
页数:6
相关论文
共 34 条
[1]  
Abawajy J., 2012, P WORKSH NEW TRENDS, P1
[2]   Exploiting correlation of ECG with certain EMD functions for discrimination of ventricular fibrillation [J].
Abu Anas, Emran Mohammad ;
Lee, Soo Yeol ;
Hasan, Md Kamrul .
COMPUTERS IN BIOLOGY AND MEDICINE, 2011, 41 (02) :110-114
[3]  
[Anonymous], BIOMEDICAL ENG ONLIN
[4]  
[Anonymous], 2011, Proceedings of the Ninth Australasian Data Mining Conference-Volume 121, AusDM '11
[5]   A statistical approach for determination of time plane features from digitized ECG [J].
Chatterjee, H. K. ;
Gupta, R. ;
Mitra, M. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2011, 41 (05) :278-284
[6]   Binary symbolic dynamics classifies heart rate variability patterns linked to autonomic modulations [J].
Cysarz, D. ;
Van Leeuwen, P. ;
Edelhaeuser, F. ;
Montano, N. ;
Porta, A. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2012, 42 (03) :313-318
[7]   Spectral analysis of heart rate variability with the autoregressive method: What model order to choose? [J].
Dantas, Eduardo Miranda ;
Sant'Anna, Marcela Lima ;
Andreao, Rodrigo Varejao ;
Goncalves, Christine Pereira ;
Morra, Elis Aguiar ;
Baldo, Marcelo Perim ;
Rodrigues, Sergio Lamego ;
Mill, Jose Geraldo .
COMPUTERS IN BIOLOGY AND MEDICINE, 2012, 42 (02) :164-170
[8]   Missing value imputation in DNA microarrays based on conjugate gradient method [J].
Dorri, Fatemeh ;
Azmi, Paeiz ;
Dorri, Faezeh .
COMPUTERS IN BIOLOGY AND MEDICINE, 2012, 42 (02) :222-227
[9]  
EWING DJ, 1980, Q J MED, V49, P95
[10]   DIAGNOSIS AND MANAGEMENT OF DIABETIC AUTONOMIC NEUROPATHY [J].
EWING, DJ ;
CLARKE, BF .
BRITISH MEDICAL JOURNAL, 1982, 285 (6346) :916-918