Missing data techniques in classification for cardiovascular dysautonomias diagnosis

Authors
Ali Idri
Ilham Kadi
Ibtissam Abnane
José Luis Fernandez-Aleman
Affiliations
[1] Mohammed V University, Software Project Management Research Team
[2] Mohammed VI Polytechnic University, CSEHS
[3] University of Murcia, MSDA
Source
Medical & Biological Engineering & Computing | 2020 / Vol. 58
Keywords
Missing data; KNN imputation; Missingness mechanism; Cardiology
Abstract
Missing data (MD) is a common and unavoidable problem for data mining (DM)-based decision systems in e-health, since many historical medical datasets contain a large number of missing values. A pre-processing stage is therefore usually required to handle missing values before building any DM-based decision system. The purpose of this paper is to evaluate the impact of MD techniques on classification systems for cardiovascular dysautonomias diagnosis. We analyzed and compared the accuracy rates of four classification techniques, random forest (RF), support vector machines (SVM), C4.5 decision tree, and Naive Bayes (NB), under two MD techniques: deletion and imputation with k-nearest neighbors (KNN). A total of 216 experiments were carried out by combining the four classifiers with three missingness mechanisms (MCAR: missing completely at random; MAR: missing at random; NMAR: not missing at random), two MD techniques (deletion and KNN imputation), and nine MD percentages from 10 to 90%, over a dataset collected from the autonomic nervous system (ANS) unit of the University Hospital Avicenne in Morocco. The results suggest that using KNN imputation rather than deletion improves the accuracy rates of the four classifiers. Moreover, the MD percentage has a negative impact on classification performance regardless of the MD mechanism and MD technique used: the accuracy rates of the four classifiers decrease as the MD percentage increases.
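To make the two MD techniques compared in the abstract concrete, the following is a minimal standard-library Python sketch, not the authors' implementation. It injects MCAR missingness into a synthetic table, applies listwise deletion, and applies a simple KNN imputation. All function names, the synthetic data, the choice of k = 3, and the distance rule (Euclidean over jointly observed features, rescaled to the full width) are assumptions made for illustration; the abstract does not specify these details. The last lines only check that 4 classifiers × 3 mechanisms × 2 techniques × 9 percentages gives the 216 experiments reported.

```python
import math
import random
from itertools import product


def inject_mcar(rows, rate, rng):
    """MCAR: blank each cell independently with probability `rate`."""
    return [[None if rng.random() < rate else v for v in row] for row in rows]


def listwise_delete(rows):
    """Deletion: keep only the rows that have no missing cell."""
    return [row for row in rows if None not in row]


def _distance(a, b):
    """Euclidean distance over jointly observed features, rescaled to the
    full feature count; infinite when the rows share no observed feature."""
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(squared * len(a) / len(shared))


def knn_impute(rows, k=3):
    """Fill each missing cell with the mean of that feature over the k
    nearest rows in which the feature is observed. A cell stays None if
    no comparable donor row exists."""
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            donors = sorted(
                (_distance(row, other), other[j])
                for n, other in enumerate(rows)
                if n != i and other[j] is not None
            )
            donors = [(d, v) for d, v in donors if d != math.inf][:k]
            if donors:
                filled[i][j] = sum(v for _, v in donors) / len(donors)
    return filled


# Synthetic stand-in for a clinical table (hypothetical data, 50 x 4).
rng = random.Random(0)
complete = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(50)]
incomplete = inject_mcar(complete, 0.2, rng)

kept = listwise_delete(incomplete)      # deletion shrinks the sample
imputed = knn_impute(incomplete, k=3)   # imputation keeps every row
print(len(kept), "rows after deletion;", len(imputed), "rows after imputation")

# Experimental grid described in the abstract: 4 x 3 x 2 x 9 = 216.
grid = list(product(
    ["RF", "SVM", "C4.5", "NB"],
    ["MCAR", "MAR", "NMAR"],
    ["deletion", "KNN imputation"],
    range(10, 100, 10),
))
print(len(grid), "experiments")
```

The sketch covers only the MCAR mechanism; simulating MAR or NMAR would require making the probability of a cell being blanked depend on observed or unobserved values respectively, which is omitted here.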
Pages: 2863-2878
Number of pages: 15