Missing data techniques in classification for cardiovascular dysautonomias diagnosis

被引:0
作者
Ali Idri
Ilham Kadi
Ibtissam Abnane
José Luis Fernandez-Aleman
机构
[1] Mohammed V University,Software Project Management Research Team
[2] Mohammed VI Polytechnic University,CSEHS
[3] University of Murcia,MSDA
来源
Medical & Biological Engineering & Computing | 2020年 / 58卷
关键词
Missing data; KNN imputation; Missingness mechanism; Cardiology;
D O I
暂无
中图分类号
学科分类号
摘要
Missing data (MD) is a common and inevitable problem facing data mining (DM)–based decision systems in e-health since many medical historical datasets contain a huge number of missing values. Therefore, a pre-processing stage is usually required to deal with missing values before building any DM–based decision system. The purpose of this paper is to evaluate the impact of MD techniques on classification systems in cardiovascular dysautonomias diagnosis. We analyzed and compared the accuracy rates of four classification techniques: random forest (RF), support vector machines (SVM), C4.5 decision tree, and Naive Bayes (NB), using two MD techniques: deletion or imputation with k-nearest neighbors (KNN). A total of 216 experiments were therefore carried out using three missingness mechanisms (MCAR: missing completely at random, MAR: missing at random and NMAR: not missing at random), two MD techniques (deletion and KNN imputation), nine MD percentages from 10 to 90% over a dataset collected from the autonomic nervous system (ANS) unit of the University Hospital Avicenne in Morocco. The results obtained suggest that using KNN imputation rather than deletion enhances the accuracy rates of the four classifiers. Moreover, the MD percentages have a negative impact on the performance of classification techniques regardless of the MD mechanisms and MD techniques used. In fact, the accuracy rates of the four classifiers decrease as the MD percentage increases.
引用
收藏
页码:2863 / 2878
页数:15
相关论文
共 50 条
  • [21] PATTERN CLASSIFICATION FORMULATED AS A MISSING DATA TASK: THE AUDIO GENRE CLASSIFICATION CASE
    Pikrakis, Aggelos
    Kopsinis, Yannis
    Chouvardas, Symeon
    Theodoridis, Sergios
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2026 - 2030
  • [22] A Bayesian Approach for Control Loop Diagnosis with Missing Data
    Qi, Fei
    Huang, Biao
    Tamayo, Edgar C.
    AICHE JOURNAL, 2010, 56 (01) : 179 - 195
  • [23] An Ensemble Approach for the Diagnosis of Cognitive Decline with Missing Data
    Garcia Baez, Patricio
    Fernandez Viadero, Carlos
    Regidor Garcia, Jose
    Suarez Araujo, Carmen Paz
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2008, 5271 : 353 - +
  • [24] The treatment of missing data in a large cardiovascular clinical outcomes study
    Little, Roderick J.
    Wang, Julia
    Sun, Xiang
    Tian, Hong
    Suh, Eun-Young
    Lee, Michael
    Sarich, Troy
    Oppenheimer, Leonard
    Plotnikov, Alexei
    Wittes, Janet
    Cook-Bruns, Nancy
    Burton, Paul
    Gibson, C. Michael
    Mohanty, Surya
    CLINICAL TRIALS, 2016, 13 (03) : 344 - 351
  • [25] Techniques for handling missing data: Applications to online condition monitoring
    Nelwamondo, Fulufhelo Vincent
    Marwala, Tshilidzi
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (06): : 1507 - 1526
  • [26] Techniques for Handling Missing Data in Secondary Analyses of Large Surveys
    Langkamp, Diane L.
    Lehman, Amy
    Lemeshow, Stanley
    ACADEMIC PEDIATRICS, 2010, 10 (03) : 205 - 210
  • [27] A review of techniques for treating missing data in OM survey research
    Tsikriktsis, N
    JOURNAL OF OPERATIONS MANAGEMENT, 2005, 24 (01) : 53 - 62
  • [28] Missing data: A comparison of neural network and expectation maximization techniques
    Nelwamondo, Fulufhelo V.
    Mohamed, Shakir
    Marwala, Tshilidzi
    CURRENT SCIENCE, 2007, 93 (11): : 1514 - 1521
  • [29] Multi-Objective Feature Selection With Missing Data in Classification
    Xue, Yu
    Tang, Yihang
    Xu, Xin
    Liang, Jiayu
    Neri, Ferrante
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (02): : 355 - 364
  • [30] A dynamic ensemble approach to robust classification in the presence of missing data
    Bryan Conroy
    Larry Eshelman
    Cristhian Potes
    Minnan Xu-Wilson
    Machine Learning, 2016, 102 : 443 - 463