Feature Selection by mRMR Method for Heart Disease Diagnosis

被引:10
|
作者
Wang, Gaoshuai [1 ]
Lauri, Fabrice [2 ]
El Hassani, Amir Hajjam [1 ]
机构
[1] Univ Bourgogne Franche Comte, UTBM, Lab Nanomed Imagerie Therapeut EA4662, F-90010 Belfort, France
[2] Univ Bourgogne Franche Comte, UTBM, Connaissance & Intelligence Artificielle Distribu, F-90010 Belfort, France
来源
IEEE ACCESS | 2022年 / 10卷
关键词
Feature extraction; Diseases; Heart; Principal component analysis; Information filters; Mutual information; Clinical diagnosis; Heart disease; feature selection; mutual information; mRMR; ALGORITHM; CLASSIFICATION; ASSOCIATION; STROKE; PCA;
D O I
10.1109/ACCESS.2022.3207492
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heart disease has become a non-ignorable threat to human health in recent years. Once without timely diagnosis and treatment, patients often suffer disability or even death. However, the diagnosis accuracy directly relies on different doctors' experiences and various factors associated with heart disease bring heavy tasks on them make the situation worse. Therefore, to improve heart disease treatment, introducing computer-aided techniques to assist doctors in diagnosis is a feasible approach. At present, researchers usually use the processed dataset (13 features) selected by doctors from the unprocessed dataset (74 features) (UCI Machine Learning Repository) and apply the feature selection method to the dataset, it's inappropriate because the feature scale is so small. People neglect the unprocessed dataset's value and don't realize it could contain some latent information. A comprehensive comparison is needed to demonstrate the unprocessed dataset's advantages. Besides, the incremental feature combination method should be verified. As the minimum Redundancy - Maximum Relevance (mRMR) gains great success in feature selection, applying it as a feature filter can enhance classification accuracy. Thus, in this research, we introduced the mRMR method as a filter for feature selection and made a comprehensive comparison within several methods like Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Kendall, Random Forest, and other research works in several metrics. By analyzing the results, in most cases, the unprocessed dataset can enhance algorithm's performance. The incremental feature selection method is effective and the mRMR is superior to other methods. Not only does it own the highest accuracies, but also the least supportive features. It has 100% accuracy with 8 features on the Cleveland dataset, 98.3% accuracy with 14 features on Hungarian, and 99% accuracy with 9 features on Long-beach-VA, respectively. Furthermore, we find that some features, which doctors regard as useless, play a part in classification, that should attract some attention from doctors.
引用
收藏
页码:100786 / 100796
页数:11
相关论文
共 50 条
  • [41] Identification of OSAHS patients based on ReliefF-mRMR feature selection
    Ye, Ziqiang
    Peng, Jianxin
    Zhang, Xiaowen
    Song, Lijuan
    PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2024, 47 (01) : 99 - 108
  • [42] Feature Selection Based on Artificial Bee Colony for Parkinson Disease Diagnosis
    Badem, Hasan
    Turkusagi, Duran
    Caliskan, Abdullah
    Cil, Zeynel Abidin
    2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 224 - 227
  • [43] Feature Selection for Computer-Aided Polyp Detection using MRMR
    Yang, Xiaoyun
    Tek, Boray
    Beddoe, Gareth
    Slabaugh, Greg
    MEDICAL IMAGING 2010: COMPUTER - AIDED DIAGNOSIS, 2010, 7624
  • [44] Feature Selection Method for Nonintrusive Load Monitoring With Balanced Redundancy and Relevancy
    Bao, Sheng
    Zhang, Li
    Han, Xueshan
    Li, Wensheng
    Sun, Donglei
    Ren, Yijing
    Liu, Ningning
    Yang, Ming
    Zhang, Boyi
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2022, 58 (01) : 163 - 172
  • [45] Feature selection for fault level diagnosis of planetary gearboxes
    Liu, Zhiliang
    Zhao, Xiaomin
    Zuo, Ming J.
    Xu, Hongbing
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2014, 8 (04) : 377 - 401
  • [46] A Hybrid mRMR-Genetic Based Selection Method For The Prediction Of Epileptic Seizures
    Assi, E. Bou
    Sawan, M.
    Nguyen, D. K.
    Rihana, S.
    2015 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS), 2015, : 326 - 329
  • [47] A novel feature selection method and its application
    Li, Bing
    Chow, Tommy W. S.
    Huang, Di
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2013, 41 (02) : 235 - 268
  • [48] Nonlinear feature selection using Gaussian kernel SVM-RFE for fault diagnosis
    Xue, Yangtao
    Zhang, Li
    Wang, Bangjun
    Zhang, Zhao
    Li, Fanzhang
    APPLIED INTELLIGENCE, 2018, 48 (10) : 3306 - 3331
  • [49] Multi-scoring Feature selection method based on SVM-RFE for prostate cancer diagnosis
    Albashish, Dheeb
    Sahran, Shahnorbanun
    Abdullah, Azizi
    Adam, Afzan
    Abd Shukor, Nordashima
    Pauzi, Suria Hayati Md
    5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 682 - 686
  • [50] A novel hybrid feature selection method based on dynamic feature importance
    Wei, Guangfen
    Zhao, Jie
    Feng, Yanli
    He, Aixiang
    Yu, Jun
    APPLIED SOFT COMPUTING, 2020, 93