Application of Chi-square discretization algorithms to ensemble classification methods

Authors
Peker N. [1]
Kubat C. [1]
Affiliations
[1] Department of Industrial Engineering, Faculty of Engineering, Sakarya University, Esentepe, 54187, Sakarya
Keywords
Chi-square statistics; Classification; Data mining; Discretization; Ensemble methods; Machine learning
DOI
10.1016/j.eswa.2021.115540
Abstract
Classification is one of the important tasks in data mining and machine learning. Classification performance depends on data characteristics as well as many other factors. Some algorithms are known to work better with discrete data, whereas most real-world data contain continuous variables. For such algorithms, continuous variables must first be converted to corresponding discrete variables, a process called discretization. In this paper, four Chi-square based supervised discretization algorithms were used: ChiMerge (ChiM), Chi2, Extended Chi2 (ExtChi2) and Modified Chi2 (ModChi2). In the literature, the performance of these algorithms is often tested with decision trees and Naïve Bayes classifiers. In this study, by contrast, four sets of data discretized by these algorithms were classified with ensemble methods. Classification accuracies for these data sets were obtained using stratified 10-fold cross-validation. The classification performance of the methods on the original and discretized data sets is presented comparatively. According to the results, classification performance on the discretized data is better than on the original data. © 2021 Elsevier Ltd
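As an illustration of the kind of Chi-square based supervised discretization the abstract refers to, the following is a minimal sketch of the ChiMerge idea: start with one interval per distinct value and repeatedly merge the adjacent pair with the lowest Chi-square statistic until every adjacent pair exceeds a threshold. It is not the authors' implementation. The function names `chi2_adjacent` and `chimerge`, the default threshold of 2.706 (the 0.90 critical value for one degree of freedom, which suits a two-class problem), and the choice to skip cells with zero expected count are all assumptions made for this example.

```python
from collections import Counter


def chi2_adjacent(left, right, classes):
    """Pearson chi-square statistic for the class counts of two adjacent intervals."""
    total = sum(left.values()) + sum(right.values())
    chi2 = 0.0
    for row in (left, right):
        row_total = sum(row.values())
        for c in classes:
            col_total = left[c] + right[c]
            expected = row_total * col_total / total
            # Assumption: cells with zero expected count contribute nothing.
            if expected > 0:
                chi2 += (row[c] - expected) ** 2 / expected
    return chi2


def chimerge(values, labels, threshold=2.706):
    """Return the lower bounds of the intervals left after bottom-up merging."""
    classes = sorted(set(labels))
    # One initial interval per distinct value, holding its class counts.
    counts = {}
    for v, y in zip(values, labels):
        counts.setdefault(v, Counter())[y] += 1
    intervals = [(v, counts[v]) for v in sorted(counts)]

    while len(intervals) > 1:
        scores = [
            chi2_adjacent(intervals[i][1], intervals[i + 1][1], classes)
            for i in range(len(intervals) - 1)
        ]
        i = min(range(len(scores)), key=scores.__getitem__)
        # Stop once even the most similar adjacent pair differs significantly.
        if scores[i] >= threshold:
            break
        lower, left = intervals[i]
        _, right = intervals[i + 1]
        intervals[i:i + 2] = [(lower, left + right)]

    return [lower for lower, _ in intervals]


if __name__ == "__main__":
    values = [1, 2, 3, 4, 5, 6, 7, 8]
    labels = ["a", "a", "a", "a", "b", "b", "b", "b"]
    print(chimerge(values, labels))
```

On this toy input the class changes between 4 and 5, so the sketch keeps two intervals, one starting at 1 and one starting at 5. For problems with more than two classes the threshold would need to be chosen for the corresponding degrees of freedom (number of classes minus one).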