Application of Chi-square discretization algorithms to ensemble classification methods

被引:0
|
作者
Peker N. [1 ]
Kubat C. [1 ]
机构
[1] Department of Industrial Engineering, Faculty of Engineering, Sakarya University, Esentepe, 54187, Sakarya
关键词
Chi-square statistics; Classification; Data mining; Discretization; Ensemble methods; Machine learning;
D O I
10.1016/j.eswa.2021.115540
中图分类号
学科分类号
摘要
Classification is one of the important tasks in data mining and machine learning. Classification performance depends on many factors as well as data characteristics. Some algorithms are known to work better with discrete data. In contrast, most real-world data contain continuous variables. For algorithms working with discrete data, these continuous variables must be converted to discrete ones. In this process called discretization, continuous variables are converted to their corresponding discrete variables. In this paper, four Chi-square based supervised discretization algorithms ChiMerge(ChiM), Chi2, Extended Chi2(ExtChi2) and Modified Chi2(ModChi2) were used. In the literature, the performance of these algorithms is often tested with decision trees and Naïve Bayes classifiers. In this study, differently, four sets of data discretized by these algorithms were classified with ensemble methods. Classification accuracies for these data sets were obtained through using a stratified 10-fold cross-validation method. The classification performance of the original and discrete data sets of the methods is presented comparatively. According to the results, the performance of the discrete data is more successful than the original data. © 2021 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] A comparative analysis of machine learning algorithms for waste classification: inceptionv3 and chi-square features
    E. T. Yasin
    M. Koklu
    International Journal of Environmental Science and Technology, 2025, 22 (10) : 9415 - 9428
  • [2] A fuzzy rough granular ensemble learning based on the feature selection with chi-square
    Hou, Xianyu
    Chen, Yumin
    Wu, Keshou
    Zhou, Ying
    Lu, Junwen
    Weng, Xuan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 6201 - 6217
  • [3] A Chi-square Statistics Based Feature Selection Method in Text Classification
    Zhai, Yujia
    Song, Wei
    Liu, Xianjun
    Liu, Lizhen
    Zhao, Xinlei
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 160 - 163
  • [4] Learning Word Embeddings with Chi-Square Weights for Healthcare Tweet Classification
    Kuang, Sicong
    Davison, Brian D.
    APPLIED SCIENCES-BASEL, 2017, 7 (08):
  • [5] Analyzing the characteristics of application traffic behavior based on chi-square statistics
    Chen L.
    Gong J.
    Ruan Jian Xue Bao/Journal of Software, 2010, 21 (11): : 2852 - 2865
  • [6] Classification of Categorical Data Based on the Chi-Square Dissimilarity and t-SNE
    Cardona, Luis Ariosto Serna
    Vargas-Cardona, Hernan Dario
    Navarro Gonzalez, Piedad
    Cardenas Pena, David Augusto
    Orozco Gutierrez, Alvaro Angel
    COMPUTATION, 2020, 8 (04) : 1 - 15
  • [7] Multi-Label Active Learning with Chi-Square Statistics for Image Classification
    Ye, Chen
    Wu, Jian
    Sheng, Victor S.
    Zhao, Shiquan
    Zhao, Pengpeng
    Cui, Zhiming
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 583 - 586
  • [8] An Improved Ensemble-Based Cardiovascular Disease Detection System with Chi-Square Feature Selection
    Korial, Ayad E.
    Gorial, Ivan Isho
    Humaidi, Amjad J.
    COMPUTERS, 2024, 13 (06)
  • [9] Document Classification Using Word2Vec and Chi-square on Apache Spark
    Choi, Mijin
    Jin, Rize
    Chung, Tae-Sun
    ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2017, 421 : 867 - 872
  • [10] Reviewing ensemble classification methods in breast cancer
    Hosni, Mohamed
    Abnane, Ibtissam
    Idri, Ali
    Carrillo de Gea, Juan M.
    Fernandez Aleman, Jose Luis
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 177 : 89 - 112