Application of Chi-square discretization algorithms to ensemble classification methods

Authors
Peker N. [1]
Kubat C. [1]
Affiliation
[1] Department of Industrial Engineering, Faculty of Engineering, Sakarya University, Esentepe, 54187, Sakarya
Keywords
Chi-square statistics; Classification; Data mining; Discretization; Ensemble methods; Machine learning
DOI
10.1016/j.eswa.2021.115540
Abstract
Classification is one of the central tasks in data mining and machine learning. Classification performance depends on data characteristics as well as many other factors. Some algorithms are known to work better with discrete data, yet most real-world data contain continuous variables. For such algorithms, continuous variables must first be converted to corresponding discrete variables, a process called discretization. In this paper, four Chi-square based supervised discretization algorithms were used: ChiMerge (ChiM), Chi2, Extended Chi2 (ExtChi2), and Modified Chi2 (ModChi2). In the literature, the performance of these algorithms is usually tested with decision tree and Naïve Bayes classifiers. In this study, by contrast, four sets of data discretized by these algorithms were classified with ensemble methods. Classification accuracies for these data sets were obtained using stratified 10-fold cross-validation. The classification performance of the methods on the original and the discretized data sets is presented comparatively. According to the results, classification on the discretized data is more accurate than on the original data. © 2021 Elsevier Ltd
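To make the discretization step concrete, the following is a minimal sketch of ChiMerge, the simplest of the four algorithms named in the abstract. It is an illustration only, not the authors' implementation: the function names `chi2_pair` and `chimerge`, the default threshold of 2.706 (the 0.90 critical value of the Chi-square distribution with one degree of freedom, suitable for two classes), and the optional `max_intervals` cap are all assumptions. Classes absent from both intervals are skipped, which is a simplification of the usual small-constant correction for zero expected counts.

```python
from collections import Counter


def chi2_pair(a, b, classes):
    """Chi-square statistic for two adjacent intervals given as class Counters."""
    total = sum(a.values()) + sum(b.values())
    stat = 0.0
    for row in (a, b):
        row_total = sum(row.values())
        for c in classes:
            col_total = a[c] + b[c]
            expected = row_total * col_total / total
            # Simplification (assumption): skip classes absent from both intervals.
            if expected > 0:
                stat += (row[c] - expected) ** 2 / expected
    return stat


def chimerge(values, labels, threshold=2.706, max_intervals=None):
    """Return the sorted lower bounds of the intervals found by ChiMerge."""
    classes = sorted(set(labels))
    # Start with one interval per distinct value, holding its class counts.
    counts = {}
    for v, y in zip(values, labels):
        counts.setdefault(v, Counter())[y] += 1
    intervals = [(v, counts[v]) for v in sorted(counts)]

    while len(intervals) > 1:
        stats = [
            chi2_pair(intervals[i][1], intervals[i + 1][1], classes)
            for i in range(len(intervals) - 1)
        ]
        i = min(range(len(stats)), key=stats.__getitem__)
        too_many = max_intervals is not None and len(intervals) > max_intervals
        # Stop once the most similar pair is significantly different,
        # unless the optional interval cap has not been reached yet.
        if stats[i] > threshold and not too_many:
            break
        lower, left = intervals[i]
        intervals[i:i + 2] = [(lower, left + intervals[i + 1][1])]
    return [lower for lower, _ in intervals]


if __name__ == "__main__":
    xs = [1, 2, 3, 4, 10, 11, 12, 13]
    ys = ["a", "a", "a", "a", "b", "b", "b", "b"]
    print(chimerge(xs, ys))
```

Each returned value is the lower bound of one interval, so a continuous value is mapped to the last bound that does not exceed it. Chi2, Extended Chi2 and Modified Chi2 build on this merge loop by choosing the stopping criterion automatically rather than taking a fixed threshold.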