Application of Chi-square discretization algorithms to ensemble classification methods

被引:0
|
作者
Peker N. [1 ]
Kubat C. [1 ]
机构
[1] Department of Industrial Engineering, Faculty of Engineering, Sakarya University, Esentepe, 54187, Sakarya
关键词
Chi-square statistics; Classification; Data mining; Discretization; Ensemble methods; Machine learning;
D O I
10.1016/j.eswa.2021.115540
中图分类号
学科分类号
摘要
Classification is one of the important tasks in data mining and machine learning. Classification performance depends on many factors as well as data characteristics. Some algorithms are known to work better with discrete data. In contrast, most real-world data contain continuous variables. For algorithms working with discrete data, these continuous variables must be converted to discrete ones. In this process called discretization, continuous variables are converted to their corresponding discrete variables. In this paper, four Chi-square based supervised discretization algorithms ChiMerge(ChiM), Chi2, Extended Chi2(ExtChi2) and Modified Chi2(ModChi2) were used. In the literature, the performance of these algorithms is often tested with decision trees and Naïve Bayes classifiers. In this study, differently, four sets of data discretized by these algorithms were classified with ensemble methods. Classification accuracies for these data sets were obtained through using a stratified 10-fold cross-validation method. The classification performance of the original and discrete data sets of the methods is presented comparatively. According to the results, the performance of the discrete data is more successful than the original data. © 2021 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [41] A mapping study of ensemble classification methods in lung cancer decision support systems
    Mohamed Hosni
    Ginés García-Mateos
    Juan M. Carrillo-de-Gea
    Ali Idri
    José Luis Fernández-Alemán
    Medical & Biological Engineering & Computing, 2020, 58 : 2177 - 2193
  • [42] A mapping study of ensemble classification methods in lung cancer decision support systems
    Hosni, Mohamed
    Garcia-Mateos, Gines
    Carrillo-de-Gea, Juan M.
    Idri, Ali
    Fernandez-Aleman, Jose Luis
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (10) : 2177 - 2193
  • [43] Top-k Graph Similarity Search Algorithm Based on Chi-Square Statistics in Probabilistic Graphs
    Chen, Ziyang
    Zhuang, Junhao
    Wang, Xuan
    Tang, Xian
    Yang, Kun
    Du, Ming
    Zhou, Junfeng
    ELECTRONICS, 2024, 13 (01)
  • [44] Radiometric Normalization for Cross-Sensor Optical Gaofen Images with Change Detection and Chi-Square Test
    Yan, Li
    Yang, Jianbing
    Zhang, Yi
    Zhao, Anqi
    Li, Xi
    REMOTE SENSING, 2021, 13 (16)
  • [45] Smart Cities-Based Improving Atmospheric Particulate Matters Prediction Using Chi-Square Feature Selection Methods by Employing Machine Learning Techniques
    Mengash, Hanan Abdullah
    Hussain, Lal
    Mahgoub, Hany
    Al-Qarafi, A.
    Nour, Mohamed K.
    Marzouk, Radwa
    Qureshi, Shahzad Ahmad
    Hilal, Anwer Mustafa
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [46] Stacked Framework for Ensemble of Heterogeneous Classification Algorithms
    David, H. Benjamin Fredrick
    Suruliandi, A.
    Raja, S. P.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (15)
  • [47] Fusion of Chi-Square and Z-Test Statistics for Feature Selection with Machine Learning Techniques in Intrusion Detection
    Sharma, Amrendra Kumar
    Tiwari, Mamta
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT I, 2024, 2090 : 206 - 224
  • [48] Data Mining of Students' Response on the University Services using Chi-square Automatic Interaction Detector (CHAID) Algorithm
    Rosas, Maryli F.
    Ambat, Shaneth C.
    Ballera, Melvin A.
    PROCEEDINGS OF THE 2018 1ST IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE INNOVATION AND INVENTION (ICKII 2018), 2018, : 244 - 247
  • [49] Diabetic Retinopathy Fundus Image Classification Using Ensemble Methods
    Lukashevich, Marina M.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2024, 34 (02) : 331 - 339
  • [50] Balancing Performance Measures in Classification Using Ensemble Learning Methods
    Bahl, Neeraj
    Bansal, Ajay
    BUSINESS INFORMATION SYSTEMS, BIS 2019, PT II, 2019, 354 : 311 - 324