Application of Chi-square discretization algorithms to ensemble classification methods

被引:0
|
作者
Peker N. [1 ]
Kubat C. [1 ]
机构
[1] Department of Industrial Engineering, Faculty of Engineering, Sakarya University, Esentepe, 54187, Sakarya
关键词
Chi-square statistics; Classification; Data mining; Discretization; Ensemble methods; Machine learning;
D O I
10.1016/j.eswa.2021.115540
中图分类号
学科分类号
摘要
Classification is one of the important tasks in data mining and machine learning. Classification performance depends on many factors as well as data characteristics. Some algorithms are known to work better with discrete data. In contrast, most real-world data contain continuous variables. For algorithms working with discrete data, these continuous variables must be converted to discrete ones. In this process called discretization, continuous variables are converted to their corresponding discrete variables. In this paper, four Chi-square based supervised discretization algorithms ChiMerge(ChiM), Chi2, Extended Chi2(ExtChi2) and Modified Chi2(ModChi2) were used. In the literature, the performance of these algorithms is often tested with decision trees and Naïve Bayes classifiers. In this study, differently, four sets of data discretized by these algorithms were classified with ensemble methods. Classification accuracies for these data sets were obtained through using a stratified 10-fold cross-validation method. The classification performance of the original and discrete data sets of the methods is presented comparatively. According to the results, the performance of the discrete data is more successful than the original data. © 2021 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [31] ECG classification with learning ensemble based on symbolic discretization
    Taktak, Mariem
    Ltifi, Hela
    Ben Ayed, Mounir
    INFORMATION SYSTEMS, 2024, 120
  • [32] A new indirect estimation of reference intervals: truncated minimum chi-square (TMC) approach
    Wosniok, Werner
    Haeckel, Rainer
    CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2019, 57 (12) : 1933 - 1947
  • [33] A novel chi-square statistic for detecting group differences between pathways in systems epidemiology
    Yuan, Zhongshang
    Ji, Jiadong
    Zhang, Tao
    Liu, Yi
    Zhang, Xiaoshuai
    Chen, Wei
    Xue, Fuzhong
    STATISTICS IN MEDICINE, 2016, 35 (29) : 5512 - 5524
  • [34] Tree-Based Ensemble Models and Algorithms for Classification
    Tsiligaridis, J.
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 103 - 106
  • [35] Impact of discretization methods on the rough set-based classification of remotely sensed images
    Ge, Y.
    Cao, F.
    Duan, R. F.
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2011, 4 (04) : 330 - 346
  • [36] Approximated Chi-Square Distance for Histogram Matching in Facial Image Analysis: Face and Expression Recognition
    Sadeghi, Hamid
    Raie, Abolghasem-A.
    2017 10TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2017, : 188 - 191
  • [37] Ensemble Methods for Spatial Data Stream Classification
    King, Liam
    Osborn, Wendy
    18TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS, FNC 2023/20TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING, MOBISPC 2023/13TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY, SEIT 2023, 2023, 224 : 155 - 162
  • [38] Academic Analytics Implemented for Students Performance in Terms of Canonical Correlation Analysis and Chi-Square Analysis
    Muley, Aniket
    Bhalchandra, Parag
    Joshi, Mahesh
    Wasnik, Pawan
    INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2016), 2018, 625 : 269 - 277
  • [39] Risk Assessment Score and Chi-Square Automatic Interaction Detection Algorithm for Hypertension Among Africans: Models From the SIREN Study
    Asowata, Osahon J.
    Okekunle, Akinkunmi Paul
    Akpa, Onoja M.
    Fakunle, Adekunle Gregory
    Akinyemi, Joshua O.
    Komolafe, Morenikeji Adeyoyin
    Sarfo, Fred Stephen
    Akpalu, Albert K.
    Obiako, Reginald
    Wahab, Kolawole W.
    Osaigbovo, Godwin O.
    Owolabi, Lukman F.
    Jenkins, Carolyn M.
    Calys-Tagoe, Benedict Nii Laryea
    Arulogun, Oyedunni Sola
    Ogbole, Godwin I.
    Ogah, Okechukwu Samuel
    Lambert, Appiah T.
    Ibinaiye, Philip Oluleke
    Adebayo, Philip B.
    Singh, Arti
    Adeniyi, Sunday Adebori
    Mensah, Yaw B.
    Laryea, Ruth Y.
    Balogun, Olayemi
    Chukwuonye, Innocent Ijezie
    Akinyemi, Rufus O.
    Ovbiagele, Bruce
    Owolabi, Mayowa Ojo
    HYPERTENSION, 2023, 80 (12) : 2581 - 2590
  • [40] Risk upper bounds for general ensemble methods with an application to multiclass classification
    Laviolette, Francois
    Morvant, Emilie
    Ralaivola, Liva
    Roy, Jean-Francis
    NEUROCOMPUTING, 2017, 219 : 15 - 25