Ensemble learning method for classification: Integrating data envelopment analysis with machine learning

Citations: 0
Authors
An, Qingxian [1 ,2 ]
Huang, Siwei [1 ]
Han, Yuxuan [1 ]
Zhu, You [3 ,4 ]
Affiliations
[1] Cent South Univ, Sch Business, Changsha 410083, Peoples R China
[2] Hefei Univ Technol, Sch Econ, Hefei 230601, Peoples R China
[3] Hunan Univ, Business Sch, Changsha 410082, Peoples R China
[4] Hunan Prov Key Lab Philosophy & Social Sci Ind Dig, Changsha 410082, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Ensemble learning; Data envelopment analysis; Classifier; Large dataset; STATISTICAL COMPARISONS; CLASSIFIERS; EFFICIENCY; DEA;
DOI
10.1016/j.cor.2024.106739
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
In classification tasks with large sample sets, the use of a single classifier carries the risk of overfitting. To overcome this issue, an ensemble of classifier models has often been shown to outperform the use of a single "best" model. Given the rich variety of classifier models available, the selection of the high-efficiency classifiers for a given task dataset remains an urgent challenge. However, most of the previous classifier selection methods only focus on the measurement of classification output performance without considering the computational cost. This paper proposes a new ensemble learning method to improve the classification quality for big datasets by using data envelopment analysis. It contains the following two stages: classifier selection and classifier combination. In the first stage, the commonly used classifiers are evaluated on the basis of their performance on resource consumption and classification output performance using the range directional model (RDM); then, the most efficient classifiers are selected. In the second stage, the classifier confusion matrix is evaluated using the data envelopment analysis (DEA) cross-efficiency model. Then, the weight for the classifier combination is determined to ensure that classifiers with higher performance have greater weights based on the cross-efficiency values. Experimental results demonstrate the superiority of the cross-efficiency model over the BCC model and the benchmark voting method in model ensemble. Furthermore, our method has been shown to save more computational resources and yields better results than existing methods.
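The combination stage described in the abstract, where classifiers with higher cross-efficiency receive larger weights, can be illustrated with a minimal sketch. This is not the authors' implementation: the cross-efficiency values below are hypothetical placeholders rather than the output of a DEA model, the classifier names and predictions are invented for illustration, and normalizing the scores so they sum to 1 is an assumption about how weights might be derived from them.

```python
from collections import defaultdict


def normalize_weights(scores):
    """Scale non-negative efficiency scores so that they sum to 1."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("efficiency scores must have a positive sum")
    return {name: score / total for name, score in scores.items()}


def weighted_vote(predictions, weights):
    """For each sample, return the label with the largest total weight."""
    n_samples = len(next(iter(predictions.values())))
    combined = []
    for i in range(n_samples):
        tally = defaultdict(float)
        for name, labels in predictions.items():
            tally[labels[i]] += weights[name]
        combined.append(max(tally, key=tally.get))
    return combined


# Hypothetical cross-efficiency values (assumed, not computed by DEA here).
cross_efficiency = {"clf_a": 0.9, "clf_b": 0.4, "clf_c": 0.3}

# Hypothetical label predictions from three classifiers on four samples.
predictions = {
    "clf_a": [1, 0, 1, 1],
    "clf_b": [0, 0, 1, 0],
    "clf_c": [0, 1, 1, 0],
}

weights = normalize_weights(cross_efficiency)
ensemble_labels = weighted_vote(predictions, weights)
print(ensemble_labels)
```

With these placeholder scores, the first and last samples follow `clf_a` even though the other two classifiers disagree with it, which is where weighted voting departs from a plain majority vote.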
Pages: 17
Related papers
50 in total
  • [41] Ensemble of extreme learning machine for remote sensing image classification
    Han, Min
    Liu, Ben
    NEUROCOMPUTING, 2015, 149 : 65 - 70
  • [42] An ensemble method of the machine learning to prognosticate the gastric cancer
    Rezaei, Hirad Baradaran
    Amjadian, Alireza
    Sebt, Mohammad Vahid
    Askari, Reza
    Gharaei, Abolfazl
    ANNALS OF OPERATIONS RESEARCH, 2023, 328 (01) : 151 - 192
  • [43] Enrichment of Machine Learning based Activity Classification in Smart Homes using Ensemble Learning
    Agarwal, Bikash
    Chakravorty, Antorweep
    Wiktorski, Tomasz
    Rong, Chunming
    2016 IEEE/ACM 9TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2016, : 196 - 201
  • [45] Utilizing an Ensemble Machine Learning Framework for Handling Concept Drift in Spatiotemporal Data Streams Classification
    Angbera, Ature
    Chan, Huah Yong
    Informatica (Slovenia), 2024, 48 (02) : 213 - 222
  • [46] A Method of Imbalanced Traffic Classification Based on Ensemble Learning
    Ding, Yaojun
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2015, : 265 - 268
  • [47] Efficiency analysis for stochastic dynamic facility layout problem using meta-heuristic, data envelopment analysis and machine learning
    Tayal, Akash
    Kose, Utku
    Solanki, Arun
    Nayyar, Anand
    Marmolejo Saucedo, Jose Antonio
    COMPUTATIONAL INTELLIGENCE, 2020, 36 (01) : 172 - 202
  • [48] Evaluating the efficiency of system integration projects using data envelopment analysis (DEA) and machine learning
    Hong, HK
    Ha, SH
    Shin, CK
    Park, SC
    Kim, SH
    EXPERT SYSTEMS WITH APPLICATIONS, 1999, 16 (03) : 283 - 296
  • [49] An ensemble deep learning method as data fusion system for remote sensing multisensor classification
    Bigdeli, Behnaz
    Pahlavani, Parham
    Amirkolaee, Hamed Amini
    APPLIED SOFT COMPUTING, 2021, 110
  • [50] Classification of Multi-class Microarray Cancer Data Using Ensemble Learning Method
    Shekar, B. H.
    Dagnew, Guesh
    DATA ANALYTICS AND LEARNING, 2019, 43 : 279 - 292