Improvement of Bagging performance for classification of imbalanced datasets using evolutionary multi-objective optimization

被引:74
作者
Roshan, Seyed Ehsan [1 ]
Asadi, Shahrokh [1 ]
机构
[1] Univ Tehran, Coll Farabi, Dept Engn, Data Min Lab, Tehran, Iran
关键词
Multi-objective evolutionary; Imbalanced datasets; Ensemble learning; Bagging; Undersampling; Diversity; SUPPORT VECTOR MACHINES; REJECTIVE MULTIPLE TEST; ENSEMBLE METHOD; SMOTE; ALGORITHMS; DIVERSITY; SYSTEM; MARGIN;
D O I
10.1016/j.engappai.2019.103319
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today, classification of imbalanced datasets, in which the samples belonging to one class is more than the samples pertaining to other classes, has been paid much attention owing to its vast application in real-world problems. Bagging ensemble method, as one of the most favorite ensemble learning algorithms can provide better performance in solving imbalanced problems when is incorporated with undersampling methods. In Bagging method, diversity of classifiers, performance of classifiers, appropriate number of bags (classifiers) and balanced training sets to train the classifiers are important factors in successfulness of Bagging so as to deal with imbalanced problems. In this paper, through inspiring of evolutionary undersampling (the new undersampling method for seeking the subsets of majority class samples) and taking the mentioned factors into account, i.e., diversity, performance of classifiers, number of classifiers and balanced training set, a multi-objective optimization undersampling is proposed. The proposed method uses multi-objective evolutionary to produce set of diverse, well-performing and (near) balanced bags. Accordingly, the proposed method provides the possibility of generating diverse and well-performing classifiers and determining the number of classifiers in Bagging algorithm. Moreover, two different strategies are employed in the proposed method so as to improve the diversity. In order to confirm the proposed method's efficiency, its performance is measured over 33 imbalanced datasets using AUC and then compared with 6 well-known ensemble learning algorithms. Investigating the obtained results of such comparisons using non-parametric statistical analysis demonstrate the dominancy of the proposed method compared to other employed techniques, as well.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Spread Assessment for Evolutionary Multi-Objective Optimization
    Li, Miqing
    Zheng, Jinhua
    [J]. EVOLUTIONARY MULTI-CRITERION OPTIMIZATION: 5TH INTERNATIONAL CONFERENCE, EMO 2009, 2009, 5467 : 216 - 230
  • [22] An Analysis on Recombination in Multi-Objective Evolutionary Optimization
    Qian, Chao
    Yu, Yang
    Zhou, Zhi-Hua
    [J]. GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 2051 - 2058
  • [23] Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets
    Pietro Ducange
    Beatrice Lazzerini
    Francesco Marcelloni
    [J]. Soft Computing, 2010, 14 : 713 - 728
  • [24] Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets
    Ducange, Pietro
    Lazzerini, Beatrice
    Marcelloni, Francesco
    [J]. SOFT COMPUTING, 2010, 14 (07) : 713 - 728
  • [25] A multi-objective evolutionary algorithm based on a grid with adaptive divisions for multi-objective optimization with irregular Pareto fronts
    Liu, Zhe
    Han, Fei
    Ling, Qinghua
    Han, Henry
    Jiang, Jing
    Liu, Qing
    [J]. APPLIED SOFT COMPUTING, 2025, 176
  • [26] Development of ensemble learning classification with density peak decomposition-based evolutionary multi-objective optimization
    SeyedEhsan Roshan
    Shahrokh Asadi
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 1737 - 1751
  • [27] Development of ensemble learning classification with density peak decomposition-based evolutionary multi-objective optimization
    Roshan, SeyedEhsan
    Asadi, Shahrokh
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1737 - 1751
  • [28] EMoSOA: a new evolutionary multi-objective seagull optimization algorithm for global optimization
    Gaurav Dhiman
    Krishna Kant Singh
    Adam Slowik
    Victor Chang
    Ali Riza Yildiz
    Amandeep Kaur
    Meenakshi Garg
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 571 - 596
  • [29] EMoSOA: a new evolutionary multi-objective seagull optimization algorithm for global optimization
    Dhiman, Gaurav
    Singh, Krishna Kant
    Slowik, Adam
    Chang, Victor
    Yildiz, Ali Riza
    Kaur, Amandeep
    Garg, Meenakshi
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 571 - 596
  • [30] Multi-objective Evolutionary Ensemble Learning for Disease Classification
    Li, Nan
    Ma, Lianbo
    Zhang, Tian
    He, Meirui
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 491 - 500