SVM ensemble training for imbalanced data classification using multi-objective optimization techniques

被引:0
|
作者
Joanna Grzyb
Michał Woźniak
机构
[1] Wroclaw University of Science and Technology,Department of Systems and Computer Networks
来源
Applied Intelligence | 2023年 / 53卷
关键词
Pattern classification; Imbalanced data; Classifier ensemble; SVM; Multi-objective optimization; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
One of the main problems with classifier training for imbalanced data is defining the correct learning criterion. On the one hand, we want the minority class to be correctly recognized, and on the other hand, we do not want to make too many mistakes in the majority class. Commonly used metrics focus either on the predictive quality of the distinguished class or propose an aggregation of simple metrics. The aggregate metrics, such as Gmean or AUC, are primarily ambiguous, i.e., they do not indicate the specific values of errors made on the minority or majority class. Additionally, improper use of aggregate metrics results in solutions selected with their help that may favor the majority class. The authors realize that a solution to this problem is using overall risk. However, this requires knowledge of the costs associated with errors made between classes, which is often unavailable. Hence, this paper will propose the semoos algorithm - an approach based on multi-objective optimization that optimizes criteria related to the prediction quality of both minority and majority classes. semoos returns a pool of non-dominated solutions from which the user can choose the model that best suits him. Automatic solution selection formulas with a so-called Pareto front have also been proposed to compare state-of-the-art methods. The proposed approach will train a svm classifier ensemble dedicated to the imbalanced data classification task. The experimental evaluations carried out on a large number of benchmark datasets confirm its usefulness.
引用
收藏
页码:15424 / 15441
页数:17
相关论文
共 50 条
  • [1] SVM ensemble training for imbalanced data classification using multi-objective optimization techniques
    Grzyb, Joanna
    Wozniak, Michal
    APPLIED INTELLIGENCE, 2023, 53 (12) : 15424 - 15441
  • [2] Ensemble learning by means of a multi-objective optimization design approach for dealing with imbalanced data sets
    Ribeiro, Victor Henrique Alves
    Reynoso-Meza, Gilberto
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 147
  • [3] Adaptive multi-objective swarm fusion for imbalanced data classification
    li, Jinyan
    Fong, Simon
    Wong, Raymond K.
    Chu, Victor W.
    INFORMATION FUSION, 2018, 39 : 1 - 24
  • [4] Imbalanced Protein Data Classification Using Ensemble FTM-SVM
    Dai, Hong-Liang
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2015, 14 (04) : 350 - 359
  • [5] Improvement of Bagging performance for classification of imbalanced datasets using evolutionary multi-objective optimization
    Roshan, Seyed Ehsan
    Asadi, Shahrokh
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 87
  • [6] Early classification of time series using multi-objective optimization techniques
    Mori, U.
    Mendiburu, A.
    Miranda, I. M.
    Lozano, J. A.
    INFORMATION SCIENCES, 2019, 492 : 204 - 218
  • [7] Classification and Merging Techniques to Reduce Brokerage Using Multi-Objective Optimization
    Bettahalli Kengegowda, Dhanalakshmi
    Kamidoddi Chowdaiah, Srikantaiah
    Lokesh, Gururaj Harinahalli
    Flammini, Francesco
    ALGORITHMS, 2022, 15 (02)
  • [8] Ensemble Classification of PolSAR Data Using Multi-Objective Heuristic Combination Rule
    Saleh, Reza
    Farsi, Hasan
    Zahiri, Seyyed Hamid
    2016 1ST CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC 2016), 2016, : 88 - 92
  • [9] Multi-view ensemble learning using multi-objective particle swarm optimization for high dimensional data classification
    Kumar, Vipin
    Aydav, Prem Shankar Singh
    Minz, Sonajharia
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 8523 - 8537
  • [10] Multi-objective Automatic Algorithm Configuration for the Classification Problem of Imbalanced Data
    Tari, Sara
    Szczepanski, Nicolas
    Mousin, Lucien
    Jacques, Julie
    Kessaci, Marie-Eleonore
    Jourdan, Laetitia
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,