Adaptive multi-objective swarm fusion for imbalanced data classification

被引:56
作者
li, Jinyan [1 ]
Fong, Simon [1 ]
Wong, Raymond K. [2 ]
Chu, Victor W. [2 ]
机构
[1] Univ Macau, Dept Comp Informat Sci, Macau, Peoples R China
[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
关键词
Swarm fusion; Swarm intelligence algorithm; Multi-objective; Crossover rebalancing; Imbalanced data classification; OPTIMIZATION; ALGORITHMS; PERFORMANCE; AGREEMENT; DESIGN; POWER;
D O I
10.1016/j.inffus.2017.03.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a classifier from an imbalanced dataset is an important problem in data mining and machine learning. Since there is more information from the majority classes than the minorities in an imbalanced dataset, the classifier would become over-fitted to the former and under-fitted to the latter classes. Previous attempts to address the problem have been focusing on increasing the learning sensitivity to the minorities and/or rebalancing sample sizes among classes before learning. However, how to efficiently identify their optimal mix in rebalancing is still an unresolved problem. Due to non-linear relationships between attributes and class labels, merely to rebalance sample sizes rarely comes up with optimal results. Moreover, brute-force search for the perfect combination is known to be NP-hard and hence a smarter heuristic is required. In this paper, we propose a notion of swarm fusion to address the problem using stochastic swarm heuristics to cooperatively optimize the mixtures. Comparing with conventional rebalancing methods, e.g., linear search, our novel fusion approach is able to find a close to optimal mix with improved accuracy and reliability. Most importantly, it has found to be with higher computational speed than other coupled swarm optimization techniques and iteration methods. In our experiments, we first compared our proposed solution with traditional methods on thirty publicly available imbalanced datasets. Using neural network as base learner, our proposed method is found to outperform other traditional methods by up to 69% in terms of the credibility of the learned classifiers. Secondly, we wrapped our proposed swarm fusion method with decision tree. Notably, it defeated six state-of-the-art methods on ten imbalanced datasets in all evolution metrics that we considered. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 24
页数:24
相关论文
共 50 条
  • [31] A Multi-Objective Formulation for Facility Layout Problem
    Jaafari, Amir Ardestani
    Krishnan, Krishna K.
    Doulabi, Seyed Hossein Hashemi
    Davoudpour, Hamid
    [J]. WCECS 2009: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 1238 - +
  • [32] A multi-objective heuristic algorithm for gene expression microarray data classification
    Lv, Jia
    Peng, Qinke
    Chen, Xiao
    Sun, Zhi
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 59 : 13 - 19
  • [33] A dynamic locality multi-objective salp swarm algorithm for feature selection
    Aljarah, Ibrahim
    Habib, Maria
    Faris, Hossam
    Al-Madi, Nailah
    Heidari, Ali Asghar
    Mafarja, Majdi
    Abd Elaziz, Mohamed
    Mirjalili, Seyedali
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 147
  • [34] Multi-objective Optimization of Parallel Manipulators using a Particle Swarm Algorithm
    Lopes, Antonio M.
    Freire, Helio
    De Moura Oliveira, P. B.
    Solteiro Pires, E. J.
    Reis, Cecilia
    [J]. NEW ASPECTS OF APPLIED INFORMATICS, BIOMEDICAL ELECTRONICS AND INFORMATICS AND COMMUNICATION, 2010, : 103 - +
  • [35] A swarm intelligence graph-based pathfinding algorithm (SIGPA) for multi-objective route planning
    Ntakolia, Charis
    Iakovidis, Dimitris K.
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2021, 133 (133)
  • [36] Multi-Objective Particle Swarm Optimization Approach for Cost-Based Feature Selection in Classification
    Zhang, Yong
    Gong, Dun-wei
    Cheng, Jian
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (01) : 64 - 75
  • [37] Fuzzy adaptive cat swarm algorithm and Borda method for solving dynamic multi-objective problems
    Orouskhani, Maysam
    Shi, Daming
    [J]. EXPERT SYSTEMS, 2018, 35 (04)
  • [38] An adaptive multi-objective particle swarm optimisation algorithm based on fitness distance to streamline repository
    Wang, Suyu
    Ma, Dengcheng
    Ren, Ze
    Qu, Yuanyuan
    Wu, Miao
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2022, 20 (04) : 209 - 219
  • [39] Adaptive chaotic particle swarm algorithm for isogeometric multi-objective size optimization of FG plates
    Wang, Chao
    Yu, Tiantang
    Curiel-Sosa, Jose L.
    Xie, Nenggang
    Tinh Quoc Bui
    [J]. STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2019, 60 (02) : 757 - 778
  • [40] Endowing the MIA Cloud Autoscaler with Adaptive Evolutionary and Particle Swarm Multi-Objective Optimization Algorithms
    Yannibelli, Virginia
    Pacini, Elina
    Monge, David
    Mateos, Cristian
    Rodriguez, Guillermo
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE (MICAI 2021), PT I, 2021, 13067 : 383 - 400