Hybrid Algorithm Based on Simulated Annealing and Bacterial Foraging Optimization for Mining Imbalanced Data

被引:1
作者
Lee, Chou-Yuan [1 ]
Lee, Zne-Jung [1 ]
Huang, Jian-Qiong [1 ]
Ye, Fu-Lan [1 ]
Yao, Jie [1 ]
Ning, Zheng-Yuan [1 ]
Meen, Teen-Hang [2 ]
机构
[1] Fuzhou Univ Int Studies & Trade, Sch Technol, Fuzhou 350202, Peoples R China
[2] Natl Formosa Univ, Dept Elect Engn, Huwei 632, Yunlin, Taiwan
关键词
bacterial foraging optimization; simulated annealing; hybrid; chemotaxis; imbalanced data;
D O I
10.18494/SAM.2021.3167
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
The bacterial foraging optimization ( BFO) algorithm can simulate the mechanism of natural selection. However, as the direction of inversion is uncertain in the chemotaxis process, it easily falls into a local optimum. We propose a hybrid algorithm based on simulated annealing (SA) and BFO for mining imbalanced data. The key idea is to exploit the advantages of both SA and the BFO algorithm. In the proposed algorithm, SA finds the optimal solution by employing a jump process, so as to solve the uncertainty of the reversal direction in the chemotaxis process of BFO and avoid falling into a local optimum. SA is used to improve the chemotaxis process of BFO, and then the swarming process, reproduction process, and elimination-dispersal process of BFO are implemented. Four imbalanced datasets are used to test the performance of the proposed hybrid algorithm. In each imbalanced dataset used for testing, there is a certain correlation between the variables, making the dataset multivariate. Through the proposed algorithm, these four multivariate imbalanced datasets are effectively classified, and its performance compared with that of other algorithms. Experimental results show that for the different multivariate imbalanced datasets, the proposed algorithm is better than the original BFO algorithm in terms of various performance indicators. By combining the proposed algorithm with sensor-related technology, in the future, medical multivariate data and security monitoring system data obtained by sensors can be analyzed to improve the classification accuracy of multivariate data.
引用
收藏
页码:1297 / 1312
页数:16
相关论文
共 20 条
  • [1] Cervical Cancer Diagnosis Using Random Forest Classifier With SMOTE and Feature Reduction Techniques
    Abdoh, Sherif F.
    Rizka, Mohamed Abo
    Maghraby, Fahima A.
    [J]. IEEE ACCESS, 2018, 6 : 59475 - 59485
  • [2] Blake C. L., 1998, UCI REPOSITORY MACHI
  • [3] Imbalanced Data Fault Diagnosis Based on an Evolutionary Online Sequential Extreme Learning Machine
    Hao, Wei
    Liu, Feng
    [J]. SYMMETRY-BASEL, 2020, 12 (08):
  • [4] OPTIMIZATION BY SIMULATED ANNEALING
    KIRKPATRICK, S
    GELATT, CD
    VECCHI, MP
    [J]. SCIENCE, 1983, 220 (4598) : 671 - 680
  • [5] A novel algorithm applied to classify unbalanced data
    Lee, Chou-Yuan
    Lee, Zne-Jung
    [J]. APPLIED SOFT COMPUTING, 2012, 12 (08) : 2481 - 2485
  • [6] A Visual Analytics Approach for Understanding Reasons behind Snowballing and Comeback in MOBA Games
    Li, Quan
    Xu, Peng
    Chan, Yeuk Yin
    Wang, Yun
    Wang, Zhipeng
    Qu, Huamin
    Ma, Xiaojuan
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 211 - 220
  • [7] Model-Based Synthetic Sampling for Imbalanced Data
    Liu, Chien-Liang
    Hsieh, Po-Yen
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (08) : 1543 - 1556
  • [8] Animated Character Style Investigation with Decision Tree Classification
    Liu, Kun
    Chang, Kang-Ming
    Liu, Ying-Ju
    Chen, Jun-Hong
    [J]. SYMMETRY-BASEL, 2020, 12 (08):
  • [9] EQUATION OF STATE CALCULATIONS BY FAST COMPUTING MACHINES
    METROPOLIS, N
    ROSENBLUTH, AW
    ROSENBLUTH, MN
    TELLER, AH
    TELLER, E
    [J]. JOURNAL OF CHEMICAL PHYSICS, 1953, 21 (06) : 1087 - 1092
  • [10] Coevolutionary Structure-Redesigned-Based Bacterial Foraging Optimization
    Niu, Ben
    Liu, Jing
    Wu, Teresa
    Chu, Xianghua
    Wang, Zhengxu
    Liu, Yanmin
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (06) : 1865 - 1876