Simplified swarm optimization for hyperparameters of convolutional neural networks

被引:22
|
作者
Yeh, Wei -Chang [1 ]
Lin, Yi-Ping [1 ]
Liang, Yun-Chia [2 ]
Lai, Chyh-Ming [3 ]
Huang, Chia -Ling [4 ]
机构
[1] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Integrat & Collaborat Lab, Hsinchu 300, Taiwan
[2] Yuan Ze Univ, Ind Engn & Management, Taoyuan, Taiwan
[3] Natl Def Univ, Management Coll, Taoyuan, Taiwan
[4] Kainan Univ, Dept Int Logist & Transportat Management, Taoyuan 33857, Taiwan
关键词
Machine learning; Image recognition; Convolutional neural networks; Simplified swarm optimization; Hyperparameter optimization;
D O I
10.1016/j.cie.2023.109076
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Convolutional neural networks (CNNs) are widely used in image recognition. Numerous CNN models, such as LeNet, AlexNet, VGG, ResNet, and GoogLeNet, have been developed by increasing the number of layers to improve performance. However, performance deteriorates beyond a certain number of layers. Hence, hyper -parameter optimization is a more efficient way to improve CNNs. To validate this concept, in the present study, an algorithm based on simplified swarm optimization was developed for optimizing the hyperparameters of the simplest CNN model: LeNet. The results of experiments involving the MNIST, Fashion-MNIST, and CIFAR-10 datasets indicated that the accuracy of the proposed algorithm was higher than those of LeNet and PSO-LeNet and that the proposed algorithm can be applied to more complex models such as AlexNet.
引用
收藏
页数:11
相关论文
共 50 条
  • [11] Optimization of convolutional neural network hyperparameters for automatic classification of adult mosquitoes
    Motta, Daniel
    Bandeira Santos, Alex Alisson
    Souza Machado, Bruna Aparecida
    Ribeiro-Filho, Otavio Goncalvez Vicente
    Arriaga Camargo, Luis Octavio
    Valdenegro-Toro, Matias Alejandro
    Kirchner, Frank
    Badaro, Roberto
    PLOS ONE, 2020, 15 (07):
  • [12] Developing a Volunteer Computing Project to Evolve Convolutional Neural Networks and Their Hyperparameters
    Desell, Travis
    2017 IEEE 13TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2017, : 19 - 28
  • [13] Optimizing Hyperparameters for Thai Cuisine Recognition via Convolutional Neural Networks
    Theera-Ampornpunt, Nawanol
    Treepong, Panisa
    TRAITEMENT DU SIGNAL, 2023, 40 (03) : 1187 - 1193
  • [14] Analyzing the effect of hyperparameters in a automobile classifier based on convolutional neural networks
    Riveros, Elian Laura
    Chavez, Jose Galdos
    Caceres, Juan C. Gutierrez
    PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [15] Convolutional neural networks optimization using multi-objective particle swarm optimization algorithm
    Rashno, Armin
    Fadaei, Sadegh
    INFORMATION SCIENCES, 2025, 689
  • [16] An Algorithmic Framework for the Optimization of Deep Neural Networks Architectures and Hyperparameters
    Keisler, Julie
    Talbi, El-Ghazali
    Claudel, Sandra
    Cabriel, Gilles
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [17] Using Particle Swarm Optimization with Gradient Descent for Parameter Learning in Convolutional Neural Networks
    Wessels, Steven
    van der Haar, Dustin
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2021, 2021, 12702 : 119 - 128
  • [18] Effects of hyperparameters on flow field reconstruction around a foil by convolutional neural networks
    Wu, Xia
    Wu, Shaobo
    Tian, Xinliang
    Guo, Xiaoxian
    Luo, Xiaofeng
    OCEAN ENGINEERING, 2022, 247
  • [19] Practical hyperparameters tuning of convolutional neural networks for EEG emotional features classification
    Mezzah, Samia
    Tari, Abdelkamel
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18
  • [20] Characterization and optimization of mechanical properties in design materials using convolutional neural networks and particle swarm optimization
    Ali M.
    Hussein M.
    Asian Journal of Civil Engineering, 2024, 25 (3) : 2443 - 2457