ModPSO-CNN: an evolutionary convolution neural network with application to visual recognition

被引:0
作者
Shanshan Tu
Sadaqat ur Rehman
Muhammad Waqas
Obaid ur Rehman
Zubair Shah
Zhongliang Yang
Anis Koubaa
机构
[1] Beijing University of Technology,Faculty of Information Technology
[2] Hamad Bin Khalifa University,Division of ICT, College of Science and Engineering
[3] Ghulam Ishaq Khan Institute of Engineering Science and Technology,Faculty of Computer Science and Engineering
[4] Department of Electrical Engineering Sarhad University of Science and IT,Tsinghua National Laboratory for Information Science and Technology
[5] Tsinghua University,Faculty of Computer Science, Robotics and Internet of Things Research Lab
[6] Prince Sultan University,undefined
[7] CISTER,undefined
[8] INESC-TEC,undefined
[9] ISEP,undefined
[10] Polytechnic Institute of Porto,undefined
[11] Department of Computer Science,undefined
[12] Namal Institute,undefined
来源
Soft Computing | 2021年 / 25卷
关键词
Particle swarm optimization; Convolution neural network; Backpropagation; Visual recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Training optimization plays a vital role in the development of convolution neural network (CNN). CNNs are hard to train because of the presence of multiple local minima. The optimization problem for a CNN is non-convex, hence, has multiple local minima. If any of the chosen hyper-parameters are not appropriate, it will end up at bad local minima, which leads to poor performance. Hence, proper optimization of the training algorithm for CNN is the key to converge to a good local minimum. Therefore, in this paper, we introduce an evolutionary convolution neural network (ModPSO-CNN) algorithm. The proposed algorithm results in the fusion of modified particle swarm optimization (ModPSO) along with backpropagation (BP) and convolution neural network (CNN). The training of CNN involves ModPSO along with backpropagation (BP) algorithm to encourage performance improvement by avoiding premature convergence and local minima. The ModPSO have adaptive, dynamic and improved parameters, to handle the issues in training CNN. The adaptive and dynamic parameters bring a proper balance between the global and local search ability, while an improved parameter keeps the diversity of the swarm. The proposed ModPSO algorithm is validated on three standard mathematical test functions and compared with three variants of the benchmark PSO algorithm. Furthermore, the performance of the proposed ModPSO-CNN is also compared with other training algorithms focusing on the analysis of computational cost, convergence and accuracy based on a standard problem specific to classification applications, such as CIFAR-10 dataset and face and skin detection dataset.
引用
收藏
页码:2165 / 2176
页数:11
相关论文
共 70 条
  • [1] Bulan O(2017)Segmentation-and annotation-free license plate recognition with deep localization and failure identification IEEE Trans Intell Transp Syst 18 2351-2363
  • [2] Kozitsky V(2015)Pcanet: a simple deep learning baseline for image classification? IEEE Trans Image Process 24 5017-5032
  • [3] Ramesh P(2015)Robust face recognition via multimodal deep face representation IEEE Trans Multimedia 17 2049-2058
  • [4] Shreve M(2017)Human identification from freestyle walks using posture-based gait feature IEEE Trans Inform Forensics Secur 13 119-128
  • [5] Chan T-H(1998)Gradient-based learning applied to document recognition Proc IEEE 86 2278-2324
  • [6] Jia K(2010)Convolutional networks and applications in vision ISCAS 2010 253-256
  • [7] Gao S(2013)An experimental investigation of two wavelet-mlp hybrid frameworks for wind speed prediction using ga and pso optimization Int J Electric Power Energy Syst 52 161-173
  • [8] Lu J(2017)Stacking pcanet+: an overly simplified convnets baseline for face recognition IEEE Signal Process Lett 24 1581-1585
  • [9] Zeng Z(2013)Particle swarm optimization of mlp for the identification of factors related to common mental disorders Expert Syst Appl 40 4648-4652
  • [10] Ma Y(2007)A pyramidal neural network for visual pattern recognition IEEE Trans Neural Networks 18 329-343