Neural Architecture Search Based on a Multi-Objective Evolutionary Algorithm With Probability Stack

Cited by: 55
Authors
Xue, Yu [1]
Chen, Chen [1]
Slowik, Adam [2]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[2] Koszalin Univ Technol, Dept Elect & Comp Sci, Koszalin PL-75453, Poland
Funding
National Natural Science Foundation of China
Keywords
Deep learning; evolutionary computation; multiobjective optimization; neural architecture search (NAS)
DOI
10.1109/TEVC.2023.3252612
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Deep neural networks have driven major breakthroughs in many research fields, such as image classification, object detection, speech recognition, natural language processing, machine translation, and autonomous driving, and the resulting advances have been applied successfully in many real-life applications. Combining evolutionary computation with neural architecture search (NAS) is an important approach to improving the performance of deep neural networks. Researchers in this area usually focus only on precision, so the searched architectures often perform poorly on other measures such as time cost. In this article, a multi-objective evolutionary algorithm with a probability stack (MOEA-PS) is proposed for NAS, which considers two objectives: precision and time consumption. MOEA-PS uses an adjacency list to represent the internal structure of deep neural networks. In addition, a unique mechanism is introduced into the multi-objective genetic algorithm to guide crossover and mutation when generating offspring. Furthermore, the structure blocks are stacked using a proxy model to generate deep neural networks. Experimental results on CIFAR-10 and CIFAR-100 demonstrate that the proposed algorithm achieves an error rate similar to that of the most advanced NAS algorithms at a lower time cost. Finally, the network structure searched on CIFAR-10 is transferred directly to the ImageNet dataset, where it achieves 73.6% classification accuracy.
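The abstract names two ingredients that are easy to picture in code: an adjacency-list encoding of a network block, and a comparison of candidates on two objectives to be minimized (error rate and time cost). The sketch below is only a generic illustration of those two ideas under stated assumptions. The `Block` type, the example numbers, and all function names are hypothetical, and nothing here reproduces the paper's actual encoding, its probability stack, or its proxy model, none of which are specified in the abstract.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding: node index -> list of successor node indices.
# Assumption: every node appears as a key, even if it has no successors.
Block = Dict[int, List[int]]


def topological_order(block: Block) -> List[int]:
    """Order nodes so every edge points forward (Kahn's algorithm)."""
    indegree = {node: 0 for node in block}
    for successors in block.values():
        for s in successors:
            indegree[s] += 1
    ready = [node for node, d in indegree.items() if d == 0]
    order: List[int] = []
    while ready:
        node = ready.pop()
        order.append(node)
        for s in block[node]:
            indegree[s] -= 1
            if indegree[s] == 0:
                ready.append(s)
    if len(order) != len(block):
        raise ValueError("block contains a cycle")
    return order


Score = Tuple[float, float]  # (error rate, time cost), both minimized


def dominates(a: Score, b: Score) -> bool:
    """True if a is no worse than b on both objectives and differs from b."""
    return a[0] <= b[0] and a[1] <= b[1] and a != b


def pareto_front(scores: List[Score]) -> List[Score]:
    """Keep only the scores that no other score dominates."""
    return [p for p in scores if not any(dominates(q, p) for q in scores)]


# Illustrative block: node 0 feeds nodes 1 and 2, which both feed node 3.
example_block: Block = {0: [1, 2], 1: [3], 2: [3], 3: []}

# Made-up (error rate, time cost) pairs, not results from the paper.
example_scores: List[Score] = [(0.05, 10.0), (0.04, 12.0), (0.06, 11.0), (0.04, 15.0)]
```

With these made-up scores, `pareto_front(example_scores)` keeps `(0.05, 10.0)` and `(0.04, 12.0)`, since each of the other two candidates is matched or beaten on both objectives by one of them. A multi-objective search keeps such a set of trade-offs rather than a single best architecture.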
Pages: 778-786
Number of pages: 9