An Automatic Convolutional Neural Network Optimization Using a Diversity-Guided Genetic Algorithm

被引:9
作者
Fatyanosa, Tirana Noor [1 ]
Aritsugi, Masayoshi [2 ]
机构
[1] Kumamoto Univ, Grad Sch Sci & Technol, Kumamoto 8608555, Japan
[2] Kumamoto Univ, Fac Adv Sci & Technol, Kumamoto 8608555, Japan
关键词
Computer architecture; Statistics; Sociology; Genetic algorithms; Optimization; Neural networks; Convergence; Convolutional neural networks; genetic algorithms; hyperparameter optimization; text classification; CROSSOVER; MUTATION; IDENTIFICATION; CLASSIFICATION; PROBABILITIES;
D O I
10.1109/ACCESS.2021.3091729
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hyperparameters and architecture greatly influence the performance of convolutional neural networks (CNNs); therefore, their optimization is important to obtain the desired results. One of the state-of-the-art methods to achieve this is the use of neuroevolution that utilizes a genetic algorithm (GA) to optimize a CNN. However, the GA is often trapped into a local optimum resulting in premature convergence. In this study, we propose an approach called the "diversity-guided genetic algorithm-convolutional neural network (DGGA-CNN)" that uses adaptive parameter control and random injection to facilitate the search process by exploration and exploitation while preserving the population diversity. The alternation between exploration and exploitation is guided by using an average pairwise Hamming distance. Moreover, the DGGA fully handles the architecture of the CNN by using a novel finite state machine (FSM) combined with three novel mutation mechanisms that are specifically created for architecture chromosomes. Tests conducted on suggestion mining and twitter airline datasets reveal that the DGGA-CNN performs well with valid architectures and a comparison with other methods demonstrates its capability and efficiency.
引用
收藏
页码:91410 / 91426
页数:17
相关论文
共 79 条
[61]  
Snoek J., 2012, Neural Information Processing Systems, P2951
[62]  
Spears W. M, 1998, THESIS DEPT COMP SCI
[63]   ADAPTIVE PROBABILITIES OF CROSSOVER AND MUTATION IN GENETIC ALGORITHMS [J].
SRINIVAS, M ;
PATNAIK, LM .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (04) :656-667
[64]  
Srivastava N, 2014, J MACH LEARN RES, V15, P1929
[65]  
Sudarto S, 2019, J INFOR TECHNOL COMP, V4, P42
[66]   Evolution of Deep Convolutional Neural Networks Using Cartesian Genetic Programming [J].
Suganuma, Masanori ;
Kobayashi, Masayuki ;
Shirakawa, Shinichi ;
Nagao, Tomoharu .
EVOLUTIONARY COMPUTATION, 2020, 28 (01) :141-163
[67]   Automatically Designing CNN Architectures Using the Genetic Algorithm for Image Classification [J].
Sun, Yanan ;
Xue, Bing ;
Zhang, Mengjie ;
Yen, Gary G. ;
Lv, Jiancheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) :3840-3854
[68]  
Ursem R. K., 2002, Parallel Problem Solving from Nature - PPSN VII. 7th International Conference. Proceedings (Lecture Notes in Computer Science Vol.2439), P462
[69]   Hyperparameter Importance Across Datasets [J].
van Rijn, Jan N. ;
Hutter, Frank .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :2367-2376
[70]   Fuzzy adaptation of crossover and mutation rates in genetic algorithms based on population performance [J].
Vannucci, Marco ;
Colla, Valentina .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (04) :1805-1818