Multi-objective simulated annealing for hyper-parameter optimization in convolutional neural networks

被引:10
|
作者
Gulcu, Ayla [1 ]
Kus, Zeki [1 ]
机构
[1] Fatih Sultan Mehmet Univ, Comp Sci, Istanbul, Turkey
关键词
Multi-objective; Simulated annealing; Convolutional neural networks; Hyper-parameter optimization; EVOLUTIONARY ALGORITHMS; RANDOM SEARCH;
D O I
10.7717/peerj-cs.338
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we model a CNN hyper-parameter optimization problem as a bi-criteria optimization problem, where the first objective being the classification accuracy and the second objective being the computational complexity which is measured in terms of the number of floating point operations. For this bi-criteria optimization problem, we develop a Multi-Objective Simulated Annealing (MOSA) algorithm for obtaining high-quality solutions in terms of both objectives. CIFAR-10 is selected as the benchmark dataset, and the MOSA trade-off fronts obtained for this dataset are compared to the fronts generated by a single-objective Simulated Annealing (SA) algorithm with respect to several front evaluation metrics such as generational distance, spacing and spread. The comparison results suggest that the MOSA algorithm is able to search the objective space more effectively than the SA method. For each of these methods, some front solutions are selected for longer training in order to see their actual performance on the original test set. Again, the results state that the MOSA performs better than the SA under multi-objective setting. The performance of the MOSA configurations are also compared to other search generated and human designed state-of-the-art architectures. It is shown that the network configurations generated by the MOSA are not dominated by those architectures, and the proposed method can be of great use when the computational complexity is as important as the test accuracy.
引用
收藏
页码:2 / 27
页数:27
相关论文
共 50 条
  • [1] Neural Networks Designing Neural Networks: Multi-Objective Hyper-Parameter Optimization
    Smithson, Sean C.
    Yang, Guang
    Gross, Warren J.
    Meyer, Brett H.
    2016 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2016,
  • [2] USING METAHEURISTICS FOR HYPER-PARAMETER OPTIMIZATION OF CONVOLUTIONAL NEURAL NETWORKS
    Bibaeva, Victoria
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [3] Learning networks hyper-parameter using multi-objective optimization of statistical performance metrics
    Torres, Guillermo
    Sanchez, Carles
    Gil, Debora
    2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 233 - 238
  • [4] Hyper-Parameter Selection in Convolutional Neural Networks Using Microcanonical Optimization Algorithm
    Gulcu, Ayla
    Kus, Zeki
    IEEE ACCESS, 2020, 8 : 52528 - 52540
  • [5] HYPER-PARAMETER OPTIMIZATION OF DEEP CONVOLUTIONAL NETWORKS FOR OBJECT RECOGNITION
    Talathi, Sachin S.
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3982 - 3986
  • [6] Application of a Stochastic Schemata Exploiter for Multi-Objective Hyper-parameter Optimization of Machine Learning
    Makino, Hiroya
    Kita, Eisuke
    REVIEW OF SOCIONETWORK STRATEGIES, 2023, 17 (02): : 179 - 213
  • [7] Application of a Stochastic Schemata Exploiter for Multi-Objective Hyper-parameter Optimization of Machine Learning
    Hiroya Makino
    Eisuke Kita
    The Review of Socionetwork Strategies, 2023, 17 : 179 - 213
  • [8] Probabilistic Sequential Multi-Objective Optimization of Convolutional Neural Networks
    Yin, Zixuan
    Gross, Warren
    Meyer, Brett H.
    PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 1055 - 1060
  • [9] HYPER-PARAMETER OPTIMIZATION FOR CONVOLUTIONAL NEURAL NETWORK COMMITTEES BASED ON EVOLUTIONARY ALGORITHMS
    Bochinski, Erik
    Senst, Tobias
    Sikora, Thomas
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3924 - 3928
  • [10] Annealing of Monel 400 Alloy Using Principal Component Analysis, Hyper-parameter Optimization, Machine Learning Techniques, and Multi-objective Particle Swarm Optimization
    Chintakindi, Sanjay
    Alsamhan, Ali
    Abidi, Mustufa Haider
    Kumar, Maduri Praveen
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2022, 15 (01)