Evolutionary Neural Architecture Search with Predictor of Ranking-Based Score

被引:0
作者
Jiang, Peng-Cheng [1 ]
Xue, Yu [1 ]
机构
[1] School of Software, Nanjing University of Information Science and Technology, Nanjing
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2024年 / 47卷 / 11期
关键词
evolutionary computation; genetic algorithms; neural architecture search; ranking prediction; score prediction; surrogate models;
D O I
10.11897/SP.J1016.2024.02522
中图分类号
学科分类号
摘要
The exceptional performance of neural networks has been extensively validated across various practical applications, with architecture serving as the primary determinant of their efficacy. Presently, the state-of-the-art architectures necessitate manual design, heavily relying on expert experience and iterative trial-and-error methodologies for performance validation. In recent years, the emergence of Evolutionary Neural Architecture Search (ENAS) has alleviated the burden associated with manual design. However, despite the ability of ENAS methods to autonomously identify superior architectures, their widespread application remains impeded by the substantial time and computational resources required. Surrogate models can mitigate this excessive resource consumption to some extent. However, existing surrogate model-assisted evolutionary neural architecture searches fail to fully integrate the search and surrogate processes. Moreover, it is difficult for the current surrogate methods to accurately predict network architectural rankings with similar accuracies. Furthermore, existing surrogate models typically necessitate substantial amounts of architectural information as training data to attain satisfactory surrogate accuracy. These limitations hinder the effective assistance of surrogate models in ENAS. thereby constraining its advancement. In this paper, we propose a Rank Score Predictor-assisted Evolutionary Neural Architecture Search method (RSP-ENAS). By introducing a novel loss function specifically designed for rank score prediction, the Multi-Layer Perceptron (MLP) employed as a score predictor can optimally align the ranking of individual performance scores within the population with their actual performance order. During the search process utilizing this method, the predicted scores are directly applicable for elite selection. We introduce a two-stage search strategy in the search phase, initially focusing on accumulating historical information for the surrogate dataset from evaluating a small population and subsequently emphasizing the use of the surrogate model to predict fitness values for a larger population in the later stages. The experiments conducted in this study were performed on the EvoXBench platform, yielding superior results across all benchmark datasets. Additionally, we validated our approach on the ImageNet dataset. Compared to alternative methodologies, our approach successfully identifies the optimal architecture within the NASBench-101 space. On the three datasets within the NASBench-201 space, accuracy improvements of 0. 35%. 1. 12%, and 0. 55% were achieved relative to other optimal methods. In experiments utilizing real datasets on ImageNet, our method demonstrated a 2. 2% enhancement in classification accuracy. Moreover, with the same quantity of data, the ranking results generated by the proposed rank score prediction model exhibited a 1. 55% improvement in Kendall's Tau coefficient when compared to other optimal approaches. We further validated the effectiveness of One-hot encoding and the proposed rank loss within the surrogate model, demonstrating the efficacy of these two components for the overall algorithm. This research underscores the potential of advanced surrogate models to enhance the efficiency and accuracy of neural architecture search processes. By reducing computational costs and improving the precision of architecture rankings, our RSP-ENAS method could significantly advance the practical application and accessibility of neural network design, potentially catalyzing more rapid advancements in the fields of machine learning and artificial intelligence. Future work may explore the surrogate models for less training data, which could yield even more substantial enhancements in neural architecture search methodologies. © 2024 Science Press. All rights reserved.
引用
收藏
页码:2522 / 2535
页数:13
相关论文
共 74 条
[21]  
Huang J, Xue B, Sun Y, Zhang M., EDE-NAS: An eclectic differential evolution approach to single- path neural architecture search, Proceedings of the Australasian Joint Conference on Artificial Intelligence, 13728, pp. 116-130, (2022)
[22]  
Sun Y, Xue B, Zhang M, Yen G G., Completely automated CNN architecture design based on blocks, IEEE Transactions on Neural Networks and Learning Systems, 31, 4, pp. 1242-1254, (2020)
[23]  
Sun Y, Xue B, Zhang M, Et al., Automatically designing CNN architectures using the genetic algorithm for image classification, IEEE Transactions on Cybernetics, 50, 9, pp. 3840-3854, (2020)
[24]  
Chu X, Zhang B, Xu R., FairNAS
[25]  
Rethinking evaluation fairness of weight sharing neural architecture search, Proceedings of the IEEE/CVF International Conference on computer vision, pp. 12219-12228, (2021)
[26]  
Zhang H, Jin Y, Hao K., Evolutionary search for complete neural network architectures with partial weight sharing, IEEE Transactions on Evolutionary Computation, 26, 5, pp. 1072-1086, (2022)
[27]  
Lu Z, Deb K, ioodman E, Et al., NSGANetV2. Evolutionary multi-objective surrogate-assisted neural architecture search, Proceedings of the European Conference on Computer Vision, pp. 35-51, (2020)
[28]  
Huang J, Xue B, Sun Y, Et al., Split-level evolutionary neural architecture search with elite weight inheritance, IEEE Transactions on Neural Networks and Learning Systems, pp. 1-15, (2023)
[29]  
Sun Y, Wang H, Xue B, Et al., Surrogate-assisted evolutionary deep learning using an end-to-end random forest-based performance predictor, IEEE Transactions on Evolutionary Computation, 24, 2, pp. 350-364, (2020)
[30]  
Guo B, Chen T, He S, Et al., Generalized global ranking-aware neural architecture ranker for efficient image classifier search, Proceedings of the 30th ACM International Conference on Multimedia, pp. 3730-3741, (2022)