Bandit-NAS: Bandit sampling and training method for Neural Architecture Search

被引:8
作者
Lin, Yiqi [1 ]
Endo, Yuki [1 ]
Lee, Jinho [1 ]
Kamijo, Shunsuke [1 ]
机构
[1] Univ Tokyo, 3-8-1 Komaba, Tokyo, Japan
关键词
Neural Architecture Search; Reinforcement learning; Bandit algorithm;
D O I
10.1016/j.neucom.2024.127684
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing Neural Architecture Search algorithms achieve a low error rate in vision tasks, such as image classification, by training child networks with equal resources during the search. However, it is unnecessary to allocate equal resources or fully converge scores to assess which child architectures should be adopted, resulting in computational redundancy. In this study, we present Bandit-NAS, an approach that automatically computes data slicing and training time for each child network. Firstly, we formulate the search for the optimal training time for a given resource as an M -armed bandit problem. Secondly, we extend the original NAS methods by proposing an end -to -end bandit algorithm, combined with reinforcement learning -based NAS algorithms, to determine an update strategy. Bandit-NAS enables simultaneous training of M child networks within a specified resource constraint (one epoch training time), with the allocation of training data based on the current accuracy of the child networks, thereby minimizing their error rate. Experimental results on 3 different datasets, MNIST , CIFAR-10 and CIFAR-100 demonstrate the superiority of Bandit-NAS over baseline NAS algorithms, such as ENAS and DQNAS, achieving lower error rates and faster search time.
引用
收藏
页数:13
相关论文
共 42 条
[1]  
Ba J, 2014, ACS SYM SER
[2]  
Baker B, 2017, Arxiv, DOI arXiv:1611.02167
[3]  
Brock A, 2017, Arxiv, DOI [arXiv:1708.05344, DOI 10.48550/ARXIV.1708.05344]
[4]  
Chauhan A., 2023, arXiv, DOI DOI 10.48550/ARXIV.2301.06687
[5]  
Chen X, 2020, Arxiv, DOI arXiv:1912.10952
[6]  
Deng BY, 2017, Arxiv, DOI arXiv:1712.03351
[7]  
DeVries T, 2017, Arxiv, DOI arXiv:1708.04552
[8]   One-Shot Neural Architecture Search via Self-Evaluated Template Network [J].
Dong, Xuanyi ;
Yang, Yi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3680-3689
[9]  
Dudziak Lukasz, 2020, Advances in Neural Information Processing Systems, V33, P10480
[10]  
Gastaldi X., 2017, arXiv