Bandit-NAS: Bandit Sampling Method for Neural Architecture Search

被引:0
作者
Lin, Yiqi [1 ]
Wang, Ru [1 ]
机构
[1] Univ Tokyo, Tokyo, Japan
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
Neural Architecture Search; Reinforcement Learning; Bandit Algorithm;
D O I
10.1109/IJCNN54540.2023.10191003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing NAS (Neural Architecture Search) algorithms achieve a low error rate on vision tasks such as image classification by training each child network with equal resources during the search. However, it is not necessary to train with the equal resource or use the fully converge score to obtain the relative performance of each child network, and there is computational redundancy in training all child networks with the equal resource. In this paper, we propose Bandit-NAS to automatically compute the required data slicing and training time for each child network. i): We first model the search of the best child network training time for a given resource into an M-armed bandit problem. ii): Then we propose a reward-flexible bandit algorithm in conjunction with existing reinforcement learning-based NAS algorithms to determine an update strategy. The proposed Bandit-NAS can train M child networks simultaneously under a given resource constraint (training time for one epoch), and the amount of training data is allocated according to the current accuracy of the child networks, thus minimizing the error rate of the child networks. Experiments on CIFAR-10 show that proposed Bandit-NAS performs better the baseline NAS algorithm, e.g., ENAS, with lower error rate and faster searching time.
引用
收藏
页数:8
相关论文
共 38 条
  • [1] Brock A, 2017, Arxiv, DOI arXiv:1708.05344
  • [2] Deng BY, 2017, Arxiv, DOI arXiv:1712.03351
  • [3] DeVries T, 2017, Arxiv, DOI arXiv:1708.04552
  • [4] One-Shot Neural Architecture Search via Self-Evaluated Template Network
    Dong, Xuanyi
    Yang, Yi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3680 - 3689
  • [5] Dudziak L., 2020, Advances in Neural Information Processing Systems, V33, P10480
  • [6] Gastaldi X., 2017, arXiv
  • [7] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [8] Hanlin Chen, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12358), P70, DOI 10.1007/978-3-030-58601-0_5
  • [9] Hataya R, 2020, EUROPEAN C COMPUTER, P1
  • [10] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269