Sparse Gate for Differentiable Architecture Search

被引:1
作者
Fan, Liang [1 ]
Wang, Handing [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
基金
中国国家自然科学基金;
关键词
Differentiable architecture search; gating network; Routing strategy; discretization errors;
D O I
10.1109/IJCNN54540.2023.10191908
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Differentiable architecture search is now one of mainstream methods to design the structure of neural networks. It makes the neural architecture search efficient through parameter sharing and differentiable search. However, there are still many significant challenges for designing the optimal architecture, including the phenomenon of skip connection aggregation, excessive memory usage, and large discretization errors. To address these issues, we propose a novel approach called Gate-DARTS where a sparse gating network is introduced to route each input sample to the best k operators. This can reduce the memory requirements of the algorithm and break the residual structure. For the issue of discretization error, we then propose a auxiliary loss to enlarge the difference between different operators. We conduct comprehensive experiments on DARTS-like search space, and Gate-DARTS achieves 97.45% test accuracy on CIFAR10 with 0.23 GPU-days, 83.82% on CIFAR100 with 0.28 GPU-days. Our code has been made available at https://github.com/HandingWangXDGroup/Gate-DARTS.
引用
收藏
页数:8
相关论文
共 37 条
  • [1] [Anonymous], 2022, COMPLEX INTELLIGENT, DOI DOI 10.1007/S12144-022-03233-5
  • [2] Bender G, 2018, PR MACH LEARN RES, V80
  • [3] Bi K., 2020, ARXIV200703331
  • [4] Brock A., 2018, P INT C LEARN REPR
  • [5] Cai H., 2018, Proxylessnas: direct neural architecture search on target task and hardware
  • [6] AutoFormer: Searching Transformers for Visual Recognition
    Chen, Minghao
    Peng, Houwen
    Fu, Jianlong
    Ling, Haibin
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12250 - 12260
  • [7] Chen X., 2020, PMLR, P1554
  • [8] Progressive Differentiable Architecture Search: Bridging the Depth Gap between Search and Evaluation
    Chen, Xin
    Xie, Lingxi
    Wu, Jun
    Tian, Qi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1294 - 1303
  • [9] Chu X., 2020, arXiv preprint arXiv:2009.01027, P1
  • [10] Chu X., 2020, EUR C COMPUT VIS, P465, DOI 10.1007/978-3-030-58555-6_28