EPC-DARTS: Efficient partial channel connection for differentiable architecture search

被引:5
作者
Cai, Zicheng [1 ]
Chen, Lei [1 ]
Liu, Hai-Lin [1 ]
机构
[1] Guangdong Univ Technol, Sch Math & Stat, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural architecture search; Partial channel connection; Efficient channel attention;
D O I
10.1016/j.neunet.2023.07.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With weight-sharing and continuous relaxation strategies, the differentiable architecture search (DARTS) proposes a fast and effective solution to perform neural network architecture search in various deep learning tasks. However, unresolved issues, such as the inefficient memory utilization, and the poor stability of the search architecture due to channels randomly selected, which has even caused performance collapses, are still perplexing researchers and practitioners. In this paper, a novel efficient channel attention mechanism based on partial channel connection for differentiable neural architecture search, termed EPC-DARTS, is proposed to address these two issues. Specifically, we design an efficient channel attention module, which is applied to capture cross-channel interactions and assign weight based on channel importance, to dramatically improve search efficiency and reduce memory occupation. Moreover, only partial channels with higher weights in the mixed calculation of operation are used through the efficient channel attention mechanism, and thus unstable network architectures obtained by the random selection operation can also be avoided in the proposed EPC-DARTS. Experimental results show that the proposed EPC-DARTS achieves remarkably competitive performance (CIFAR-10/CIFAR-100: a test accuracy rate of 97.60%/84.02%), compared to other state-of-the-art NAS methods using only 0.2 GPU-Days.& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:344 / 353
页数:10
相关论文
共 38 条
  • [1] Cai H, 2019, Arxiv, DOI arXiv:1812.00332
  • [2] Progressive Differentiable Architecture Search: Bridging the Depth Gap between Search and Evaluation
    Chen, Xin
    Xie, Lingxi
    Wu, Jun
    Tian, Qi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1294 - 1303
  • [3] Chu XX, 2021, Arxiv, DOI arXiv:2009.01027
  • [4] Searching for A Robust Neural Architecture in Four GPU Hours
    Dong, Xuanyi
    Yang, Yi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1761 - 1770
  • [5] LIVECell-A large-scale dataset for label-free live cell segmentation
    Edlund, Christoffer
    Jackson, Timothy R.
    Khalid, Nabeel
    Bevan, Nicola
    Dale, Timothy
    Dengel, Andreas
    Ahmed, Sheraz
    Trygg, Johan
    Sjoegren, Rickard
    [J]. NATURE METHODS, 2021, 18 (09) : 1038 - +
  • [6] Feurer M, 2015, ADV NEUR IN, V28
  • [7] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [8] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
  • [9] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [10] Jain J., 2021, arXiv