MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search

被引:0
作者
Machida, Kengo [1 ]
Uto, Kuniaki [1 ]
Shinoda, Koichi [1 ]
Suzuki, Taiji [2 ,3 ]
机构
[1] Tokyo Inst Technol, Sch Comp, Tokyo, Japan
[2] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[3] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
来源
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年
关键词
neural architecture search; stable rank; generalization;
D O I
10.1109/IJCNN55064.2022.9892751
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In neural architecture search (NAS), differentiable architecture search (DARTS) has recently attracted much attention due to its high efficiency. However, this method finds a model with the weights converging faster than the others, and such a model with fastest convergence often leads to overfitting. Accordingly, the resulting model cannot always be well-generalized. To overcome this problem, we propose a method called minimum stable rank DARTS (MSR-DARTS), for finding a model with the best generalization error by replacing architecture optimization with the selection process using the minimum stable rank criterion. Specifically, a convolution operator is represented by a matrix, and MSR-DARTS selects the one with the smallest stable rank. We evaluated MSR-DARTS on CIFAR-10 and ImageNet datasets. It achieves an error rate of 2.54% with 4.0M parameters within 0.3 GPU-days on CIFAR-10, and a top-1 error rate of 23.9% on ImageNet.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] D-DARTS: Distributed Differentiable Architecture Search
    Heuillet, Alexandre
    Tabia, Hedi
    Arioui, Hichem
    Youcef-Toumi, Kamal
    PATTERN RECOGNITION LETTERS, 2023, 176 : 42 - 48
  • [2] D-DARTS: Distributed Differentiable Architecture Search
    Heuillet, Alexandre
    Tabia, Hedi
    Arioui, Hichem
    Youcef-Toumi, Kamal
    PATTERN RECOGNITION LETTERS, 2023, 176 : 42 - 48
  • [3] μDARTS: Model Uncertainty-Aware Differentiable Architecture Search
    Chakraborty, Biswadeep
    Mukhopadhyay, Saibal
    IEEE ACCESS, 2022, 10 : 98670 - 98682
  • [4] Att-DARTS: Differentiable Neural Architecture Search for Attention
    Nakai, Kohei
    Matsubara, Takashi
    Uehara, Kuniaki
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] STO-DARTS: Stochastic Bilevel Optimization for Differentiable Neural Architecture Search
    Cai, Zicheng
    Chen, Lei
    Ling, Tongtao
    Liu, Hai-Lin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03): : 2324 - 2335
  • [6] EPC-DARTS: Efficient partial channel connection for differentiable architecture search
    Cai, Zicheng
    Chen, Lei
    Liu, Hai-Lin
    NEURAL NETWORKS, 2023, 166 : 344 - 353
  • [7] LMD-DARTS: Low-Memory, Densely Connected, Differentiable Architecture Search
    Li, Zhongnian
    Xu, Yixin
    Ying, Peng
    Chen, Hu
    Sun, Renke
    Xu, Xinzheng
    ELECTRONICS, 2024, 13 (14)
  • [8] DARTS-PAP: Differentiable Neural Architecture Search by Polarization of Instance Complexity Weighted Architecture Parameters
    Li, Yunhong
    Li, Shuai
    Yu, Zhenhua
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 277 - 288
  • [9] FP-DARTS: Fast parallel differentiable neural architecture search for image classification
    Wang, Wenna
    Zhang, Xiuwei
    Cui, Hengfei
    Yin, Hanlin
    Zhang, Yannnig
    PATTERN RECOGNITION, 2023, 136
  • [10] MR-DARTS: Restricted connectivity differentiable architecture search in multi-path search space
    Gao, Feng
    Song, Bin
    Wang, Dan
    Qin, Hao
    NEUROCOMPUTING, 2022, 482 : 27 - 39