NDARTS: A Differentiable Architecture Search Based on the Neumann Series

被引:0
作者
Han, Xiaoyu [1 ]
Li, Chenyu [1 ]
Wang, Zifan [1 ]
Liu, Guohua [1 ]
机构
[1] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
关键词
neural network; neural architecture search; DARTS; Neumann series;
D O I
10.3390/a16120536
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural architecture search (NAS) has shown great potential in discovering powerful and flexible network models, becoming an important branch of automatic machine learning (AutoML). Although search methods based on reinforcement learning and evolutionary algorithms can find high-performance architectures, these search methods typically require hundreds of GPU days. Unlike searching in a discrete search space based on reinforcement learning and evolutionary algorithms, the differentiable neural architecture search (DARTS) continuously relaxes the search space, allowing for optimization using gradient-based methods. Based on DARTS, we propose NDARTS in this article. The new algorithm uses the Implicit Function Theorem and the Neumann series to approximate the hyper-gradient, which obtains better results than DARTS. In the simulation experiment, an ablation experiment was carried out to study the influence of the different parameters on the NDARTS algorithm and to determine the optimal weight, then the best performance of the NDARTS algorithm was searched for in the DARTS search space and the NAS-BENCH-201 search space. Compared with other NAS algorithms, the results showed that NDARTS achieved excellent results on the CIFAR-10, CIFAR-100, and ImageNet datasets, and was an effective neural architecture search algorithm.
引用
收藏
页数:25
相关论文
共 42 条
[1]  
Baker B, 2017, Arxiv, DOI arXiv:1611.02167
[2]  
Cai H, 2017, Arxiv, DOI [arXiv:1707.04873, 10.48550/arXiv.1707.04873]
[3]  
Cai H, 2019, Arxiv, DOI arXiv:1812.00332
[4]  
Chen XN, 2021, Arxiv, DOI arXiv:2002.05283
[5]  
Chen X, 2019, Arxiv, DOI arXiv:1904.12760
[6]  
Chu XX, 2020, Arxiv, DOI [arXiv:1911.12126, DOI 10.1007/978-3-030-58555-628]
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Dong XY, 2020, Arxiv, DOI arXiv:2001.00326
[9]   One-Shot Neural Architecture Search via Self-Evaluated Template Network [J].
Dong, Xuanyi ;
Yang, Yi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3680-3689
[10]   Searching for A Robust Neural Architecture in Four GPU Hours [J].
Dong, Xuanyi ;
Yang, Yi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1761-1770