CARS: Continuous Evolution for Efficient Neural Architecture Search

被引:197
作者
Yang, Zhaohui [1 ,2 ]
Wang, Yunhe [2 ]
Chen, Xinghao [2 ]
Shi, Boxin [3 ,4 ]
Xu, Chao [1 ]
Xu, Chunjing [2 ]
Tian, Qi [2 ]
Xu, Chang [5 ]
机构
[1] Peking Univ, Dept Machine Intelligence, Key Lab Machine Percept MOE, Beijing, Peoples R China
[2] Huawei Technol, Noahs Ark Lab, Shenzhen, Peoples R China
[3] Peking Univ, NELVT, Dept CS, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
[5] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW, Australia
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
D O I
10.1109/CVPR42600.2020.00190
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Searching techniques in most of existing neural architecture search (NAS) algorithms are mainly dominated by differentiable methods for the efficiency reason. In contrast, we develop an efficient continuous evolutionary approach for searching neural networks. Architectures in the population that share parameters within one SuperNet in the latest generation will be tuned over the training dataset with a few epochs. The searching in the next evolution generation will directly inherit both the SuperNet and the population, which accelerates the optimal network generation. The non-dominated sorting strategy is further applied to preserve only results on the Pareto front for accurately updating the SuperNet. Several neural networks with different model sizes and performances will be produced after the continuous search with only 0.4 GPU days. As a result, our framework provides a series of networks with the number of parameters ranging from 3.7M to 5.1M under mobile settings. These networks surpass those produced by the stateof-the-art methods on the benchmark ImageNet dataset.
引用
收藏
页码:1826 / 1835
页数:10
相关论文
共 61 条
[1]  
[Anonymous], 2019, PROBABILISTIC NEURAL
[2]  
[Anonymous], 2017, ICLR
[3]  
[Anonymous], 2018, ICLR
[4]  
Bender G, 2018, PR MACH LEARN RES, V80
[5]  
Cai H., 2019, INT C LEARNING REPRE
[6]   Data-Free Learning of Student Networks [J].
Chen, Hanting ;
Wang, Yunhe ;
Xu, Chang ;
Yang, Zhaohui ;
Liu, Chuanjian ;
Shi, Boxin ;
Xu, Chunjing ;
Xu, Chao ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3513-3521
[7]   An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints [J].
Deb, Kalyanmoy ;
Jain, Himanshu .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (04) :577-601
[8]  
Dong Jin-Dong, 2018, ECCV
[9]   Searching for A Robust Neural Architecture in Four GPU Hours [J].
Dong, Xuanyi ;
Yang, Yi .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1761-1770
[10]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448