Can weight sharing outperform random architecture search? An investigation with TuNAS

被引:72
作者
Bender, Gabriel [1 ]
Liu, Hanxiao [1 ]
Chen, Bo [1 ]
Chu, Grace [1 ]
Cheng, Shuyang [2 ]
Kindermans, Pieter-Jan [1 ]
Le, Quoc [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Waymo, Mountain View, CA USA
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.01433
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient Neural Architecture Search methods based on weight sharing have shown good promise in democratizing Neural Architecture Search for computer vision models. There is, however, an ongoing debate whether these efficient methods are significantly better than random search. Here we perform a thorough comparison between efficient and random search methods on a family of progressively larger and more challenging search spaces for image classification and detection on ImageNet and COCO. While the efficacies of both methods are problem-dependent, our experiments demonstrate that there are large, realistic tasks where efficient search methods can provide substantial gains over random search. In addition, we propose and evaluate techniques which improve the quality of searched architectures and reduce the need for manual hyper-parameter tuning.
引用
收藏
页码:14311 / 14320
页数:10
相关论文
共 45 条
[1]  
Baker B., 2017, INT C LEARN REPR, P1
[2]  
Baker B, 2017, ARXIV
[3]  
Bender G, 2018, PR MACH LEARN RES, V80
[4]  
Brock A., 2018, INT C LEARN REPR, P1, DOI DOI 10.1145/3205873.3205877
[5]  
Cai Han, 2018, ARXIV
[6]  
Chen Yukang, 2019, Adv. Neural Inform. Process. Syst. (NeurIPS)
[7]  
Cui Jiequan, 2019, INT C COMP VIS
[8]   ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation [J].
Dai, Xiaoliang ;
Zhang, Peizhao ;
Wu, Bichen ;
Yin, Hongxu ;
Sun, Fei ;
Wang, Yanghan ;
Dukhan, Marat ;
Hu, Yunqing ;
Wu, Yiming ;
Jia, Yangqing ;
Vajda, Peter ;
Uyttendaele, Matt ;
Jha, Niraj K. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11390-11399
[9]  
Fedorov Igor, 2019, ARXIV
[10]   NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].
Ghiasi, Golnaz ;
Lin, Tsung-Yi ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038