Can weight sharing outperform random architecture search? An investigation with TuNAS

被引:72
作者
Bender, Gabriel [1 ]
Liu, Hanxiao [1 ]
Chen, Bo [1 ]
Chu, Grace [1 ]
Cheng, Shuyang [2 ]
Kindermans, Pieter-Jan [1 ]
Le, Quoc [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Waymo, Mountain View, CA USA
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.01433
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient Neural Architecture Search methods based on weight sharing have shown good promise in democratizing Neural Architecture Search for computer vision models. There is, however, an ongoing debate whether these efficient methods are significantly better than random search. Here we perform a thorough comparison between efficient and random search methods on a family of progressively larger and more challenging search spaces for image classification and detection on ImageNet and COCO. While the efficacies of both methods are problem-dependent, our experiments demonstrate that there are large, realistic tasks where efficient search methods can provide substantial gains over random search. In addition, we propose and evaluate techniques which improve the quality of searched architectures and reduce the need for manual hyper-parameter tuning.
引用
收藏
页码:14311 / 14320
页数:10
相关论文
共 45 条
[31]  
Real E, 2019, AAAI CONF ARTIF INTE, P4780
[32]   MobileNetV2: Inverted Residuals and Linear Bottlenecks [J].
Sandler, Mark ;
Howard, Andrew ;
Zhu, Menglong ;
Zhmoginov, Andrey ;
Chen, Liang-Chieh .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4510-4520
[33]  
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[34]  
Stamoulis Dimitrios, 2019, ARXIV
[35]  
Tan MX, 2019, PROC CVPR IEEE, P2815, DOI [arXiv:1807.11626, 10.1109/CVPR.2019.00293]
[36]  
WILLIAMS RJ, 1992, MACH LEARN, V8, P229, DOI 10.1007/BF00992696
[37]   FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search [J].
Wu, Bichen ;
Dai, Xiaoliang ;
Zhang, Peizhao ;
Wang, Yanghan ;
Sun, Fei ;
Wu, Yiming ;
Tian, Yuandong ;
Vajda, Peter ;
Jia, Yangqing ;
Keutzer, Kurt .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10726-10734
[38]  
Xie S., 2019, INT C LEARN REPR
[39]  
Xie Saining, 2019, arXiv
[40]   NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications [J].
Yang, Tien-Ju ;
Howard, Andrew ;
Chen, Bo ;
Zhang, Xiao ;
Go, Alec ;
Sandler, Mark ;
Sze, Vivienne ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :289-304