Searching the Deployable Convolution Neural Networks for GPUs

Cited by: 2
Authors
Wang, Linnan [1]
Yu, Chenhan [1]
Salian, Satish [1]
Kierat, Slawomir [1]
Migacz, Szymon [1]
Florea, Alex Fit [1]
Affiliations
[1] NVIDIA, Santa Clara, CA 95051 USA
Source
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2022
DOI
10.1109/CVPR52688.2022.01191
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Customizing Convolution Neural Networks (CNNs) for production use has been a challenging task for DL practitioners. This paper aims to expedite model customization with a model hub that contains optimized models, found by Neural Architecture Search (NAS) and tiered by their inference latency. To achieve this goal, we build a distributed NAS system to search a novel search space that consists of the prominent factors that impact latency and accuracy. Since we target GPUs, we name the NAS-optimized models GPUNet, which establishes a new SOTA Pareto frontier in inference latency and accuracy. Within 1 ms, GPUNet is 2x faster than EfficientNet-X and FBNetV3 with even better accuracy. We also validate GPUNet on detection tasks, where it consistently outperforms EfficientNet-X and FBNetV3 on COCO detection in both latency and accuracy. These results indicate that our NAS system is effective and generic enough to handle different design tasks. With this NAS system, we expand GPUNet to cover a wide range of latency targets so that DL practitioners can deploy our models directly in different scenarios.
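The abstract refers to a Pareto frontier in latency and accuracy and to models tiered by latency. The following is a minimal sketch of those two ideas only, not the paper's NAS system: the `Candidate` type, the function names, the model names, and all numbers are hypothetical and chosen purely for illustration.

```python
from typing import NamedTuple, Optional


class Candidate(NamedTuple):
    """A hypothetical searched model with measured latency and accuracy."""
    name: str
    latency_ms: float
    accuracy: float


def pareto_frontier(candidates: list[Candidate]) -> list[Candidate]:
    """Keep candidates that no other candidate beats on both latency and accuracy."""
    frontier: list[Candidate] = []
    best_accuracy = float("-inf")
    # Sort by latency ascending; break ties by accuracy descending.
    for c in sorted(candidates, key=lambda c: (c.latency_ms, -c.accuracy)):
        # A slower model stays only if it is strictly more accurate
        # than every faster model seen so far.
        if c.accuracy > best_accuracy:
            frontier.append(c)
            best_accuracy = c.accuracy
    return frontier


def tier_by_latency(
    frontier: list[Candidate], budgets_ms: list[float]
) -> dict[float, Optional[Candidate]]:
    """For each latency budget, pick the most accurate frontier model that fits."""
    tiers: dict[float, Optional[Candidate]] = {}
    for budget in budgets_ms:
        eligible = [c for c in frontier if c.latency_ms <= budget]
        tiers[budget] = max(eligible, key=lambda c: c.accuracy) if eligible else None
    return tiers


# Made-up measurements, not results from the paper.
candidates = [
    Candidate("net_a", 0.5, 0.70),
    Candidate("net_b", 0.8, 0.78),
    Candidate("net_c", 1.0, 0.76),  # dominated by net_b (slower and less accurate)
    Candidate("net_d", 1.5, 0.80),
    Candidate("net_e", 2.0, 0.79),  # dominated by net_d
]

frontier = pareto_frontier(candidates)
tiers = tier_by_latency(frontier, [0.6, 1.0, 2.0])
```

With these illustrative values, `frontier` holds `net_a`, `net_b`, and `net_d`, and each latency budget maps to the most accurate frontier model that fits within it.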
Pages: 12217-12226
Page count: 10