Optimizing Deep Learning Models for Object Detection

被引:0
作者
Barburescu, Calin-George [1 ]
Iuhasz, Gabriel [1 ]
机构
[1] West Univ Timisoara, Timisoara, Romania
来源
2020 22ND INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2020) | 2020年
关键词
object detection; yolov3; hfp8; hyperparameter tuning; experimental;
D O I
10.1109/SYNASC51798.2020.00051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models for object detection have gotten larger and larger over the years, spanning from 3.9M trainable parameters for EfficientDet to 209M for the AmoebaNet-based NAS-FPN detector. Different strategies are currently being researched in order to improve the efficiency of deep learning models for object detection, one of which is running the training and inference of the neural network in low precision. Interesting results have been achieved by researchers, starting from the original paradigm of using operators and doing the necessary operations in IEEE single precision (FP32), to achieving similar accuracies of the models using custom minifloat formats (FP8). The results can be pushed even further by using genetic algorithms for hyperparameter tuning, in order to find specific hyperparameter for the FP8 version of the model. In this paper, we will present the results of our experiments utilizing YOLOv3 with hybrid floating-point format (HFP8). One of the experiments in this paper shows how our solution can be used for checking social distancing guidelines which is a very important topic in the current COVID-19 pandemic.
引用
收藏
页码:270 / 277
页数:8
相关论文
共 21 条
[1]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[2]  
[Anonymous], 2017, CVPR 2017, DOI DOI 10.1142/9789812771728_0012
[3]  
Darken C., 2013, P 30 INT C MACH LEAR
[4]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[5]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[8]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944
[9]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[10]  
Long Jonathan., P IEEE C COMPUTER VI