Yolov3-Pruning(transfer): real-time object detection algorithm based on transfer learning

被引:0
作者
Xiaoning Li
Zhengzhong Wang
Shichao Geng
Lin Wang
Huaxiang Zhang
Li Liu
Donghua Li
机构
[1] Shandong Normal University,School of Information Science and Engineering
[2] Shandong Normal University,School of Journalism and Communication
[3] Shandong Normal University,Institute of Data Science and Technology
来源
Journal of Real-Time Image Processing | 2022年 / 19卷
关键词
Object detection; Transfer learning; Pruning; Detection accuracy; Inference speed; Real-time processing;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, object detection algorithms have achieved great success in the field of machine vision. To pursue the detection accuracy of the model, the scale of the network is constantly increasing, which leads to the continuous increase in computational cost and a large requirement for memory. The larger network scale allows their execution to take a longer time, facing the balance between the detection accuracy and the speed of execution. Therefore, the developed algorithm is not suitable for real-time applications. To improve the detection performance of small targets, we propose a new method, the real-time object detection algorithm based on transfer learning. Based on the baseline Yolov3 model, pruning is done to reduce the scale of the model, and then migration learning is used to ensure the detection accuracy of the model. The object detection method using transfer learning achieves a good balance between detection accuracy and inference speed and is more conducive to the real-time processing of images. Through the evaluation of the dataset voc2007 + 2012, the experimental results show that the parameters of the Yolov3-Pruning(transfer): model are reduced by 3X compared with the baseline Yolov3 model, and the detection accuracy is improved, realizes real-time processing, and improves the detection accuracy.
引用
收藏
页码:839 / 852
页数:13
相关论文
共 96 条
[11]  
Al-Maadeed S(2018)Least squares kernel ensemble regression in reproducing kernel hilbert space Neurocomputing 311 208-5978
[12]  
Bouwmans T(2019)Deep learning-based remaining useful life estimation of bearings using multi-scale feature extraction Reliabil. Eng. Syst. Saf. 182 4490-4354
[13]  
Jung SK(2018)Object-location-aware hashing for multi-label image retrieval via automatic mask learning IEEE Trans. Image Process. 27 5966-281
[14]  
Millon M(2020)Graph convolutional networks for hyperspectral image classification IEEE Trans. Geosci. Remote Sens. 59 4340-1296
[15]  
Galan A(2020)More diverse means better: multimodal deep learning meets remote-sensing imagery classification IEEE Trans. Geosci. Remote Sens. 59 268-138
[16]  
Courbin F(2016)Joint depth and semantic inference from a single image via elastic conditional random field Pattern Recogn. 59 1285-31
[17]  
Treu T(2019)Model compression with adversarial robustness: aunified optimization framework Adv. Neural. Inf. Process. Syst. 32 128-5158
[18]  
Suyu S(2020)Autopruner: an end-to-end trainable filter pruning method for efficient deep model inference Pattern Recogn. 107 1-306
[19]  
Ding X(2019)Hybrid deep learning and empirical mode decomposition model for time series applications Expert Syst. Appl. 120 5146-755
[20]  
Birrer S(2019)Quantification of model uncertainty in rans simulations: A review Prog. Aerosp. Sci. 108 302-18859