An improved Tiny YOLOv3 for real-time object detection

Cited by: 24
Authors
Gai, Wendong [1]
Liu, Yakun [1]
Zhang, Jing [1]
Jing, Gang [1]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao, Peoples R China
Keywords
Object detection; Tiny YOLOv3; multi-scale prediction; K-means; real-time; APPLE DETECTION
DOI
10.1080/21642583.2021.1901156
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Existing real-time object detection algorithms often miss objects. To address this, an improved Tiny YOLOv3 (You Only Look Once) algorithm is proposed that is both lightweight and accurate. The improved Tiny YOLOv3 uses K-means clustering to estimate the anchor box sizes for the dataset. Pooling and convolution layers are added to the network to strengthen feature fusion and reduce the number of parameters. Additional upsampling and downsampling in the network structure enhance multi-scale fusion. The complete intersection over union (CIoU) is added to the loss function, which effectively improves the detection results. In addition, the proposed model is lightweight in size and can be trained on a CPU. The experimental results show that the proposed method meets the requirements for detection speed and accuracy.
Pages: 314-321
Page count: 8
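The abstract names two techniques that are easy to illustrate in isolation: K-means estimation of anchor box sizes and the complete intersection over union (CIoU) term in the loss. The following is a minimal sketch of both, not the authors' implementation. The function names (`ciou`, `wh_iou`, `kmeans_anchors`), the (x1, y1, x2, y2) box layout, the `eps` constant, and the use of 1 - IoU as the clustering distance (the convention introduced with YOLOv2) are all assumptions made for illustration; this record does not state which distance or update rule the paper uses.

```python
import math
import random


def ciou(box_a, box_b, eps=1e-9):
    """Complete IoU of two boxes given as (x1, y1, x2, y2) (assumed layout)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    wa, ha = ax2 - ax1, ay2 - ay1
    wb, hb = bx2 - bx1, by2 - by1

    # Plain IoU: overlap area divided by union area.
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    iou = inter / (wa * ha + wb * hb - inter + eps)

    # Squared centre distance, normalised by the squared diagonal
    # of the smallest box enclosing both inputs.
    rho2 = (((ax1 + ax2) - (bx1 + bx2)) ** 2 + ((ay1 + ay2) - (by1 + by2)) ** 2) / 4
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    c2 = cw ** 2 + ch ** 2 + eps

    # Aspect-ratio consistency term v and its trade-off weight alpha.
    v = (4 / math.pi ** 2) * (math.atan(wb / (hb + eps)) - math.atan(wa / (ha + eps))) ** 2
    alpha = v / (1 - iou + v + eps)
    return iou - rho2 / c2 - alpha * v


def wh_iou(wh_a, wh_b):
    """IoU of two (width, height) boxes aligned at the same corner."""
    inter = min(wh_a[0], wh_b[0]) * min(wh_a[1], wh_b[1])
    return inter / (wh_a[0] * wh_a[1] + wh_b[0] * wh_b[1] - inter)


def kmeans_anchors(sizes, k, iters=100, seed=0):
    """Cluster (width, height) pairs with 1 - IoU as the distance (assumption)."""
    rng = random.Random(seed)
    centroids = rng.sample(sizes, k)
    for _ in range(iters):
        # Assign each box to the centroid it overlaps most.
        clusters = [[] for _ in range(k)]
        for wh in sizes:
            best = max(range(k), key=lambda i: wh_iou(wh, centroids[i]))
            clusters[best].append(wh)
        # Move each centroid to the mean size of its cluster;
        # an empty cluster keeps its previous centroid.
        updated = [
            (sum(w for w, _ in c) / len(c), sum(h for _, h in c) / len(c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    # Return anchors ordered from smallest to largest area.
    return sorted(centroids, key=lambda wh: wh[0] * wh[1])
```

In a training loop, the CIoU term would typically enter the box regression loss as `1 - ciou(predicted, target)`, and `kmeans_anchors` would be run once over the labelled box sizes of the dataset to produce the anchor list for the detector configuration.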