Object detection using stacked YOLOv3

被引:1
作者
Padmanabula S.S. [1 ]
Puvvada R.C. [1 ]
Sistla V. [1 ]
Kishore Kolli V.K. [1 ]
机构
[1] Department of CSE, VFSTR Deemed to be University, Vadlamudi, Guntur, Andhra Pradesh
来源
Ingenierie des Systemes d'Information | 2020年 / 25卷 / 05期
关键词
Class probabilities; Deep neural network; Non-maxima suppression; Object detection; Transfer learning; Unified architecture; YOLOv3;
D O I
10.18280/ISI.250517
中图分类号
学科分类号
摘要
Object detection is a stimulating task in the applications of computer vision. It is gaining a lot of attention in many real-time applications such as detection of number plates of suspect cars, identifying trespassers under surveillance areas, detecting unmasked faces in security gates during the COVID-19 period, etc. Region-based Convolution Neural Networks(R-CNN), You only Look once (YOLO) based CNNs, etc., comes under Deep Learning approaches. In this proposed work, an improved stacked Yolov3 model is designed for the detection of objects by bounding boxes. Hyperparameters are tuned to get optimum performance. The proposed model evaluated using the COCO dataset, and the performance is better than other existing object detection models. Anchor boxes are used for overlapping objects. After removing all the predicted bounding boxes that have a low detection probability, bounding boxes with the highest detection probability are selected and eliminated all the bounding boxes whose Intersection Over Union value is higher than 0.4. Non-Maximal Suppression (NMS) is used to only keep the best bounding box. In this experimentation, we have tried with various range of values, but finally got better result at threshold 0.5. © 2020 International Information and Engineering Technology Association. All rights reserved.
引用
收藏
页码:691 / 697
页数:6
相关论文
共 23 条
[1]  
Shakil S., Rajjak A., Kureshi A.K., Object detection and tracking using YOLO v3 framework for increased resolution video, International Journal of Innovative Technology and Exploring Engineering (IJITEE), 9, 6, pp. 118-125, (2020)
[2]  
Agrawal P., Girshick R., Malik J., Analyzing the Performance of Multilayer Neural Networks for Object Recognition, Computer Vision - ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, 8695, (2014)
[3]  
Szegedy C., Toshev A., Erhan D., Deep neural networks for object detection, Advances in Neural Information Processing Systems, 26, pp. 1-9, (2013)
[4]  
Liu L., Ouyang W.L., Wang X.G., Fieguth P., Chen J., Liu X.W., Pietikainen M., Deep learning for generic object detection: A survey, International Journal of Computer Vision, 128, 2, pp. 261-318, (2020)
[5]  
Galvez R.L., Bandala A.A., Dadios E.P., Vicerra R.R.P., Maningo J.M.Z., Object detection using convolutional neural networks, TENCON 2018 - 2018 IEEE Region 10 Conference, pp. 2023-2027, (2019)
[6]  
Zhao Z.Q., Zheng P., Xu S.T., Wu X., Object detection with deep learning: A review, IEEE Transactions on Neural Networks and Learning Systems, 30, 11, pp. 3212-3232, (2019)
[7]  
Ammar A., Koubaa A., Ahmed M., Saad A., Aerial images processing for car detection using convolutional neural networks: Comparison between Faster R-CNN and YoloV3, (2019)
[8]  
Redmon J., Divvala S., Girshick R., Farhadi A., You only look once: Unified, real-time object detection, Computer Vision and Pattern Recognition (cs.CV), (2016)
[9]  
Redmon J., Farhadi A., YOLO9000: Better, faster, stronger, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517-6525, (2017)
[10]  
Redmon J., Farhadi A., YOLOv3: An incremental improvement, Computer Vision and Pattern Recognition (cs.CV), (2018)