An advanced YOLOv3 method for small-scale road object detection

被引:29
作者
Wang, Kun [1 ]
Liu, Maozhen [1 ]
Ye, Zhaojun [1 ]
机构
[1] Civil Aviat Univ China, Coll Elect Informat & Automat, Tianjin 300300, Peoples R China
基金
中国国家自然科学基金;
关键词
Road object detection; Deep learning; YOLOv3; Convolutional neural network; VEHICLE DETECTION; MULTISCALE; NETWORK;
D O I
10.1016/j.asoc.2021.107846
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Road target detection is a very challenging task in the field of computer vision because it is easily affected by complex backgrounds and sparse features of small targets. YOLOv3 (You Only Look Once v3) is currently one of the state-of-the-art object detection methods of deep learning. However, because the k-means clustering algorithm is sensitive to the initial clustering center, the local fragile visual field features related to small objects in the prediction map are severely lost and the final decision-making theory (The grid located in the center of the foreground object is responsible for predicting this object) of the network ignores the detailed information of the neighboring grid, there are still many problems in object detection. In this paper, we propose an improved algorithm based on YOLOv3 for small-scale object detection. We use the improved k-medians clustering method instead of the previous k-means to improve the model instability caused by the singularity; We propose a local enhancement method to strengthen weak features for small-scale object detection by paralleling a branch on the backbone. Besides, a flexible offset sampling structure added in parallel for information compensation is also designed. A series of experiments showing that our system has achieved good detection results on the KITTI and UA-DETRAC public datasets, and the distinguishing performance for small-scale objects is significantly improved. Therefore, our method is effective in road target detection tasks. (C) 2021 Published by Elsevier Ltd.
引用
收藏
页数:16
相关论文
共 47 条
  • [1] Bochkovskiy A., 2020, PREPRINT, DOI DOI 10.48550/ARXIV.2004.10934
  • [2] Feature-Fused SSD: Fast Detection for Small Objects
    Cao, Guimei
    Xie, Xuemei
    Yang, Wenzhe
    Liao, Quan
    Shi, Guangming
    Wu, Jinjian
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [3] Multi-View 3D Object Detection Network for Autonomous Driving
    Chen, Xiaozhi
    Ma, Huimin
    Wan, Ji
    Li, Bo
    Xia, Tian
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6526 - 6534
  • [4] Xception: Deep Learning with Depthwise Separable Convolutions
    Chollet, Francois
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1800 - 1807
  • [5] Christ PF, 2017, I S BIOMED IMAGING, P839, DOI 10.1109/ISBI.2017.7950648
  • [6] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [7] CenterNet: Keypoint Triplets for Object Detection
    Duan, Kaiwen
    Bai, Song
    Xie, Lingxi
    Qi, Honggang
    Huang, Qingming
    Tian, Qi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6568 - 6577
  • [8] Engelcke Martin, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1355, DOI 10.1109/ICRA.2017.7989161
  • [9] Fang L., 2018, MULTIMEDIA TOOLS APP, P1
  • [10] Felzenszwalb P, 2008, PROC CVPR IEEE, P1984