An advanced YOLOv3 method for small-scale road object detection

Cited by: 33

Authors:

Wang, Kun [1]

Liu, Maozhen [1]

Ye, Zhaojun [1]

Affiliations:

[1] Civil Aviation University of China, College of Electronic Information and Automation, Tianjin 300300, People's Republic of China

Source:

APPLIED SOFT COMPUTING | 2021 / Volume 112

Funding:

National Natural Science Foundation of China;

Keywords:

Road object detection; Deep learning; YOLOv3; Convolutional neural network; Vehicle detection; Multiscale; Network;

DOI:

10.1016/j.asoc.2021.107846

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Road target detection is a very challenging task in computer vision because it is easily affected by complex backgrounds and by the sparse features of small targets. YOLOv3 (You Only Look Once v3) is currently one of the state-of-the-art deep learning methods for object detection. However, it still has several problems: the k-means clustering algorithm is sensitive to the initial cluster centers; the fragile local visual-field features related to small objects are severely lost in the prediction map; and the network's final decision rule (the grid cell located at the center of the foreground object is responsible for predicting that object) ignores the detailed information of neighboring grid cells. In this paper, we propose an improved algorithm based on YOLOv3 for small-scale object detection. We use an improved k-medians clustering method instead of the previous k-means to reduce the model instability caused by the singularity. We propose a local enhancement method that strengthens weak features for small-scale object detection by adding a parallel branch to the backbone. In addition, we design a flexible offset sampling structure, added in parallel, for information compensation. A series of experiments shows that our system achieves good detection results on the KITTI and UA-DETRAC public datasets and that the discrimination performance for small-scale objects is significantly improved. Therefore, our method is effective in road target detection tasks. (C) 2021 Published by Elsevier Ltd.
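The k-medians idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' improved method, whose details are not given in this record. It assumes, as in standard YOLOv3 practice, that clustering is applied to the (width, height) pairs of ground-truth boxes to choose anchor sizes. The function name `k_medians`, the L1 distance, the fixed initial centers, and the toy box sizes are all assumptions made for illustration.

```python
from statistics import median


def k_medians(boxes, centers, iters=50):
    """Cluster (width, height) pairs using L1 distance and per-coordinate medians.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    centers = [tuple(c) for c in centers]
    for _ in range(iters):
        # Assignment step: each box goes to the nearest center under L1 distance.
        clusters = [[] for _ in centers]
        for w, h in boxes:
            idx = min(
                range(len(centers)),
                key=lambda i: abs(w - centers[i][0]) + abs(h - centers[i][1]),
            )
            clusters[idx].append((w, h))
        # Update step: the median (not the mean) of each coordinate.
        # An empty cluster keeps its previous center.
        new_centers = [
            (median(w for w, _ in c), median(h for _, h in c)) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    return centers


# Toy data (made up): three small boxes, three medium boxes, one outlier (200, 40).
boxes = [(10, 12), (12, 14), (11, 13), (200, 40), (50, 60), (52, 58), (54, 62)]
anchors = k_medians(boxes, centers=[(10, 10), (60, 60)])
print(anchors)
```

In this toy data the outlier box (200, 40) falls into the second cluster. A mean-based update would pull that cluster's width to 89, while the median keeps it at 53, which shows why a median update is less sensitive to singular points.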


Pages: 16
