Occluded Pedestrian Detection Algorithm Based on Improved YOLOv3

被引：13

作者：

Li Xiang ^{[1
,2
,3
,4
,5
]}

He Miao ^{[1
,2
,3
,4
]}

Luo Haibo ^{[1
,2
,3
,4
]}

机构：

[1] Chinese Acad Sci, Key Lab Optoelect Informat Proc, Shenyang 110016, Liaoning, Peoples R China

[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China

[3] Chinese Acad Sci, Inst Robot, Shenyang 110169, Liaoning, Peoples R China

[4] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang 110169, Liaoning, Peoples R China

[5] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

ACTA OPTICA SINICA | 2022年 / 42卷 / 14期

关键词：

machine vision; object detection; neural network; pedestrian detection; attention mechanism;

D O I：

10.3788/AOS202242.1415003

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

In crowded scenes, it is difficult for YOLOv3 to detect the objects that overlap each other heavily. Aiming at the reasons for the decline of YOLOv3 performance, three improvements are proposed. Firstly, a Tight Loss function is proposed, which optimizes the variance and mean of the coordinates of the prediction boxes to make the prediction boxes belonging to the same target more compact, thus reducing the false positive rate. Secondly, a high-resolution feature pyramid is proposed, in which the resolution of each pyramid feature is improved by upsampling, and shallow features are introduced to enhance the differences between adjacent sub-features, so as to generate distinguishing depth features for highly overlapped targets. Thirdly, a detection head based on spatial attention mechanism is proposed to reduce the number of redundant prediction boxes, so as to reduce the computational burden of the non- maximum suppression (NMS) process. The experimental results on the crowded dataset CrowdHuman show that the average accuracy and recall rate of YOLOv3 detection are improved by 2. 91 percentage points and 3. 20 percentage points, and the miss rate is reduced by 1. 24 percentage points by using the proposed algorithms under the condition of using the traditional NMS method, which demonstrates the effectiveness of the proposed algorithms in boosting the performance in occluded pedestrian detection.

引用

页数：10

共 27 条

[1]

Bochkovskiy A., 2020, ARXIV 200410934

[2] Soft-NMS - Improving Object Detection With One Line of Code [J].

Bodla, Navaneeth ;

Singh, Bharat ;

Chellappa, Rama ;

Davis, Larry S. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570

[3]

Ge Z, 2020, 2020 IEEE INT C MULT

[4]

Ge Z., 2021, YOLOX: Exceeding YOLO series in 2021., DOI 10.48550/ARXIV.2107.08430

[5] 驾驶特性的识别评估及其在智能汽车上的应用综述 [J].

郭烈 ;

马跃 ;

岳明 ;

秦增科 .

交通运输工程学报, 2021, 21 (02) :7-20

[6] Position Detection Algorithm of Road Obstacles Based on 3D LiDAR [J].

Hu Jie ;

Liu Han ;

Au Wencai ;

Zhao Liang .

CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2021, 48 (24)

[7]

Ji D F, 2020, INFORM CONTROL, V49, P401

[8]

Kopsiaftis G, 2015, INT GEOSCI REMOTE SE, P1881, DOI 10.1109/IGARSS.2015.7326160

[9] Adaptive NMS: Refining Pedestrian Detection in a Crowd [J].

Liu, Songtao ;

Huang, Di ;

Wang, Yunhong .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6452-6461

[10] Research Progress of Key Technologies in Recognition Sensing for Opto-Electronic Information and Event [J].

Liu Tiegen ;

Liu Kun ;

Dai Lin ;

Jiang Junfeng ;

Wang Jian ;

Ding Zhenyang ;

Sang Mei ;

Hu Haofeng ;

Wang Shuang ;

Xue Chao ;

Wang Jingbin ;

Deng Ye .

ACTA OPTICA SINICA, 2021, 41 (01)

← 1 2 3 →