Occlusion-aware pedestrian detection combined with dual attention mechanism

被引：0

作者：

Zhou D. ^{[1
,2
]}

Song R. ^{[1
]}

Yang X. ^{[1
]}

机构：

[1] College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing

[2] Jiangsu Key Laboratory of Internet of Things and Control Technologies (Nanjing University of Aeronautics and Astronautics), Nanjing

来源：

Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology | 2021年 / 53卷 / 09期

关键词：

Attention mechanism; Convolutional neural networks; Occlusion; Pedestrian detection; Real-time;

D O I：

10.11918/201904144

中图分类号：

O212 [数理统计];

学科分类号：

摘要：

To address the occlusion problem of pedestrian detection algorithm when applied in traffic scenarios, this paper presents an occlusion-aware algorithm combined with dual attention mechanism for pedestrian detection. Based on the RetinaNet framework, the spatial-wise attention mechanism and channel-wise attention mechanism were utilized in regression and classification branches respectively, guiding the detector to pay more attention to the visible parts of pedestrians. Moreover, visible bounding box information of pedestrians was introduced to optimize the traditional regression loss function, so that it can adaptively adjust the weights of predicted boxes according to the degree of occlusion. Experiments on Caltech and CityPerson datasets show that the proposed algorithm had better robustness and higher accuracy than other eight advanced algorithms such as RetinaNet. Especially in the case of heavy occlusion, the log-average miss rate of the proposed algorithm was only 45.69%, which was 12% lower than those of other algorithms. Furthermore, the proposed algorithm could detect pedestrians in quasi-real-time. It processed 11.8 frames per second on Caltech dataset and 10.0 frames per second on CityPerson dataset. The detection methods of dual attention mechanism and occlusion-aware regression loss function proposed in this paper are feasible and effective, and have significant advantages for the processing of occluded pedestrians. © 2021, Editorial Board of Journal of Harbin Institute of Technology. All right reserved.

引用

页码：156 / 163

页数：7

共 28 条

[1]

SU Songzhi, LI Shaozi, CHEN Shuyuan, Et al., Overview of pedestrian detection technology, Electronic Journal, 40, 4, (2012)

[2]

DALAL N, TRIGGS B., Histograms of oriented gradients for human detection, IEEE Conference on Computer Vision and Pattern Recognition, (2005)

[3]

FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, Et al., Object detection with discriminatively trained part-based models, IEEE Transactions on Pattern Analysis and Machine Intelligence, 32, 9, (2010)

[4]

DOLLAR P, APPEL R, BELONGIE S, Et al., Fast feature pyramids for object detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 36, 8, (2014)

[5]

KRIZHEVSKY A, SUTSKEVER I, HINTON G E., ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, (2012)

[6]

YAO Qunli, HU Xian, LEI Hong, Research of deep convolution neural network in objection detection, Computer Engineering and Application, 54, 17, (2018)

[7]

REN S, HE K, GIRSHICK R, Et al., Faster R-CNN: towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems, (2015)

[8]

REDMON J, DIVVALA S, GIRSHICK R, Et al., You only look once: unified, real-time object detection, IEEE Conference on Computer Vision and Pattern Recognition, (2016)

[9]

LIU W, ANGUELOV D, ERHAN D, Et al., SSD: single shot MultiBox detector, European Conference on Computer Vision, (2016)

[10]

TIAN Y, LUO P, WANG X, Et al., Deep learning strong parts for pedestrian detection, IEEE International Conference on Computer Vision, (2015)

← 1 2 3 →