Key points and visible part fusion attention network for occluded pedestrian detection in traffic environments

被引：0

作者：

Liu, Peiyu ^{[1
]}

Ma, Yixuan ^{[1
]}

机构：

[1] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China

来源：

OPTOELECTRONICS LETTERS | 2024年 / 20卷 / 07期

关键词：

D O I：

10.1007/s11801-024-4053-x

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

Aiming at the problem of low detection accuracy of occluded pedestrian in traffic environments, this paper proposes a key points and visible part fusion network for occluded pedestrian detection. The proposed algorithm constructs two attention modules by introducing human key points and the bounding box of visible parts respectively, which suppresses the occluded parts in the channel features and spatial features of pedestrian features respectively. Experimental results on CityPersons and Caltech datasets demonstrate the effectiveness of the proposed algorithm. The missing rate (MR) is reduced to 40.78 on the Heavy subset of the CityPersons dataset and surpasses many outstanding methods.

引用

页码：430 / 436

页数：7

共 31 条

[1] Abdul K., 2023, Localized semantic feature mixers for efficient pedestrian detection in autonomous drivingC, P5476
[2] Cao JL, 2022, IEEE T PATTERN ANAL, V44, P4913, DOI [10.1109/TPAMI.2021.3076733, 10.1145/3459990.3460716]
[3] Beyond triplet loss: a deep quadruplet network for person re-identification
Chen, Weihua
Chen, Xiaotang
Zhang, Jianguo
Huang, Kaiqi
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1320 - 1329
[4] Cheng B., 2020, Higher HRNet: scale-aware representation learning for bottom-up human pose estimationC, P5386
[5] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[6] Dollár P, 2009, PROC CVPR IEEE, P304, DOI 10.1109/CVPRW.2009.5206631
[7] Microsoft COCO: Common Objects in Context
Lin, Tsung-Yi
Maire, Michael
Belongie, Serge
Hays, James
Perona, Pietro
Ramanan, Deva
Dollar, Piotr
Zitnick, C. Lawrence
[J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
[8] Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
Liu, Lingbo
Chen, Jiaqi
Wu, Hefeng
Li, Guanbin
Li, Chenglong
Lin, Liang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4821 - 4831
[9] VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision
Liu, Mengyin
Jiang, Jie
Zhu, Chao
Yin, Xu-Cheng
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6662 - 6671
[10] Adaptive NMS: Refining Pedestrian Detection in a Crowd
Liu, Songtao
Huang, Di
Wang, Yunhong
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6452 - 6461

← 1 2 3 4 →