Key points and visible part fusion attention network for occluded pedestrian detection in traffic environments

Cited by: 0
Authors
Liu, Peiyu [1]
Ma, Yixuan [1]
Affiliations
[1] Beijing Jiaotong University, School of Software Engineering, Beijing 100044, People's Republic of China
Keywords
A;
DOI
10.1007/s11801-024-4053-x
Chinese Library Classification (CLC) number
O43 [Optics];
Discipline classification code
070207; 0803;
Abstract
To address the low detection accuracy for occluded pedestrians in traffic environments, this paper proposes a key points and visible part fusion network for occluded pedestrian detection. The proposed algorithm constructs two attention modules, one guided by human key points and the other by the bounding box of the visible part, which suppress the occluded parts in the channel features and the spatial features of the pedestrian representation, respectively. Experimental results on the CityPersons and Caltech datasets demonstrate the effectiveness of the proposed algorithm. The miss rate (MR) is reduced to 40.78 on the Heavy subset of the CityPersons dataset, surpassing many strong existing methods.
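The idea of suppressing occluded content along both the channel and the spatial dimension can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the attention weights are computed, so the function names `visible_box_mask` and `apply_attention`, the hard binary mask, and the externally supplied per-channel weights are all assumptions made for illustration only.

```python
def visible_box_mask(height, width, box):
    """Binary spatial mask: 1.0 inside the visible-part box, 0.0 elsewhere.

    box is (x1, y1, x2, y2) in feature-map cells, with x2 and y2 exclusive.
    A hard mask is an assumption; a learned, soft mask is equally plausible.
    """
    x1, y1, x2, y2 = box
    return [
        [1.0 if (x1 <= x < x2 and y1 <= y < y2) else 0.0 for x in range(width)]
        for y in range(height)
    ]


def apply_attention(features, channel_weights, spatial_mask):
    """Scale each channel by its weight and each location by the spatial mask.

    features: list of C feature maps, each a list of H rows of W floats.
    channel_weights: list of C floats (hypothetical; in the paper these would
    be derived from human key points in a way the abstract does not specify).
    """
    return [
        [
            [value * weight * m for value, m in zip(row, mask_row)]
            for row, mask_row in zip(fmap, spatial_mask)
        ]
        for fmap, weight in zip(features, channel_weights)
    ]


# Toy example: 2 channels of size 2 x 3, visible part covers the top-left 1 x 2 cells.
features = [[[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]] for _ in range(2)]
mask = visible_box_mask(height=2, width=3, box=(0, 0, 2, 1))
attended = apply_attention(features, channel_weights=[0.5, 2.0], spatial_mask=mask)
```

In this toy example, locations outside the visible-part box are zeroed in every channel, while the channel weights rescale whatever remains inside the box.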
Pages: 430-436
Page count: 7