PAFM: pose-drive attention fusion mechanism for occluded person re-identification

被引:21
作者
Yang, Jing [1 ]
Zhang, Canlong [1 ]
Tang, Yanping [2 ]
Li, Zhixin [1 ]
机构
[1] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention mechanism; Pose estimation; Multitask network; Person re-identification;
D O I
10.1007/s00521-022-06903-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrians are often occluded by various obstacles in public places, which is a big challenge for person re-identification. To alleviate the occlusion problem, we propose a Pose-drive Attention Fusion Mechanism (PAFM) that jointly fuses the discriminative features with pose-driven attention and spatial attention in an end-to-end framework. To simultaneously use global and local features, a multi-task network is constructed to realize multi-granularity feature representation. After anchoring the region of interest to the un-occluded spatial semantic information in the image through the spatial attention mechanism, some key points of the pedestrian's body are extracted using pose estimation and then fused with the spatial attention map to eliminate the harm of occlusion to the re-identification. Besides, the identification granularity is increased by matching the local features. We test and verify the effectiveness of the PAFM on Occluded-DukeMTMC, Occluded-REID and Partial-REID. The experimental results show that the proposed method has achieved competitive performance to the state-of-the-art methods.
引用
收藏
页码:8241 / 8252
页数:12
相关论文
共 39 条
  • [1] UniPose: Unified Human Pose Estimation in Single Images and Videos
    Artacho, Bruno
    Savakis, Andreas
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7033 - 7042
  • [2] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [3] Hard sample mining makes person re-identification more efficient and accurate
    Chen, Kezhou
    Chen, Yang
    Han, Chuchu
    Sang, Nong
    Gao, Changxin
    [J]. NEUROCOMPUTING, 2020, 382 (259-267) : 259 - 267
  • [4] ABD-Net: Attentive but Diverse Person Re-Identification
    Chen, Tianlong
    Ding, Shaojin
    Xie, Jingyi
    Yuan, Ye
    Chen, Wuyang
    Yang, Yang
    Ren, Zhou
    Wang, Zhangyang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8350 - 8360
  • [5] Salience-Guided Cascaded Suppression Network for Person Re-identification
    Chen, Xuesong
    Fu, Canmiao
    Zhao, Yong
    Zheng, Feng
    Song, Jingkuan
    Ji, Rongrong
    Yang, Yi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3297 - 3307
  • [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [7] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [8] Deep Spatial Pyramid Feature Collaborative Reconstruction for Partial Person ReID
    Gao, Zan
    Gao, Lishuai
    Zhang, Hua
    Cheng, Zhiyong
    Hong, Richang
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1879 - 1887
  • [9] Ge YX, 2018, ADV NEUR IN, V31
  • [10] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778