Person Re-Identification via Attention Pyramid

被引:94
作者
Chen, Guangyi [1 ]
Gu, Tianpei [1 ]
Lu, Jiwen [1 ,2 ]
Bao, Jin-An [1 ]
Zhou, Jie [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing Natl Res Ctr Informat Sci & Technol BNRIS, Beijing 100084, Peoples R China
[2] Beijing Acad Artificial Intelligence, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Computational modeling; Computational efficiency; Aggregates; Task analysis; Visualization; Stacking; Person re-identification; attention learning; feature pyramid; NETWORK;
D O I
10.1109/TIP.2021.3107211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an attention pyramid method for person re-identification. Unlike conventional attention-based methods which only learn a global attention map, our attention pyramid exploits the attention regions in a multi-scale manner because human attention varies with different scales. Our attention pyramid imitates the process of human visual perception which tends to notice the foreground person over the cluttered background, and further focus on the specific color of the shirt with close observation. Specifically, we describe our attention pyramid by a "split-attend-merge-stack" principle. We first split the features into multiple local parts and learn the corresponding attentions. Then, we merge local attentions and stack these merged attentions with the residual connection as an attention pyramid. The proposed attention pyramid is a lightweight plug-and-play module that can be applied to off-the-shelf models. We implement our attention pyramid method in two different attention mechanisms including: channel-wise attention and spatial attention. We evaluate our method on four large-scale person re-identification benchmarks including Market-1501, DukeMTMC, CUHK03, and MSMT17. Experimental results demonstrate the superiority of our method, which outperforms the state-of-the-art methods by a large margin with limited computationa cost. Code is available at https://github.com/CHENGY12/APNet.
引用
收藏
页码:7663 / 7676
页数:14
相关论文
共 84 条
[1]  
[Anonymous], 2016, P 29 IEEE C COMPUTER
[2]   Mixed High-Order Attention Network for Person Re-Identification [J].
Chen, Binghui ;
Deng, Weihong ;
Hu, Jiani .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :371-381
[3]   Temporal Coherence or Temporal Motion: Which Is More Critical for Video-Based Person Re-identification? [J].
Chen, Guangyi ;
Rao, Yongming ;
Lu, Jiwen ;
Zhou, Jie .
COMPUTER VISION - ECCV 2020, PT VIII, 2020, 12353 :660-676
[4]   Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification [J].
Chen, Guangyi ;
Lu, Yuhao ;
Lu, Jiwen ;
Zhou, Jie .
COMPUTER VISION - ECCV 2020, PT VIII, 2020, 12353 :643-659
[5]   Self-Critical Attention Learning for Person Re-Identification [J].
Chen, Guangyi ;
Lin, Chunze ;
Ren, Liangliang ;
Lu, Jiwen ;
Zhou, Jie .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9636-9645
[6]   Learning Recurrent 3D Attention for Video-Based Person Re-Identification [J].
Chen, Guangyi ;
Lu, Jiwen ;
Yang, Ming ;
Zhou, Jie .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :6963-6976
[7]   Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification [J].
Chen, Guangyi ;
Lu, Jiwen ;
Yang, Ming ;
Zhou, Jie .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) :4192-4205
[8]   ABD-Net: Attentive but Diverse Person Re-Identification [J].
Chen, Tianlong ;
Ding, Shaojin ;
Xie, Jingyi ;
Yuan, Ye ;
Chen, Wuyang ;
Yang, Yang ;
Ren, Zhou ;
Wang, Zhangyang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8350-8360
[9]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[10]   Salience-Guided Cascaded Suppression Network for Person Re-identification [J].
Chen, Xuesong ;
Fu, Canmiao ;
Zhao, Yong ;
Zheng, Feng ;
Song, Jingkuan ;
Ji, Rongrong ;
Yang, Yi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3297-3307