X-ReID: Cross-Instance Transformer for Identity-Level Person Re-Identification

被引:2
作者
Shen, Leqi [1 ,2 ]
He, Tao [3 ,4 ]
Zhao, Sicheng [2 ]
Shen, Zhelun [5 ]
Guo, Yuchen [2 ]
Xu, Tianshi [3 ]
Ding, Guiguang [1 ,2 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Tsinghua Univ, BNRist, Beijing, Peoples R China
[3] GRG Banking Equipment Co Ltd, Guangzhou, Peoples R China
[4] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Peoples R China
[5] Baidu Res, RAL, Beijing, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
关键词
Person Re-Identification; Transformer; Crossattention; Identity-level;
D O I
10.1109/ICME57554.2024.10687457
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, most existing person re-identification methods use instance-level features, which are extracted only from a single image. However, these instance-level features can easily ignore the discriminative information because the appearance of each identity varies greatly in different images. Thus, it is necessary to exploit identity-level features, which can be shared across different images of each identity. In this paper, we propose a novel training framework, named X-ReID, to promote instance-level features to identity-level features by employing cross-attention to incorporate information from one image to another of the same identity, thus more unified and discriminative pedestrian information can be obtained. Extensive experiments on benchmark datasets show the superiority of our method over existing works. Particularly, on the challenging MSMT17, our proposed method gains 1.1% mAP improvements when compared to the second place.
引用
收藏
页数:6
相关论文
共 22 条
[1]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00016
[2]  
Calinski T., 1974, Communications in Statistics-theory and Methods, V3, P1, DOI [DOI 10.1080/03610927408827101, 10.1080/03610927408827101, https://doi.org/10.1080/03610927408827101]
[3]  
CHEN DD, 2018, CVPR, P6654, DOI DOI 10.1109/CVPR.2018.00696
[4]   Large-scale Tag-based Font Retrieval with Generative Feature Learning [J].
Chen, Tianlang ;
Wang, Zhaowen ;
Xu, Ning ;
Jin, Hailin ;
Luo, Jiebo .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9115-9124
[5]   Salience-Guided Cascaded Suppression Network for Person Re-identification [J].
Chen, Xuesong ;
Fu, Canmiao ;
Zhao, Yong ;
Zheng, Feng ;
Song, Jingkuan ;
Ji, Rongrong ;
Yang, Yi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3297-3307
[6]  
He Shuting, 2021, ICCV
[7]  
He T., 2022, AAAI
[8]  
He Tao, 2021, CAMERA BIAS REGULARI
[9]  
Jia Mengxi, 2023, AAAI
[10]  
Li YZ, 2021, PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, P998, DOI [10.1007/978-981-15-9746-6_74, 10.1145/3459637.3482257]