Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification

被引:21
作者
Li, Xulin [1 ,2 ]
Lu, Yan [1 ,2 ]
Liu, Bin [1 ,2 ]
Liu, Yating [3 ]
Yin, Guojun [1 ,2 ]
Chu, Qi [1 ,2 ]
Huang, Jinyang [1 ,2 ]
Zhu, Feng [4 ]
Zhao, Rui [4 ,5 ]
Yu, Nenghai [1 ,2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
[2] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Beijing, Peoples R China
[3] Univ Sci & Technol China, Sch Data Sci, Hefei, Peoples R China
[4] SenseTime Res, Hong Kong, Peoples R China
[5] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China
来源
COMPUTER VISION, ECCV 2022, PT XXVI | 2022年 / 13686卷
关键词
Person re-identification; Counterfactual; Cross-modality;
D O I
10.1007/978-3-031-19809-0_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph-based models have achieved great success in person re-identification tasks recently, which compute the graph topology structure (affinities) among different people first and then pass the information across them to achieve stronger features. But we find existing graph-based methods in the visible-infrared person re-identification task (VI-ReID) suffer from bad generalization because of two issues: 1) train-test modality balance gap, which is a property of VI-ReID task. The number of two modalities data are balanced in the training stage but extremely unbalanced in inference, causing the low generalization of graph-based VI-ReID methods. 2) sub-optimal topology structure caused by the end-to-end learning manner to the graph module. We analyze that the joint learning of backbone features and graph features weaken the learning of graph topology, making it not generalized enough during the inference process. In this paper, we propose a Counterfactual Intervention Feature Transfer (CIFT) method to tackle these problems. Specifically, a Homogeneous and Heterogeneous Feature Transfer ((HFT)-F-2) is designed to reduce the train-test modality balance gap by two independent types of well-designed graph modules and an unbalanced scenario simulation. Besides, a Counterfactual Relation Intervention (CRI) is proposed to utilize the counterfactual intervention and causal effect tools to highlight the role of topology structure in the whole training process, which makes the graph topology structure more reliable. Extensive experiments on standard VI-ReID benchmarks demonstrate that CIFT outperforms the state-of-the-art methods under various settings.
引用
收藏
页码:381 / 398
页数:18
相关论文
共 49 条
  • [1] Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
  • [2] Chalupka K, 2015, Arxiv, DOI arXiv:1412.2309
  • [3] Group Consistent Similarity Learning via Deep CRF for Person Re-Identification
    Chen, Dapeng
    Xu, Dan
    Li, Hongsheng
    Sebe, Nicu
    Wang, Xiaogang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8649 - 8658
  • [4] Chen L, 2020, PROC CVPR IEEE, P10797, DOI 10.1109/CVPR42600.2020.01081
  • [5] Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677
  • [6] Gong SG, 2014, ADV COMPUT VIS PATT, P1, DOI 10.1007/978-1-4471-6296-4_1
  • [7] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
  • [8] Hao X., 2021, INT C COMPUTER VISIO, P16403
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Ling YG, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P845