Cross-Modality Earth Mover's Distance for Visible Thermal Person Re-identification

被引:0
作者
Ling, Yongguo [1 ]
Zhong, Zhun [2 ]
Luo, Zhiming [1 ]
Yang, Fengxiang [1 ]
Cao, Donglin [1 ]
Lin, Yaojin [3 ]
Li, Shaozi [1 ]
Sebe, Nicu [2 ]
机构
[1] Xiamen Univ, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[3] Minnan Normal Univ, Sch Comp Sci, Zhangzhou, Peoples R China
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2 | 2023年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visible thermal person re-identification (VT-ReID) suffers from inter-modality discrepancy and intra-identity variations. Distribution alignment is a popular solution for VT-ReID, however, it is usually restricted to the influence of the intra-identity variations. In this paper, we propose the Cross-Modality Earth Mover's Distance (CM-EMD) that can alleviate the impact of the intra-identity variations during modality alignment. CM-EMD selects an optimal transport strategy and assigns high weights to pairs that have a smaller intra-identity variation. In this manner, the model will focus on reducing the inter-modality discrepancy while paying less attention to intra-identity variations, leading to a more effective modality alignment. Moreover, we introduce two techniques to improve the advantage of CM-EMD. First, Cross-Modality Discrimination Learning (CM-DL) is designed to overcome the discrimination degradation problem caused by modality alignment. By reducing the ratio between intra-identity and inter-identity variances, CM-DL leads the model to learn more discriminative representations. Second, we construct the Multi-Granularity Structure (MGS), enabling us to align modalities from both coarse- and fine-grained levels with the proposed CM-EMD. Extensive experiments show the benefits of the proposed CM-EMD and its auxiliary techniques (CM-DL and MGS). Our method achieves state-of-the-art performance on two VT-ReID benchmarks.
引用
收藏
页码:1631 / 1639
页数:9
相关论文
共 66 条
  • [1] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [2] Chen LC, 2018, ADV NEUR IN, V31
  • [3] Chen LQ, 2019, Arxiv, DOI arXiv:1901.06283
  • [4] Chen XY, 2019, PR MACH LEARN RES, V97
  • [5] Neural Feature Search for RGB-Infrared Person Re-Identification
    Chen, Yehansen
    Wan, Lin
    Li, Zhihang
    Jing, Qianyan
    Sun, Zongyuan
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 587 - 597
  • [6] Choi S, 2020, PROC CVPR IEEE, P10254, DOI 10.1109/CVPR42600.2020.01027
  • [7] Cuturi M., 2013, Advances in neural information processing systems, V26
  • [8] Dai PY, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P677
  • [9] Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification
    Feng, Zhanxiang
    Lai, Jianhuang
    Xie, Xiaohua
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 579 - 590
  • [10] CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification
    Fu, Chaoyou
    Hu, Yibo
    Wu, Xiang
    Shi, Hailin
    Mei, Tao
    He, Ran
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11803 - 11812