DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval

被引:111
|
作者
Zhu, Aichun [1 ]
Wang, Zijie [1 ]
Li, Yifeng [1 ]
Wan, Xili [1 ]
Jin, Jing [1 ]
Wang, Tian [2 ]
Hu, Fangqiang [1 ]
Hua, Gang [3 ]
机构
[1] Nanjing Tech Univ, Nanjing, Peoples R China
[2] Beihang Univ, Beijing, Peoples R China
[3] China Univ Min & Technol, Xuzhou, Jiangsu, Peoples R China
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
person retrieval; text-based person re-identification; cross-modal retrieval; surroundings-person separation;
D O I
10.1145/3474085.3475369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many previous methods on text-based person retrieval tasks are devoted to learning a latent common space mapping, with the purpose of extracting modality-invariant features from both visual and textual modality. Nevertheless, due to the complexity of high-dimensional data, the unconstrained mapping paradigms are not able to properly catch discriminative clues about the corresponding person while drop the misaligned information. Intuitively, the information contained in visual data can be divided into person information (PI) and surroundings information (SI), which are mutually exclusive from each other. To this end, we propose a novel Deep Surroundings-person Separation Learning (DSSL) model in this paper to effectively extract and match person information, and hence achieve a superior retrieval accuracy. A surroundings-person separation and fusion mechanism plays the key role to realize an accurate and effective surroundings-person separation under a mutually exclusion constraint. In order to adequately utilize multimodal and multi-granular information for a higher retrieval accuracy, five diverse alignment paradigms are adopted. Extensive experiments are carried out to evaluate the proposed DSSL on CUHK-PEDES, which is currently the only accessible dataset for text-base person retrieval task. DSSL achieves the state-of-the-art performance on CUHK-PEDES. To properly evaluate our proposed DSSL in the real scenarios, a Real Scenarios Text-based Person Reidentification (RSTPReid) dataset is constructed to benefit future research on text-based person retrieval, which will be publicly available.
引用
收藏
页码:209 / 217
页数:9
相关论文
共 50 条
  • [21] Decentralized Text-Based Person Re-Identification in Multi-Camera Networks
    Agyeman, Rockson
    Rinner, Bernhard
    IEEE ACCESS, 2024, 12 : 172125 - 172148
  • [22] Specific Person Retrieval via Incomplete Text Description
    Ye, Mang
    Liang, Chao
    Wang, Zheng
    Leng, Qingming
    Chen, Jun
    Liu, Jun
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 547 - 550
  • [23] MGRL: MUTUAL-GUIDANCE REPRESENTATION LEARNING FOR TEXT-TO-IMAGE PERSON RETRIEVAL
    Lv, Tianle
    Li, Shuang
    Leng, Jiaxu
    Gao, Xinbo
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2895 - 2899
  • [24] Learning deep part-aware emb e dding for person retrieval
    Zhao, Yang
    Shen, Chunhua
    Yu, Xiaohan
    Chen, Hao
    Gao, Yongsheng
    Xiong, Shengwu
    PATTERN RECOGNITION, 2021, 116
  • [25] Learning discriminative region representation for person retrieval
    Zhao, Yang
    Yu, Xiaohan
    Gao, Yongsheng
    Shen, Chunhua
    PATTERN RECOGNITION, 2022, 121
  • [26] Person retrieval based on viewpoint saliency prior
    Hu, R. (hurm1964@gmail.com), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09): : 8235 - 8242
  • [27] ASPD-Net: Self-aligned part mask for improving text-based person re-identification with adversarial representation learning
    Wang, Zijie
    Xue, Jingyi
    Wan, Xili
    Zhu, Aichun
    Li, Yifeng
    Zhu, Xiaomei
    Hu, Fangqiang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [28] Image-Centered Pseudo Label Generation for Weakly Supervised Text-Based Person Re-Identification
    Nie, Weizhi
    Wu, Chengji
    Sun, Hao
    Xie, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XII, 2025, 15042 : 477 - 491
  • [29] Adaptive and Robust Partition Learning for Person Retrieval With Policy Gradient
    Shi, Yuxuan
    Wei, Zhen
    Ling, Hefei
    Wang, Ziyang
    Zhu, Pengfei
    Shen, Jialie
    Li, Ping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3264 - 3277
  • [30] Enhancing Text-Image Person Retrieval Through Nuances Varied Sample
    Xia, Jiaer
    Yang, Haozhe
    Zhang, Yan
    Dai, Pingyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 185 - 196