Bottom-up color-independent alignment learning for text-image person re-identification

被引:0
|
作者
Du, Guodong [1 ]
Zhu, Hanyue [1 ]
Zhang, Liyan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
关键词
Text-based person retrieval; Person re-identification; Cross-modal retrieval; Color information;
D O I
10.1016/j.engappai.2024.109421
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text-to-image person re-identification (TIReID) refers to identifying images of a person of interest from a large-scale person image database based on natural language descriptions. Most of existing methods generally rely heavily on color information when matching cross-modal data, which is a kind of overfitting and can be termed as the color over-reliance problem. This problem would distract the model from other tiny but discriminative clues (e.g. clothes details, structural information, etc.), which are essential for both semantic alignment and fine-grained matching, and thus leads to a sub-optimal retrieval performance. To this end, in this paper, we propose a novel Bottom-up Color-independent Alignment Learning Framework (BCALF) for text- based person retrieval to tackle this problem in two folds, decoupling color-independent discrete local features and aggregating multiple key discrete features. We employ color-confused images as an auxiliary modality and perform discrete fine-grained semantic alignment where the minimal semantic units interact within the joint feature space to focus solely on content information. Furthermore, the multiple discrete local features are aggregated into more discriminative non-local decisive features. BCALF achieves semantic alignment from minimal semantic units to non-local aggregation units, which can be understood as a bottom-up process. Experimental results demonstrate that BCALF consistently outperforms previous methods and achieves the state-of-the-art performance on the CUHK-PEDES, ICFG-PEDES and RSTPReid datasets.
引用
收藏
页数:12
相关论文
共 36 条
  • [1] Cross-modal feature learning and alignment network for text-image person re-identification
    Huang, Bailiang
    Qi, Xiaolong
    Chen, Bin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
  • [2] Graph-based Consistent Reconstruction and Alignment for imbalanced text-image person re-identification
    Du, Guodong
    Gong, Tiantian
    Zhang, Liyan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
  • [3] Multimodal Feature Hierarchical Fusion for Text-Image Person Re-identification
    Li, Jiaxuan
    Huang, Likun
    Zhu, Chuanhu
    Zhang, Song
    Li, Qiang
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 468 - 481
  • [4] Learning Granularity-Unified Representations for Text-to-Image Person Re-identification
    Shao, Zhiyin
    Zhang, Xinyu
    Fang, Meng
    Lin, Zhifeng
    Wang, Jian
    Ding, Changxing
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5566 - 5574
  • [5] Recurrent matching networks of spatial alignment learning for person re-identification
    Lin, Lan
    Zhang, Dan
    Zheng, Xin
    Ye, Mao
    Guo, Jiuxia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33735 - 33755
  • [6] Recurrent matching networks of spatial alignment learning for person re-identification
    Lan Lin
    Dan Zhang
    Xin Zheng
    Mao Ye
    Jiuxia Guo
    Multimedia Tools and Applications, 2020, 79 : 33735 - 33755
  • [7] Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network
    Han, Guang
    Lin, Min
    Li, Ziyang
    Zhao, Haitao
    Kwong, Sam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6025 - 6036
  • [8] Adaptive image segmentation based on color clustering for person re-identification
    Lixia Zhang
    Kangshun Li
    Yan Zhang
    Yu Qi
    Lei Yang
    Soft Computing, 2017, 21 : 5729 - 5739
  • [9] Adaptive image segmentation based on color clustering for person re-identification
    Zhang, Lixia
    Li, Kangshun
    Zhang, Yan
    Qi, Yu
    Yang, Lei
    SOFT COMPUTING, 2017, 21 (19) : 5729 - 5739
  • [10] Unsupervised Person Re-Identification via Differentiated Color Perception Learning
    Chen, Feng
    Liu, Heng
    Tang, Jun
    Zhang, Yulin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 6011 - 6022