Graph-based Consistent Reconstruction and Alignment for imbalanced text-image person re-identification

被引:0
|
作者
Du, Guodong [1 ]
Gong, Tiantian [1 ]
Zhang, Liyan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Image-text retrieval; Cross-modal alignment; Modality imbalance; Robustness;
D O I
10.1016/j.eswa.2024.125429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-image person re-identification (TIReID) has emerged as a versatile approach for retrieving target pedestrians using textual descriptions. However, current TIReID research has been overly idealistic and has overlooked the issues of data incompleteness and modal imbalance in real-world application scenarios. Therefore, in this paper, we propose imbalanced text-image person re-identification (ITIReID) to address these problems. In comparison to TIReID, ITIReID contains a larger proportion of unimodal data, which leads to modal imbalance. The setting of ITIReID is more aligned with real-world scenarios, and studying ITIReID can expand the application scalability of TIReID. We propose a Graph-based Consistent Reconstruction and Alignment framework (GCRA), for ITIReID, which achieves modal balance by completing missing modality features for training implementation. By treating the accessible modality features as graph nodes, GCRA firstly builds an adjacency graph where a new semantic distance that establishes semantic relevance between nodes by comprehensively measuring both intra-modality and inter-modality correlation, serves as the measurement of graph's edges. GCRA further reconstructs the missing nodes - thus re-establishing missing modality features - using existing nodes connected with high semantic relevance. To ensure the reliability and effectiveness of reconstructed features, we propose a proxy-based identity constraint and a reconstruction constraint. In addition, to enable effective semantic alignment using both the reconstructed features and original features, we introduce a cross-modal semantic constraint. Extensive experiments demonstrate that GCRA can effectively handle issues of data incompleteness and modal imbalance, exhibiting its effectiveness and superiority.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Multi-view Based Pose Alignment Method for Person Re-identification
    Zhang, Yulei
    Zhao, Qingjie
    Li, You
    PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 439 - 447
  • [32] Learning consistent region features for lifelong person re-identification
    Huang, Jinze
    Yu, Xiaohan
    An, Dong
    Wei, Yaoguang
    Bai, Xiao
    Zheng, Jin
    Wang, Chen
    Zhou, Jun
    PATTERN RECOGNITION, 2023, 144
  • [33] Adaptive image segmentation based on color clustering for person re-identification
    Lixia Zhang
    Kangshun Li
    Yan Zhang
    Yu Qi
    Lei Yang
    Soft Computing, 2017, 21 : 5729 - 5739
  • [34] MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION
    Xu, Wenhao
    Shao, Zhiyin
    Ding, Changxing
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1680 - 1684
  • [35] Explainable graph-attention based person re-identification in outdoor conditions
    Behera, Nayan Kumar Subhashis
    Sa, Pankaj Kumar
    Bakshi, Sambit
    Bilotti, Umberto
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023,
  • [36] Adaptive image segmentation based on color clustering for person re-identification
    Zhang, Lixia
    Li, Kangshun
    Zhang, Yan
    Qi, Yu
    Yang, Lei
    SOFT COMPUTING, 2017, 21 (19) : 5729 - 5739
  • [37] Robust person re-identification via graph convolution networks
    Guisik Kim
    Dong Wook Shu
    Junseok Kwon
    Multimedia Tools and Applications, 2021, 80 : 29129 - 29138
  • [38] Robust person re-identification via graph convolution networks
    Kim, Guisik
    Shu, Dong Wook
    Kwon, Junseok
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29129 - 29138
  • [39] Spatial Preserved Graph Convolution Networks for Person Re-identification
    Li, Zhaoju
    Zhou, Zongwei
    Jiang, Nan
    Han, Zhenjun
    Xing, Junliang
    Jiao, Jianbin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
  • [40] A Weighted Center Graph Fusion Method for Person Re-Identification
    Geng, Shuze
    Yu, Ming
    Guo, Yingchun
    Yu, Yang
    IEEE ACCESS, 2019, 7 : 23329 - 23342