Graph-based Consistent Reconstruction and Alignment for imbalanced text-image person re-identification

被引：0

作者：

Du, Guodong ^{[1
]}

Gong, Tiantian ^{[1
]}

Zhang, Liyan ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 260卷

基金：

中国国家自然科学基金;

关键词：

Person re-identification; Image-text retrieval; Cross-modal alignment; Modality imbalance; Robustness;

D O I：

10.1016/j.eswa.2024.125429

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text-image person re-identification (TIReID) has emerged as a versatile approach for retrieving target pedestrians using textual descriptions. However, current TIReID research has been overly idealistic and has overlooked the issues of data incompleteness and modal imbalance in real-world application scenarios. Therefore, in this paper, we propose imbalanced text-image person re-identification (ITIReID) to address these problems. In comparison to TIReID, ITIReID contains a larger proportion of unimodal data, which leads to modal imbalance. The setting of ITIReID is more aligned with real-world scenarios, and studying ITIReID can expand the application scalability of TIReID. We propose a Graph-based Consistent Reconstruction and Alignment framework (GCRA), for ITIReID, which achieves modal balance by completing missing modality features for training implementation. By treating the accessible modality features as graph nodes, GCRA firstly builds an adjacency graph where a new semantic distance that establishes semantic relevance between nodes by comprehensively measuring both intra-modality and inter-modality correlation, serves as the measurement of graph's edges. GCRA further reconstructs the missing nodes - thus re-establishing missing modality features - using existing nodes connected with high semantic relevance. To ensure the reliability and effectiveness of reconstructed features, we propose a proxy-based identity constraint and a reconstruction constraint. In addition, to enable effective semantic alignment using both the reconstructed features and original features, we introduce a cross-modal semantic constraint. Extensive experiments demonstrate that GCRA can effectively handle issues of data incompleteness and modal imbalance, exhibiting its effectiveness and superiority.

引用

页数：14

共 50 条

[1] Cross-modal feature learning and alignment network for text-image person re-identification
Huang, Bailiang
Qi, Xiaolong
Chen, Bin
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
[2] Multimodal Feature Hierarchical Fusion for Text-Image Person Re-identification
Li, Jiaxuan
Huang, Likun
Zhu, Chuanhu
Zhang, Song
Li, Qiang
PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 468 - 481
[3] Bottom-up color-independent alignment learning for text-image person re-identification
Du, Guodong
Zhu, Hanyue
Zhang, Liyan
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
[4] Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network
Han, Guang
Lin, Min
Li, Ziyang
Zhao, Haitao
Kwong, Sam
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6025 - 6036
[5] Spatial enhanced multi-level alignment learning for text-image person re-identification with coupled noisy labels
Zhao, Jiacheng
Che, Haojie
Li, Yongxi
MULTIMEDIA SYSTEMS, 2025, 31 (02)
[6] A Graph-Based Approach for Making Consensus-Based Decisions in Image Search and Person Re-Identification
Barman, Arko
Shah, Shishir K.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 753 - 765
[7] Graph-Based Local Feature Adaptation for Cross-Domain Person Re-Identification
Wang, Jun
IEEE ACCESS, 2022, 10 : 3017 - 3029
[8] Person Re-Identification Based on Graph Relation Learning
Wang, Hao
Bi, Xiaojun
NEURAL PROCESSING LETTERS, 2021, 53 (02) : 1401 - 1415
[9] Person Re-Identification Based on Graph Relation Learning
Hao Wang
Xiaojun Bi
Neural Processing Letters, 2021, 53 : 1401 - 1415
[10] Image-Text Person Re-Identification with Transformer-Based Modal Fusion
Li, Xin
Guo, Hubo
Zhang, Meiling
Fu, Bo
ELECTRONICS, 2025, 14 (03):

← 1 2 3 4 5 →