Bottom-up color-independent alignment learning for text-image person re-identification

被引：0

作者：

Du, Guodong ^{[1
]}

Zhu, Hanyue ^{[1
]}

Zhang, Liyan ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 138卷

关键词：

Text-based person retrieval; Person re-identification; Cross-modal retrieval; Color information;

D O I：

10.1016/j.engappai.2024.109421

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text-to-image person re-identification (TIReID) refers to identifying images of a person of interest from a large-scale person image database based on natural language descriptions. Most of existing methods generally rely heavily on color information when matching cross-modal data, which is a kind of overfitting and can be termed as the color over-reliance problem. This problem would distract the model from other tiny but discriminative clues (e.g. clothes details, structural information, etc.), which are essential for both semantic alignment and fine-grained matching, and thus leads to a sub-optimal retrieval performance. To this end, in this paper, we propose a novel Bottom-up Color-independent Alignment Learning Framework (BCALF) for text- based person retrieval to tackle this problem in two folds, decoupling color-independent discrete local features and aggregating multiple key discrete features. We employ color-confused images as an auxiliary modality and perform discrete fine-grained semantic alignment where the minimal semantic units interact within the joint feature space to focus solely on content information. Furthermore, the multiple discrete local features are aggregated into more discriminative non-local decisive features. BCALF achieves semantic alignment from minimal semantic units to non-local aggregation units, which can be understood as a bottom-up process. Experimental results demonstrate that BCALF consistently outperforms previous methods and achieves the state-of-the-art performance on the CUHK-PEDES, ICFG-PEDES and RSTPReid datasets.

引用

页数：12

共 36 条

[1] Cross-modal feature learning and alignment network for text-image person re-identification
Huang, Bailiang
Qi, Xiaolong
Chen, Bin
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
[2] Graph-based Consistent Reconstruction and Alignment for imbalanced text-image person re-identification
Du, Guodong
Gong, Tiantian
Zhang, Liyan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
[3] Multimodal Feature Hierarchical Fusion for Text-Image Person Re-identification
Li, Jiaxuan
Huang, Likun
Zhu, Chuanhu
Zhang, Song
Li, Qiang
PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 468 - 481
[4] Learning Granularity-Unified Representations for Text-to-Image Person Re-identification
Shao, Zhiyin
Zhang, Xinyu
Fang, Meng
Lin, Zhifeng
Wang, Jian
Ding, Changxing
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5566 - 5574
[5] Recurrent matching networks of spatial alignment learning for person re-identification
Lin, Lan
Zhang, Dan
Zheng, Xin
Ye, Mao
Guo, Jiuxia
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 33735 - 33755
[6] Recurrent matching networks of spatial alignment learning for person re-identification
Lan Lin
Dan Zhang
Xin Zheng
Mao Ye
Jiuxia Guo
Multimedia Tools and Applications, 2020, 79 : 33735 - 33755
[7] Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network
Han, Guang
Lin, Min
Li, Ziyang
Zhao, Haitao
Kwong, Sam
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6025 - 6036
[8] Adaptive image segmentation based on color clustering for person re-identification
Lixia Zhang
Kangshun Li
Yan Zhang
Yu Qi
Lei Yang
Soft Computing, 2017, 21 : 5729 - 5739
[9] Adaptive image segmentation based on color clustering for person re-identification
Zhang, Lixia
Li, Kangshun
Zhang, Yan
Qi, Yu
Yang, Lei
SOFT COMPUTING, 2017, 21 (19) : 5729 - 5739
[10] Unsupervised Person Re-Identification via Differentiated Color Perception Learning
Chen, Feng
Liu, Heng
Tang, Jun
Zhang, Yulin
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 6011 - 6022

← 1 2 3 4 →