Multi deep invariant feature learning for cross-resolution person re-identification

Cited by: 0

Authors:

Zhang, Weicheng [1,2]

Xiong, Shuhua [1]

He, Xiaohai [1]

Wu, Xiaohong [1]

He, Jie [2]

Chen, Honggang [1,3]

Affiliations:

[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Peoples R China

[2] Wuzhou Univ, Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543002, Peoples R China

[3] Yunnan Univ, Yunnan Key Lab Software Engn, Kunming 650600, Peoples R China

Source:

INFORMATION PROCESSING & MANAGEMENT | 2024 / Vol. 61 / Issue 04

Funding:

National Natural Science Foundation of China;

Keywords:

Cross-resolution person re-identification; Invariant feature learning; Feature reconstruction; Dual-stream input; Deep learning;

DOI:

10.1016/j.ipm.2024.103764

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Person re-identification (Re-ID) is focused on identifying and matching the same pedestrian when captured across various surveillance cameras. However, variations in camera performance and the distances between pedestrians and cameras may result in capturing images of the same person at different resolutions. This specific issue, referred to as cross-resolution person re-identification (CRReID), presents considerable difficulties in achieving accurate person ReID. To address this issue, we propose a Multi-level and scale Deep Invariant Feature learning Framework (MDIFF), which effectively tackles the problem of cross-resolution person matching. Our MDIFF reconstructs person features at the shallow layers to alleviate information gaps and extracts resolution-invariant features at the deep layers for cross-resolution person matching. First, to mitigate information loss in resolution-invariant features, we propose a Dual Input Feature Reconstruction (DIFR) structure incorporating dual-stream input and a lightweight decoder, constrained by degradation loss and image reconstruction loss. Second, we propose a Multi-level Global-Local feature Interaction and Fusion (MGLIF) module to enhance the invariant features of persons and obtain deep invariant representations, making the final representation more robust to resolution changes and more discriminative. Finally, to make the feature distribution of the same identity more compact across different resolutions, we propose a cross-resolution joint loss optimization strategy, including cross-resolution triplet loss, cross-resolution center loss, and identity loss. Our comprehensive experimental results demonstrate the superior performance and efficacy of our MDIFF, outperforming current state-of-the-art methods across various CRReID benchmark datasets. Our code is available at https://github.com/MiSanl/MDIFF-for-CRReID.
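Illustrative note (not part of the original record): the abstract names a joint objective built from a cross-resolution triplet loss, a cross-resolution center loss, and an identity loss, but gives no formulas. The sketch below shows one plausible reading of such an objective in plain Python. Everything in it is an assumption made for illustration: the function names, the batch-hard mining rule, the use of per-batch centers rather than learned ones, the margin, and the loss weights. It is not the authors' implementation, which is in the repository linked above.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two feature vectors given as lists."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cross_resolution_triplet_loss(hr_feats, lr_feats, hr_ids, lr_ids, margin=0.3):
    """Assumed form: each high-resolution (HR) anchor is compared with its
    hardest low-resolution (LR) positive and hardest LR negative."""
    losses = []
    for anchor, pid in zip(hr_feats, hr_ids):
        pos = [euclidean(anchor, f) for f, q in zip(lr_feats, lr_ids) if q == pid]
        neg = [euclidean(anchor, f) for f, q in zip(lr_feats, lr_ids) if q != pid]
        if not pos or not neg:
            continue  # anchor has no valid cross-resolution triplet in this batch
        losses.append(max(0.0, max(pos) - min(neg) + margin))
    return sum(losses) / len(losses) if losses else 0.0

def cross_resolution_center_loss(hr_feats, lr_feats, hr_ids, lr_ids):
    """Assumed form: mean squared distance of every feature (HR and LR) to the
    batch center of its identity, so both resolutions share one center."""
    feats = list(hr_feats) + list(lr_feats)
    ids = list(hr_ids) + list(lr_ids)
    centers = {}
    for pid in set(ids):
        members = [f for f, q in zip(feats, ids) if q == pid]
        centers[pid] = [sum(col) / len(members) for col in zip(*members)]
    return sum(euclidean(f, centers[q]) ** 2 for f, q in zip(feats, ids)) / len(feats)

def identity_loss(logits, label):
    """Softmax cross-entropy for one sample (numerically stabilised)."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]

def joint_loss(hr_feats, lr_feats, hr_ids, lr_ids, logits, labels,
               w_tri=1.0, w_cen=1.0, w_id=1.0):
    """Weighted sum of the three terms; the weights are hypothetical."""
    tri = cross_resolution_triplet_loss(hr_feats, lr_feats, hr_ids, lr_ids)
    cen = cross_resolution_center_loss(hr_feats, lr_feats, hr_ids, lr_ids)
    idl = sum(identity_loss(z, y) for z, y in zip(logits, labels)) / len(labels)
    return w_tri * tri + w_cen * cen + w_id * idl
```

In this reading, the triplet and center terms both pull HR and LR features of the same identity together, which matches the abstract's stated goal of a more compact per-identity feature distribution across resolutions; the identity term keeps the features discriminative.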

Pages: 18
