A review on video person re-identification based on deep learning

Cited by: 3

Authors:

Ma, Haifei ^{[1,3]}

Zhang, Canlong ^{[1,2]}

Zhang, Yifeng ^{[1]}

Li, Zhixin ^{[1,2]}

Wang, Zhiwen ^{[4]}

Wei, Chunrong ^{[1]}

Affiliations:

[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin, Peoples R China

[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Peoples R China

[3] Guangdong Univ Sci & Technol, Dongguan, Peoples R China

[4] Guangxi Univ Sci & Technol, Sch Elect Engn, Liuzhou, Peoples R China

Source:

NEUROCOMPUTING | 2024 / Volume 609

Funding:

U.S. National Science Foundation;

Keywords:

Video-based person ReID; Temporal learning; Literature survey and perspectives; Attention mechanism; Convolutional neural network; Unsupervised domain adaptation; Neural network;

DOI:

10.1016/j.neucom.2024.128479

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Person re-identification (ReID) is an essential technology for matching a person across non-overlapping cameras. It has attracted increasing attention in recent years because of its wide range of real-world applications, such as security surveillance and criminal investigation. Unlike other person ReID tasks, video-based ReID uses a video clip as the retrieval input, which can yield better ReID performance because video carries rich information on appearance, motion cues, and pose variations along the temporal dimension. Over the last few years, many deep learning-based video person ReID methods have been proposed to address challenges such as illumination variation, complex backgrounds, and occlusion. To provide a more comprehensive and readable review of existing video-based person ReID methods, we propose a novel taxonomy that examines existing methods from four perspectives: data, algorithms, computing power, and applications. Specifically, we first introduce popular datasets and the evaluation criteria used for video-based person ReID. Next, from the perspective of limited data and scarce annotation, we introduce data augmentation and unsupervised learning for ReID. From the algorithm perspective, we review supervised methods, including spatial, temporal, and spatio-temporal feature learning, and conduct a systematic comparison among these approaches. From the perspective of complex open-world applications, we summarize domain adaptation and multimodal ReID. From the perspective of insufficient GPU computing power, we discuss modality-agnostic unified large-scale ReID and its lightweight design. Finally, we discuss open problems and potential research directions for the community.


Pages: 17

