POSNet: a hybrid deep learning model for efficient person re-identification

被引:3
作者
Batool, Eliza [1 ]
Gillani, Saira [2 ]
Naz, Sheneela [3 ]
Bukhari, Maryam [1 ]
Maqsood, Muazzam [1 ]
Yeo, Sang-Soo [4 ]
Rho, Seungmin [5 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Attock Campus, Attock, Pakistan
[2] Univ Cent Punjab, Fac Informat Technol & Comp Sci, Lahore, Pakistan
[3] COMSATS Univ Islamabad, Dept Comp Sci, Islamabad, Pakistan
[4] Mokwon Univ, Dept Comp Engn, Daejeon 35349, South Korea
[5] Chung Ang Univ, Dept Ind Secur, Seoul 06974, South Korea
基金
新加坡国家研究基金会;
关键词
Person re-identification; POSNet; Hybrid deep learning; Spatio-temporal feature learning; Limited labeled data challenges; Intra- and inter-class variations; Soft-pool-assisted attentions; LSTM; RECOGNITION; NETWORK;
D O I
10.1007/s11227-023-05169-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Person re-identification refers to the process of recognizing a person across several non-overlapping cameras. It is becoming increasingly important in computer vision for real-world surveillance applications. However, the deployment of person-re-identification systems as a surveillance system raises various challenges in their performance. These challenges include limited labeled data, occlusions conditions, human body postures, as well as inter- and intra-class variations. Such challenges deteriorate the effectiveness of person-re-identification systems and lead to the extraction of less discriminative features. Hence, to address these problems, we proposed a hybrid deep learning model, namely POSNet (pseudo-labeled omni-scale network) for efficient person re-identification. The proposed method is referred to as a hybrid because it combines label estimate with modified omni-scale feature learning, i.e., spatiotemporal-assisted omni-scale feature extraction to accomplish person re-identification. To further enhance omni-scale feature learning, we have proposed soft-pool-assisted attention mechanisms during spatial learning. More precisely, soft-pool preserves more important features, and that features are further emphasized by spatial and channel attention layers. Following on, this omni-scale with soft-pool attention learning extracts the spatial information from all frames of videos, and later on, the temporal learning is incorporated using the LSTM model. To handle limited labeled data problems, the proposed hybrid model first assigns pseudo-labels to the unlabeled data and adopts a progressive learning strategy to retrain the model on both labeled and unlabeled data with improved feature extraction, i.e., modified omni-scale feature learning. Moreover, the proposed POSNet model is validated on two large video-based person re-identification datasets, namely MARS and DukeMTMC-Video. It is observed from the research findings that the proposed POSNet outperformed the existing studies with the highest mAP and rank@1 score of 83.7 and 90.3%, respectively.
引用
收藏
页码:13090 / 13118
页数:29
相关论文
共 96 条
  • [71] Few-Shot Deep Adversarial Learning for Video-Based Person Re-Identification
    Wu, Lin
    Wang, Yang
    Yin, Hongzhi
    Wang, Meng
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1233 - 1245
  • [72] Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning
    Wu, Yu
    Lin, Yutian
    Dong, Xuanyi
    Yan, Yan
    Ouyang, Wanli
    Yang, Yi
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5177 - 5186
  • [73] Learning from self-discrepancy via multiple co-teaching for cross-domain person re-identification
    Xiang, Suncheng
    Fu, Yuzhuo
    Guan, Mengyuan
    Liu, Ting
    [J]. MACHINE LEARNING, 2023, 112 (06) : 1923 - 1940
  • [74] Progressive learning with style transfer for distant domain adaptation
    Xiang, Suncheng
    Fu, Yuzhuo
    Liu, Ting
    [J]. IET IMAGE PROCESSING, 2020, 14 (14) : 3527 - 3535
  • [75] Hope: heatmap and offset for pose estimation
    Xiao, Jing
    Li, Haichao
    Qu, Guangzhuo
    Fujita, Hamido
    Cao, Yang
    Zhu, Jia
    Huang, Changqin
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (6) : 2937 - 2949
  • [76] XIE G, 2021, J HIGH ENERGY ASTROP, DOI DOI 10.1007/S00530-021-00752-2
  • [77] Semi-supervised person re-identification using multi-view clustering
    Xin, Xiaomeng
    Wang, Jinjun
    Xie, Ruji
    Zhou, Sanping
    Huang, Wenli
    Zheng, Nanning
    [J]. PATTERN RECOGNITION, 2019, 88 : 285 - 297
  • [78] Yadav A, 2020, ARXIV
  • [79] Small obstacles detection on roads scenes using semantic segmentation for the safe navigation of autonomous vehicles
    Yasmin, Sadaf
    Durrani, Mehr Yahya
    Gillani, Saira
    Bukhari, Maryam
    Maqsood, Muazzam
    Zghaibeh, Manaf
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [80] Deep Learning for Person Re-Identification: A Survey and Outlook
    Ye, Mang
    Shen, Jianbing
    Lin, Gaojie
    Xiang, Tao
    Shao, Ling
    Hoi, Steven C. H.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2872 - 2893