Joint discriminative representation learning for end-to-end person search

被引:17
作者
Zhang, Pengcheng [1 ]
Yu, Xiaohan [2 ,3 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Zheng, Jin [1 ]
Ning, Xin [4 ]
机构
[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Macquarie Univ, Sch Comp, Sydney, Australia
[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia
[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;
D O I
10.1016/j.patcog.2023.110053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Sparse Bayesian Learning for End-to-End EEG Decoding
    Wang, Wenlong
    Qi, Feifei
    Wipf, David Paul
    Cai, Chang
    Yu, Tianyou
    Li, Yuanqing
    Zhang, Yu
    Yu, Zhuliang
    Wu, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15632 - 15649
  • [22] Detecting web attacks with end-to-end deep learning
    Pan, Yao
    Sun, Fangzhou
    Teng, Zhongwei
    White, Jules
    Schmidt, Douglas C.
    Staples, Jacob
    Krause, Lee
    JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2019, 10 (01)
  • [23] End-to-end person re-identification: Real-time video surveillance over edge-cloud environment
    Gaikwad, Bipin
    Karmakar, Abhijit
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 99
  • [24] An End-to-End Foreground-Aware Network for Person Re-Identification
    Liu, Yiheng
    Zhou, Wengang
    Liu, Jianzhuang
    Qi, Guo-Jun
    Tian, Qi
    Li, Houqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2060 - 2071
  • [25] Weakly supervised end-to-end domain adaptation for person re-identification
    Zhang, Lei
    Li, Haisheng
    Liu, Ruijun
    Wang, Xiaochuan
    Wu, Xiaoqun
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 113
  • [26] Deep Regression Neural Network for End-to-End Person Re-Identification
    Guo, Yingchun
    Zhao, Kunpeng
    Hao, Xiaoke
    Yu, Ming
    IEEE ACCESS, 2019, 7 : 92825 - 92837
  • [27] Person Search Based on Improved Joint Learning Network
    Zhang, Huimei
    Chen, Changhong
    Gan, Zongliang
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [28] End-to-End Video Surveillance Framework for Anomaly Detection and Person Re-identification
    Nandan, Rohan
    Lingeri, Rohan
    Mehta, Rohan
    Kanwal, Preet
    Atluri, Rishita
    DEEP LEARNING THEORY AND APPLICATIONS, PT I, DELTA 2024, 2024, 2171 : 328 - 339
  • [29] LEARNING DISCRIMINATIVE PART FEATURES THROUGH ATTENTIONS FOR EFFECTIVE AND SCALABLE PERSON SEARCH
    Park, Jicheol
    Jeong, Boseung
    Shin, Jongju
    Lee, Juyoung
    Kwak, Suha
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2376 - 2380
  • [30] Prototype-Guided Attention Distillation for Discriminative Person Search
    Kim, Hanjae
    Lee, Jiyoung
    Sohn, Kwanghoon
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 99 - 115