Joint discriminative representation learning for end-to-end person search

被引:23
作者
Zhang, Pengcheng [1 ]
Yu, Xiaohan [2 ,3 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Zheng, Jin [1 ]
Ning, Xin [4 ]
机构
[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Macquarie Univ, Sch Comp, Sydney, Australia
[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia
[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;
D O I
10.1016/j.patcog.2023.110053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1
引用
收藏
页数:11
相关论文
共 50 条
[21]   Sparse Bayesian Learning for End-to-End EEG Decoding [J].
Wang, Wenlong ;
Qi, Feifei ;
Wipf, David Paul ;
Cai, Chang ;
Yu, Tianyou ;
Li, Yuanqing ;
Zhang, Yu ;
Yu, Zhuliang ;
Wu, Wei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) :15632-15649
[22]   End-to-end person re-identification: Real-time video surveillance over edge-cloud environment [J].
Gaikwad, Bipin ;
Karmakar, Abhijit .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 99
[23]   Detecting web attacks with end-to-end deep learning [J].
Pan, Yao ;
Sun, Fangzhou ;
Teng, Zhongwei ;
White, Jules ;
Schmidt, Douglas C. ;
Staples, Jacob ;
Krause, Lee .
JOURNAL OF INTERNET SERVICES AND APPLICATIONS, 2019, 10 (01)
[24]   An End-to-End Foreground-Aware Network for Person Re-Identification [J].
Liu, Yiheng ;
Zhou, Wengang ;
Liu, Jianzhuang ;
Qi, Guo-Jun ;
Tian, Qi ;
Li, Houqiang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2060-2071
[25]   Weakly supervised end-to-end domain adaptation for person re-identification [J].
Zhang, Lei ;
Li, Haisheng ;
Liu, Ruijun ;
Wang, Xiaochuan ;
Wu, Xiaoqun .
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 113
[26]   Deep Regression Neural Network for End-to-End Person Re-Identification [J].
Guo, Yingchun ;
Zhao, Kunpeng ;
Hao, Xiaoke ;
Yu, Ming .
IEEE ACCESS, 2019, 7 :92825-92837
[27]   Person Search Based on Improved Joint Learning Network [J].
Zhang, Huimei ;
Chen, Changhong ;
Gan, Zongliang .
PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
[28]   End-to-End Video Surveillance Framework for Anomaly Detection and Person Re-identification [J].
Nandan, Rohan ;
Lingeri, Rohan ;
Mehta, Rohan ;
Kanwal, Preet ;
Atluri, Rishita .
DEEP LEARNING THEORY AND APPLICATIONS, PT I, DELTA 2024, 2024, 2171 :328-339
[29]   LEARNING DISCRIMINATIVE PART FEATURES THROUGH ATTENTIONS FOR EFFECTIVE AND SCALABLE PERSON SEARCH [J].
Park, Jicheol ;
Jeong, Boseung ;
Shin, Jongju ;
Lee, Juyoung ;
Kwak, Suha .
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, :2376-2380
[30]   Prototype-Guided Attention Distillation for Discriminative Person Search [J].
Kim, Hanjae ;
Lee, Jiyoung ;
Sohn, Kwanghoon .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) :99-115