Joint discriminative representation learning for end-to-end person search

被引:23
作者
Zhang, Pengcheng [1 ]
Yu, Xiaohan [2 ,3 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Zheng, Jin [1 ]
Ning, Xin [4 ]
机构
[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Macquarie Univ, Sch Comp, Sydney, Australia
[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia
[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;
D O I
10.1016/j.patcog.2023.110053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1
引用
收藏
页数:11
相关论文
共 50 条
[41]   End-to-End Multi-Task Learning for Lung Nodule Segmentation and Diagnosis [J].
Chen, Wei ;
Wang, Qiuli ;
Yang, Dan ;
Zhang, Xiaohong ;
Liu, Chen ;
Li, Yucong .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :6710-6717
[42]   Unlocking efficiency: End-to-end optimization learning for recurrent facility operational planning [J].
Lin, Yun Hui ;
Yin, Xiao Feng ;
Tian, Qingyun .
TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 189
[43]   Joint Person Objectness and Repulsion for Person Search [J].
Yao, Hantao ;
Xu, Changsheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :685-696
[44]   An Improved Deep-Layer Architecture for Real-Time End-to-End Person Recognition System [J].
Jayavarthini, C. ;
Malathy, C. .
COMPUTERS & ELECTRICAL ENGINEERING, 2021, 96
[45]   End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification [J].
Khatun, Amena ;
Denman, Simon ;
Sridharan, Sridha ;
Fookes, Clinton .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 :3803-3813
[46]   Discriminative representation learning for person re-identification via multi-loss training [J].
Zhong, Weilin ;
Zhang, Tao ;
Jiang, Linfeng ;
Ji, Jinsheng ;
Zhang, Zenghui ;
Xiong, Huilin .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 :267-278
[47]   Joint Discriminative and Metric Embedding Learning for Person Re-identification [J].
Sabri, Sinan, I ;
Randhawa, Zaigham A. ;
Doretto, Gianfranco .
ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 :165-178
[48]   Unknown Instance Learning for Person Search [J].
Yan, Lan ;
Li, Kenli .
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
[49]   End-to-End Person Re-identification including Camera Zooming based on Meta-Analysis for Images [J].
Noguchi, Hirofumi ;
Isoda, Takuma ;
Arai, Seisuke .
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, :3285-3290
[50]   ResNet-WGAN-Based End-to-End Learning for IoV Communication With Unknown Channels [J].
Zhao, Junhui ;
Mu, Huiqin ;
Zhang, Qingmiao ;
Zhang, Huan .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (19) :17184-17192