Joint discriminative representation learning for end-to-end person search

被引:23
作者
Zhang, Pengcheng [1 ]
Yu, Xiaohan [2 ,3 ]
Bai, Xiao [1 ]
Wang, Chen [1 ]
Zheng, Jin [1 ]
Ning, Xin [4 ]
机构
[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Macquarie Univ, Sch Comp, Sydney, Australia
[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia
[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;
D O I
10.1016/j.patcog.2023.110053
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1
引用
收藏
页数:11
相关论文
共 50 条
[31]   An end-to-end deep learning model for robust smooth filtering identification [J].
Zhang, Yujin ;
Yu, Luo ;
Fang, Zhijun ;
Xiong, Neal N. ;
Zhang, Lijun ;
Tian, Haiyue .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 :263-275
[32]   End-to-End Streaming Video Temporal Action Segmentation With Reinforcement Learning [J].
Zhang, Jin-Rong ;
Wen, Wu-Jun ;
Liu, Sheng-Lan ;
Huang, Gao ;
Li, Yun-Heng ;
Li, Qi-Feng ;
Feng, Lin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
[33]   A half-precision compressive sensing framework for end-to-end person re-identification [J].
Longlong Liao ;
Zhibang Yang ;
Qing Liao ;
Kenli Li ;
Keqin Li ;
Jie Liu ;
Qi Tian .
Neural Computing and Applications, 2020, 32 :1141-1155
[34]   A half-precision compressive sensing framework for end-to-end person re-identification [J].
Liao, Longlong ;
Yang, Zhibang ;
Liao, Qing ;
Li, Kenli ;
Li, Keqin ;
Liu, Jie ;
Tian, Qi .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (04) :1141-1155
[35]   Video-Based Person Re-Identification by an End-To-End Learning Architecture with Hybrid Deep Appearance-Temporal Feature [J].
Sun, Rui ;
Huang, Qiheng ;
Xia, Miaomiao ;
Zhang, Jun .
SENSORS, 2018, 18 (11)
[36]   Learning adaptive shift and task decoupling for discriminative one-step person search [J].
Zhang, Qixian ;
Miao, Duoqian ;
Zhang, Qi ;
Wang, Changwei ;
Li, Yanping ;
Zhang, Hongyun ;
Zhao, Cairong .
KNOWLEDGE-BASED SYSTEMS, 2024, 304
[37]   Enhancing scene understanding based on deep learning for end-to-end autonomous driving [J].
Hu, Jie ;
Kong, Huifang ;
Zhang, Qian ;
Liu, Runwu .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
[38]   Recognizing Multiple Text Sequences from an Image by Pure End-to-End Learning [J].
Xu, Zhenlong ;
Zhou, Shuigeng ;
Bai, Fan ;
Cheng, Zhanzhan ;
Niu, Yi ;
Pu, Shiliang .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :7058-7065
[39]   Unified medical image segmentation by learning from uncertainty in an end-to-end manner [J].
Tang, Pin ;
Yang, Pinli ;
Nie, Dong ;
Wu, Xi ;
Zhou, Jiliu ;
Wang, Yan .
KNOWLEDGE-BASED SYSTEMS, 2022, 241
[40]   An end-to-end active learning framework for limited labelled hyperspectral image classification [J].
Karaca, Ali Can ;
Bilgin, Gokhan .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025, 46 (08) :3179-3206