Joint discriminative representation learning for end-to-end person search

被引：23

作者：

Zhang, Pengcheng ^{[1
]}

Yu, Xiaohan ^{[2
,3
]}

Bai, Xiao ^{[1
]}

Wang, Chen ^{[1
]}

Zheng, Jin ^{[1
]}

Ning, Xin ^{[4
]}

机构：

[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China

[2] Macquarie Univ, Sch Comp, Sydney, Australia

[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia

[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 147卷

基金：

美国国家科学基金会;

关键词：

Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;

D O I：

10.1016/j.patcog.2023.110053

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1

引用

页数：11

共 50 条

[31] An end-to-end deep learning model for robust smooth filtering identification [J].

Zhang, Yujin ;

Yu, Luo ;

Fang, Zhijun ;

Xiong, Neal N. ;

Zhang, Lijun ;

Tian, Haiyue .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 :263-275

[32] End-to-End Streaming Video Temporal Action Segmentation With Reinforcement Learning [J].

Zhang, Jin-Rong ;

Wen, Wu-Jun ;

Liu, Sheng-Lan ;

Huang, Gao ;

Li, Yun-Heng ;

Li, Qi-Feng ;

Feng, Lin .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,

[33] A half-precision compressive sensing framework for end-to-end person re-identification [J].

Longlong Liao ;

Zhibang Yang ;

Qing Liao ;

Kenli Li ;

Keqin Li ;

Jie Liu ;

Qi Tian .

Neural Computing and Applications, 2020, 32 :1141-1155

[34] A half-precision compressive sensing framework for end-to-end person re-identification [J].

Liao, Longlong ;

Yang, Zhibang ;

Liao, Qing ;

Li, Kenli ;

Li, Keqin ;

Liu, Jie ;

Tian, Qi .

NEURAL COMPUTING & APPLICATIONS, 2020, 32 (04) :1141-1155

[35] Video-Based Person Re-Identification by an End-To-End Learning Architecture with Hybrid Deep Appearance-Temporal Feature [J].

Sun, Rui ;

Huang, Qiheng ;

Xia, Miaomiao ;

Zhang, Jun .

SENSORS, 2018, 18 (11)

[36] Learning adaptive shift and task decoupling for discriminative one-step person search [J].

Zhang, Qixian ;

Miao, Duoqian ;

Zhang, Qi ;

Wang, Changwei ;

Li, Yanping ;

Zhang, Hongyun ;

Zhao, Cairong .

KNOWLEDGE-BASED SYSTEMS, 2024, 304

[37] Enhancing scene understanding based on deep learning for end-to-end autonomous driving [J].

Hu, Jie ;

Kong, Huifang ;

Zhang, Qian ;

Liu, Runwu .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116

[38] Recognizing Multiple Text Sequences from an Image by Pure End-to-End Learning [J].

Xu, Zhenlong ;

Zhou, Shuigeng ;

Bai, Fan ;

Cheng, Zhanzhan ;

Niu, Yi ;

Pu, Shiliang .

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :7058-7065

[39] Unified medical image segmentation by learning from uncertainty in an end-to-end manner [J].

Tang, Pin ;

Yang, Pinli ;

Nie, Dong ;

Wu, Xi ;

Zhou, Jiliu ;

Wang, Yan .

KNOWLEDGE-BASED SYSTEMS, 2022, 241

[40] An end-to-end active learning framework for limited labelled hyperspectral image classification [J].

Karaca, Ali Can ;

Bilgin, Gokhan .

INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025, 46 (08) :3179-3206

← 1 2 3 4 5 →