Joint discriminative representation learning for end-to-end person search

被引：23

作者：

Zhang, Pengcheng ^{[1
]}

Yu, Xiaohan ^{[2
,3
]}

Bai, Xiao ^{[1
]}

Wang, Chen ^{[1
]}

Zheng, Jin ^{[1
]}

Ning, Xin ^{[4
]}

机构：

[1] Beihang Univ, Jiangxi Res Inst, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing, Peoples R China

[2] Macquarie Univ, Sch Comp, Sydney, Australia

[3] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Australia

[4] Chinese Acad Sci, Inst Semicond, Beijing, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 147卷

基金：

美国国家科学基金会;

关键词：

Person search; Person re-identification; Part segmentation; Batch sampling; NETWORK;

D O I：

10.1016/j.patcog.2023.110053

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Person search simultaneously detects and retrieves a query person from uncropped scene images. Existing methods are either two-step or end-to-end. The former employs two standalone models for the two sub-tasks, while the latter conducts person search with a unified model. Despite encouraging progress, most existing end-to-end methods focus on balancing the model between detection and retrieval sub-tasks, while ignoring to enhance the learned representation for retrieval, which leads to inferior accuracy to two-step approaches. To that end, we propose a novel hierarchical framework that jointly optimizes instance-aware and part -aware embedding to enable discriminative representation learning. Specifically, we develop a region-of-interest cosegment (ROICoseg) module that captures part-aware information without requiring extra annotations to enable fine-grained discriminative representation. On top of that, a Contextual Instance Batch Sampling (CIBS) method is introduced to effectively employ contextual information for constructing training batches, thus facilitating effective instance-aware representation learning. We further introduce the first cross-door person search dataset (CDPS) that retrieves a target person in outdoor cameras with an indoor captured image or vice versa. Extensive experiments show that our proposed model achieves competitive performance on CUHK-SYSU and outperforms state-of-the-art end-to-end methods on the more challenging PRW and CDPS.1

引用

页数：11

共 50 条

[41] End-to-End Multi-Task Learning for Lung Nodule Segmentation and Diagnosis [J].

Chen, Wei ;

Wang, Qiuli ;

Yang, Dan ;

Zhang, Xiaohong ;

Liu, Chen ;

Li, Yucong .

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :6710-6717

[42] Unlocking efficiency: End-to-end optimization learning for recurrent facility operational planning [J].

Lin, Yun Hui ;

Yin, Xiao Feng ;

Tian, Qingyun .

TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 189

[43] Joint Person Objectness and Repulsion for Person Search [J].

Yao, Hantao ;

Xu, Changsheng .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :685-696

[44] An Improved Deep-Layer Architecture for Real-Time End-to-End Person Recognition System [J].

Jayavarthini, C. ;

Malathy, C. .

COMPUTERS & ELECTRICAL ENGINEERING, 2021, 96

[45] End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification [J].

Khatun, Amena ;

Denman, Simon ;

Sridharan, Sridha ;

Fookes, Clinton .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 :3803-3813

[46] Discriminative representation learning for person re-identification via multi-loss training [J].

Zhong, Weilin ;

Zhang, Tao ;

Jiang, Linfeng ;

Ji, Jinsheng ;

Zhang, Zenghui ;

Xiong, Huilin .

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 :267-278

[47] Joint Discriminative and Metric Embedding Learning for Person Re-identification [J].

Sabri, Sinan, I ;

Randhawa, Zaigham A. ;

Doretto, Gianfranco .

ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 :165-178

[48] Unknown Instance Learning for Person Search [J].

Yan, Lan ;

Li, Kenli .

2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,

[49] End-to-End Person Re-identification including Camera Zooming based on Meta-Analysis for Images [J].

Noguchi, Hirofumi ;

Isoda, Takuma ;

Arai, Seisuke .

2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, :3285-3290

[50] ResNet-WGAN-Based End-to-End Learning for IoV Communication With Unknown Channels [J].

Zhao, Junhui ;

Mu, Huiqin ;

Zhang, Qingmiao ;

Zhang, Huan .

IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (19) :17184-17192

← 1 2 3 4 5 →