A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection

被引：8

作者：

Zhou, Chengju ^{[1
]}

Wu, Meiqing ^{[1
]}

Lam, Siew-Kei ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 02期

基金：

新加坡国家研究基金会;

关键词：

Semantics; Task analysis; Computer architecture; Computational complexity; Robustness; Feature extraction; Neural networks; Multi-task learning; pedestrian detection; semantic segmentation; feature aggregation;

D O I：

10.1109/TITS.2020.3019390

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

We present a unified multi-task learning architecture for fast and accurate pedestrian detection. Different from existing methods which often focus on either a new loss function or architecture, we propose an improved multi-task convolutional neural network learning architecture to effectively and efficiently interfuse the task of pedestrian detection and semantic segmentation. To achieve this, we integrate a lightweight semantic segmentation branch to Faster R-CNN detection framework that enables end-to-end hard parameter sharing in order to boost the detection performance and maintain computational efficiency as follows. Firstly, a Semantic Segmentation to Feature Module (SS2FM) refines the convolutional features in RPN stage by integrating the features generated from the semantic segmentation branch. Secondly, a Semantic Segmentation to Confidence Module (SS2CM) refines the classification confidence in RPN stage by fusing it with the semantic segmentation confidence. We also introduce an effective anchor matching point transform to alleviate the problem of feature misalignment for heavily occluded pedestrians. The proposed unified multi-task learning architecture lends itself well to more robust pedestrian detection in diverse scenarios with negligible computation overhead. In addition, the proposed architecture can achieve high detection performance with low resolution input images, which significantly reduces the computational complexity. Experiment results on CityPersons and Caltech datasets show that our method is the fastest among all state-of-the-art pedestrian detection methods while exhibiting competitive detection performance.

引用

页码：982 / 996

页数：15

共 50 条

[1] Enhanced Multi-Task Learning Architecture for Detecting Pedestrian at Far Distance
Zhou, Chengju
Wu, Meiqing
Lam, Siew-Kei
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15588 - 15604
[2] Unified Transformer Multi-Task Learning for Intent Classification With Entity Recognition
Benayas Alamos, Alberto Jose
Hashempou, Reyhaneh
Rumble, Damian
Jameel, Shoaib
De Amorim, Renato Cordeiro
IEEE ACCESS, 2021, 9 : 147306 - 147314
[3] Multi-task infrared pedestrian detection method
Zhang, Jianlong
Liu, Chishuai
Wang, Bin
Chen, Chen
He, Jianhui
Zhou, Yang
Li, Ji
INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
[4] A Boosted Multi-Task Model for Pedestrian Detection With Occlusion Handling
Zhu, Chao
Peng, Yuxin
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 5619 - 5629
[5] Multi-Modal Meta Multi-Task Learning for Social Media Rumor Detection
Zhang, Huaiwen
Qian, Shengsheng
Fang, Quan
Xu, Changsheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1449 - 1459
[6] A Multi-Task Framework for Infrared Small Target Detection and Segmentation
Chen, Yuhang
Li, Liyuan
Liu, Xin
Su, Xiaofeng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7] MULTI-TASK LEARNING FOR VOICE TRIGGER DETECTION
Sigtia, Siddharth
Clark, Pascal
Haynes, Rob
Richards, Hywel
Bridle, John
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7449 - 7453
[8] Radar Target Detection With Multi-Task Learning in Heterogeneous Environment
Jing, He
Cheng, Yongqiang
Wu, Hao
Wang, Hongqiang
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[9] YOLO-FD: An accurate fish disease detection method based on multi-task learning
Li, Xuefei
Zhao, Shili
Chen, Chunlin
Cui, Hongwu
Li, Daoliang
Zhao, Ran
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[10] Multi-Task Deep Learning for Pedestrian Detection, Action Recognition and Time to Cross Prediction
Pop, Danut Ovidiu
Rogozan, Alexandrina
Chatelain, Clement
Nashashibi, Fawzi
Bensrhair, Abdelaziz
IEEE ACCESS, 2019, 7 : 149318 - 149327

← 1 2 3 4 5 →