A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection

被引:8
|
作者
Zhou, Chengju [1 ]
Wu, Meiqing [1 ]
Lam, Siew-Kei [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
Semantics; Task analysis; Computer architecture; Computational complexity; Robustness; Feature extraction; Neural networks; Multi-task learning; pedestrian detection; semantic segmentation; feature aggregation;
D O I
10.1109/TITS.2020.3019390
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
We present a unified multi-task learning architecture for fast and accurate pedestrian detection. Different from existing methods which often focus on either a new loss function or architecture, we propose an improved multi-task convolutional neural network learning architecture to effectively and efficiently interfuse the task of pedestrian detection and semantic segmentation. To achieve this, we integrate a lightweight semantic segmentation branch to Faster R-CNN detection framework that enables end-to-end hard parameter sharing in order to boost the detection performance and maintain computational efficiency as follows. Firstly, a Semantic Segmentation to Feature Module (SS2FM) refines the convolutional features in RPN stage by integrating the features generated from the semantic segmentation branch. Secondly, a Semantic Segmentation to Confidence Module (SS2CM) refines the classification confidence in RPN stage by fusing it with the semantic segmentation confidence. We also introduce an effective anchor matching point transform to alleviate the problem of feature misalignment for heavily occluded pedestrians. The proposed unified multi-task learning architecture lends itself well to more robust pedestrian detection in diverse scenarios with negligible computation overhead. In addition, the proposed architecture can achieve high detection performance with low resolution input images, which significantly reduces the computational complexity. Experiment results on CityPersons and Caltech datasets show that our method is the fastest among all state-of-the-art pedestrian detection methods while exhibiting competitive detection performance.
引用
收藏
页码:982 / 996
页数:15
相关论文
共 50 条
  • [31] Deep multi-task learning with flexible and compact architecture search
    Zhao, Jiejie
    Lv, Weifeng
    Du, Bowen
    Ye, Junchen
    Sun, Leilei
    Xiong, Guixi
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023, 15 (02) : 187 - 199
  • [32] Deep multi-task learning with flexible and compact architecture search
    Jiejie Zhao
    Weifeng Lv
    Bowen Du
    Junchen Ye
    Leilei Sun
    Guixi Xiong
    International Journal of Data Science and Analytics, 2023, 15 : 187 - 199
  • [33] A Unified Multi-Task Learning Framework for Joint Extraction of Entities and Relations
    Zhao, Tianyang
    Yan, Zhao
    Cao, Yunbo
    Li, Zhoujun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14524 - 14531
  • [34] DeepNOMA: A Unified Framework for NOMA Using Deep Multi-Task Learning
    Ye, Neng
    Li, Xiangming
    Yu, Hanxiao
    Zhao, Lian
    Liu, Wenjia
    Hou, Xiaolin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (04) : 2208 - 2225
  • [35] Removing Hidden Confounding in Recommendation: A Unified Multi-Task Learning Approach
    Li, Haoxuan
    Wu, Kunhan
    Zheng, Chunyuan
    Xiao, Yanghao
    Wang, Hao
    Geng, Zhi
    Feng, Fuli
    He, Xiangnan
    Wu, Peng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [36] MULTI-TASK LEARNING IMPROVES SYNTHETIC SPEECH DETECTION
    Mo, Yichuan
    Wang, Shilin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6392 - 6396
  • [37] Multi-Task Learning for Intrusion Detection on web logs
    Li, Bo
    Lin, Ying
    Zhang, Simin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2017, 81 : 92 - 100
  • [38] Multi-task Learning for Stance and Early Rumor Detection
    Chen, Yongheng
    Yin, Chunyan
    Zuo, Wanli
    OPTICAL MEMORY AND NEURAL NETWORKS, 2021, 30 (02) : 131 - 139
  • [39] Multi-task learning for object keypoints detection and classification
    Xu, Jie
    Zhao, Lin
    Zhang, Shanshan
    Gong, Chen
    Yang, Jian
    PATTERN RECOGNITION LETTERS, 2020, 130 : 182 - 188
  • [40] Interdependent Multi-task Learning for Simultaneous Segmentation and Detection
    Reginthala, Mahesh
    Iwahori, Yuji
    Bhuyan, M. K.
    Hayashi, Yoshitsugu
    Achariyaviriya, Witsarut
    Kijsirikul, Boonserm
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 167 - 174