A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection

被引:8
|
作者
Zhou, Chengju [1 ]
Wu, Meiqing [1 ]
Lam, Siew-Kei [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
Semantics; Task analysis; Computer architecture; Computational complexity; Robustness; Feature extraction; Neural networks; Multi-task learning; pedestrian detection; semantic segmentation; feature aggregation;
D O I
10.1109/TITS.2020.3019390
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
We present a unified multi-task learning architecture for fast and accurate pedestrian detection. Different from existing methods which often focus on either a new loss function or architecture, we propose an improved multi-task convolutional neural network learning architecture to effectively and efficiently interfuse the task of pedestrian detection and semantic segmentation. To achieve this, we integrate a lightweight semantic segmentation branch to Faster R-CNN detection framework that enables end-to-end hard parameter sharing in order to boost the detection performance and maintain computational efficiency as follows. Firstly, a Semantic Segmentation to Feature Module (SS2FM) refines the convolutional features in RPN stage by integrating the features generated from the semantic segmentation branch. Secondly, a Semantic Segmentation to Confidence Module (SS2CM) refines the classification confidence in RPN stage by fusing it with the semantic segmentation confidence. We also introduce an effective anchor matching point transform to alleviate the problem of feature misalignment for heavily occluded pedestrians. The proposed unified multi-task learning architecture lends itself well to more robust pedestrian detection in diverse scenarios with negligible computation overhead. In addition, the proposed architecture can achieve high detection performance with low resolution input images, which significantly reduces the computational complexity. Experiment results on CityPersons and Caltech datasets show that our method is the fastest among all state-of-the-art pedestrian detection methods while exhibiting competitive detection performance.
引用
收藏
页码:982 / 996
页数:15
相关论文
共 50 条
  • [21] Multi-task learning for video anomaly detection*
    Chang, Xingya
    Zhang, Yuxin
    Xue, Dingyu
    Chen, Dongyue
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [22] Multi-task learning for video anomaly detection
    Chang, Xingya
    Zhang, Yuxin
    Xue, Dingyu
    Chen, Dongyue
    Journal of Visual Communication and Image Representation, 2022, 87
  • [23] Automatic Cataract Detection with Multi-Task Learning
    Wu, Hongjie
    Lv, Jiancheng
    Wang, Jian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [24] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [25] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [26] PAR Contest 2023: Pedestrian Attributes Recognition with Multi-task Learning
    Greco, Antonio
    Vento, Bruno
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2023, PT I, 2023, 14184 : 3 - 12
  • [27] A novel high accuracy fast gas detection algorithm based on multi-task learning
    Wang, Xue
    Zhao, Wenlong
    Ma, Ruilong
    Zhuo, Junwei
    Zeng, Yuanhu
    Wu, Pengcheng
    Chu, Jin
    MEASUREMENT, 2024, 228
  • [28] YOLO-FD: An accurate fish disease detection method based on multi-task learning
    Li, Xuefei
    Zhao, Shili
    Chen, Chunlin
    Cui, Hongwu
    Li, Daoliang
    Zhao, Ran
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [29] A Multi-Task Representation Learning Architecture for Enhanced Graph Classification
    Xie, Yu
    Gong, Maoguo
    Gao, Yuan
    Qin, A. K.
    Fan, Xiaolong
    FRONTIERS IN NEUROSCIENCE, 2020, 13
  • [30] Unified Transformer Multi-Task Learning for Intent Classification With Entity Recognition
    Benayas Alamos, Alberto Jose
    Hashempou, Reyhaneh
    Rumble, Damian
    Jameel, Shoaib
    De Amorim, Renato Cordeiro
    IEEE ACCESS, 2021, 9 : 147306 - 147314