A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection

Cited by: 8
Authors
Zhou, Chengju [1]
Wu, Meiqing [1]
Lam, Siew-Kei [1]
Affiliations
[1] Nanyang Technological University, School of Computer Science and Engineering, Singapore 639798, Singapore
Funding
National Research Foundation, Singapore
Keywords
Semantics; Task analysis; Computer architecture; Computational complexity; Robustness; Feature extraction; Neural networks; Multi-task learning; Pedestrian detection; Semantic segmentation; Feature aggregation
DOI
10.1109/TITS.2020.3019390
Chinese Library Classification (CLC) number
TU [Architectural Science]
Discipline classification code
0813
Abstract
We present a unified multi-task learning architecture for fast and accurate pedestrian detection. Unlike existing methods, which often focus on either a new loss function or a new architecture, we propose an improved multi-task convolutional neural network learning architecture that effectively and efficiently fuses the tasks of pedestrian detection and semantic segmentation. To achieve this, we integrate a lightweight semantic segmentation branch into the Faster R-CNN detection framework, enabling end-to-end hard parameter sharing that boosts detection performance while maintaining computational efficiency, as follows. First, a Semantic Segmentation to Feature Module (SS2FM) refines the convolutional features in the RPN stage by integrating the features generated by the semantic segmentation branch. Second, a Semantic Segmentation to Confidence Module (SS2CM) refines the classification confidence in the RPN stage by fusing it with the semantic segmentation confidence. We also introduce an effective anchor matching point transform to alleviate feature misalignment for heavily occluded pedestrians. The proposed architecture lends itself well to more robust pedestrian detection in diverse scenarios with negligible computational overhead. In addition, it achieves high detection performance with low-resolution input images, which significantly reduces computational complexity. Experimental results on the CityPersons and Caltech datasets show that our method is the fastest among all state-of-the-art pedestrian detection methods while exhibiting competitive detection performance.
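The two refinement steps named in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not state the fusion operators, so everything here is an assumption made for illustration: the function names `fuse_features` and `fuse_confidence`, the element-wise sum used for the SS2FM-style step, and the convex combination with weight `alpha` used for the SS2CM-style step are hypothetical and are not taken from the paper.

```python
import numpy as np


def sigmoid(x):
    """Map raw scores (logits) to probabilities in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def fuse_features(rpn_feat, seg_feat):
    """SS2FM-style step (assumed form): refine RPN features by adding
    segmentation-branch features of the same shape element-wise."""
    if rpn_feat.shape != seg_feat.shape:
        raise ValueError("feature maps must have identical shapes")
    return rpn_feat + seg_feat


def fuse_confidence(cls_logits, seg_logits, alpha=0.5):
    """SS2CM-style step (assumed form): blend the RPN classification
    probability with the segmentation probability at each location.
    alpha is a hypothetical mixing weight in [0, 1]."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * sigmoid(cls_logits) + (1.0 - alpha) * sigmoid(seg_logits)


# Toy inputs: 8 channels on a 4x4 spatial grid (shapes chosen arbitrarily).
rpn_feat = np.ones((8, 4, 4))
seg_feat = np.full((8, 4, 4), 0.5)
refined_feat = fuse_features(rpn_feat, seg_feat)

# A location the RPN is unsure about (logit 0) but the segmentation
# branch scores highly (logit 4) ends up with a raised confidence.
cls_logits = np.zeros((4, 4))
seg_logits = np.full((4, 4), 4.0)
refined_conf = fuse_confidence(cls_logits, seg_logits)
```

In this sketch both steps are cheap element-wise operations on maps the shared backbone has already produced, which is one way the "negligible computational overhead" claim could be realized; the actual modules in the paper may differ.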
Pages: 982 - 996
Number of pages: 15
Related papers
50 in total
  • [1] Enhanced Multi-Task Learning Architecture for Detecting Pedestrian at Far Distance
    Zhou, Chengju
    Wu, Meiqing
    Lam, Siew-Kei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15588 - 15604
  • [2] Fast and Accurate Multi-Task Learning for Encrypted Network Traffic Classification
    Park, Jee-Tae
    Shin, Chang-Yui
    Baek, Ui-Jun
    Kim, Myung-Sup
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [3] Multi-task infrared pedestrian detection method
    Zhang, Jianlong
    Liu, Chishuai
    Wang, Bin
    Chen, Chen
    He, Jianhui
    Zhou, Yang
    Li, Ji
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [4] MULTI-TASK LEARNING FOR PEDESTRIAN BODY PARTS DETECTION AND MULTI-ATTRIBUTE CLASSIFICATION
    Lou, Miaomiao
    Chen, Lin
    Guo, Feng
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 549 - 554
  • [5] Latent Multi-Task Architecture Learning
    Ruder, Sebastian
    Bingel, Joachim
    Augenstein, Isabelle
    Sogaard, Anders
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4822 - 4829
  • [6] An Efficient Multi-Task Network for Pedestrian Intrusion Detection
    Shi, Zhenyu
    He, Shibo
    Sun, Jingchen
    Chen, Tao
    Chen, Jiming
    Dong, Hairong
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01) : 649 - 660
  • [7] Multi-Task Deep Learning for Pedestrian Detection, Action Recognition and Time to Cross Prediction
    Pop, Danut Ovidiu
    Rogozan, Alexandrina
    Chatelain, Clement
    Nashashibi, Fawzi
    Bensrhair, Abdelaziz
    IEEE ACCESS, 2019, 7 : 149318 - 149327
  • [8] Unified Voice Embedding through Multi-task Learning
    Rajenthiran, Jenarthanan
    Sithamaparanathan, Lakshikka
    Uthayakumar, Saranya
    Thayasivam, Uthayasanker
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 178 - 183
  • [9] Fast and accurate driver action recognition with multi-task learning of driver pose and action
    Nishiyuki, Kenta
    Hyuga, Tadashi
    Tasaki, Hiroshi
    Kinoshita, Koichi
    Hasegawa, Yuki
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (02) : 1 - 10
  • [10] A Boosted Multi-Task Model for Pedestrian Detection With Occlusion Handling
    Zhu, Chao
    Peng, Yuxin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 5619 - 5629