Real-Time Multi-task Network for Autonomous Driving

Cited by: 1
Authors
Dat, Vu Thanh [1]
Bao, Ngo Viet Hoai [1]
Hung, Phan Duy [1]
Affiliations
[1] FPT Univ, Comp Sci Dept, Hanoi, Vietnam
Source
ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I | 2022 / Vol. 1613
Keywords
Deep learning; Multi-task learning; Detection; Segmentation; Autonomous-driving;
DOI
10.1007/978-3-031-12638-3_18
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
End-to-end networks have become increasingly important for multi-tasking, especially in driving perception systems for autonomous driving. This work systematically introduces an end-to-end perception network for multi-tasking and proposes several key optimizations to improve accuracy. First, we propose an efficient segmentation head and box/class prediction networks based on a weighted bidirectional feature network. Second, we propose automatically customized anchors for each level in the weighted bidirectional feature network. Third, we propose an efficient training loss function. Based on these optimizations, we develop an end-to-end perception network that performs traffic object detection, drivable area segmentation, and lane detection simultaneously, and that achieves better accuracy than prior art. In particular, our network design achieves a state-of-the-art 77 mAP@.5 on the BDD100K dataset and outperforms prior work on lane detection with 0.293 mIoU, using 12.83 million parameters and 15.6 billion FLOPs. The network can perform visual perception tasks in real time and is thus a practical and accurate solution to the multi-tasking problem.
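The abstract reports lane detection quality as mIoU (mean intersection over union). As a rough illustration of how that metric is commonly computed, here is a minimal sketch in plain Python. It assumes predictions and ground truth are small 2-D label maps stored as nested lists; the function names `iou` and `mean_iou` and the toy data are hypothetical and are not taken from the paper or its evaluation code.

```python
def iou(pred, target, cls):
    """Intersection over union for one class label across two label maps."""
    inter = 0
    union = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p_hit = p == cls
            t_hit = t == cls
            inter += p_hit and t_hit   # pixel carries the class in both maps
            union += p_hit or t_hit    # pixel carries the class in either map
    # A class absent from both maps has no defined IoU.
    return inter / union if union else float("nan")


def mean_iou(pred, target, classes):
    """Average the per-class IoU, skipping classes with undefined IoU."""
    scores = [iou(pred, target, c) for c in classes]
    valid = [s for s in scores if s == s]  # NaN is the only value != itself
    return sum(valid) / len(valid) if valid else float("nan")


# Toy 2x3 label maps: 0 = background, 1 = lane (illustrative data only).
pred = [[0, 1, 1],
        [0, 0, 1]]
target = [[0, 1, 0],
          [0, 1, 1]]

lane_iou = iou(pred, target, 1)
miou = mean_iou(pred, target, [0, 1])
```

In this toy case each class overlaps on 2 pixels out of a union of 4, so both the lane IoU and the mean IoU come out to 0.5. Real benchmarks typically accumulate intersections and unions over the whole evaluation set before dividing, which this sketch does not do.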
Pages: 207-218
Page count: 12