GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引:0
作者
Liu, Yunxiang [1 ]
Ma, Haili [1 ]
Zhu, Jianlin [1 ]
Zhang, Qiangbo [1 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期
关键词
Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;
D O I
10.32604/cmc.2024.053710
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.
引用
收藏
页码:2963 / 2978
页数:16
相关论文
共 50 条
  • [41] A fast instance segmentation with one-stage multi-task deep neural network for autonomous driving
    Tseng, Kuo-Kun
    Lin, Jiangrui
    Chen, Chien-Ming
    Hassan, Mohammad Mehedi
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93
  • [42] Multi-Task Y-Shaped Graph Neural Network for Point Cloud Learning in Autonomous Driving
    Zou, Xiaofeng
    Li, Kenli
    Li, Yangfan
    Wei, Wei
    Chen, Cen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9568 - 9579
  • [43] DeepNOMA: A Unified Framework for NOMA Using Deep Multi-Task Learning
    Ye, Neng
    Li, Xiangming
    Yu, Hanxiao
    Zhao, Lian
    Liu, Wenjia
    Hou, Xiaolin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (04) : 2208 - 2225
  • [44] A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection
    Zhou, Chengju
    Wu, Meiqing
    Lam, Siew-Kei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 982 - 996
  • [45] OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data
    Zhang, Xiaoyu
    Xing, Yuting
    Sun, Kai
    Guo, Yike
    CANCERS, 2021, 13 (12)
  • [46] DMTMV: A Unified Learning Framework for Deep Multi-Task Multi-View Learning
    Wu, Yi-Feng
    Zhan, De-Chuan
    Jiang, Yuan
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 49 - 56
  • [47] Multi-task Network Embedding with Adaptive Loss Weighting
    Rizi, Fatemeh Salehi
    Granitzer, Michael
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, : 1 - 5
  • [48] Multi-Task Attention Network for Contactless Sleep Monitoring
    Siheng, Li
    Beihong, Jin
    Fusang, Zhang
    Zhi, Wang
    Junqi, Ma
    Chang, Su
    Xiaoyong, Ren
    Haiqin, Liu
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (11): : 2739 - 2753
  • [49] MTKDN: Multi-Task Knowledge Disentanglement Network for Recommendation
    Wu, Haotian
    Xing, Bowen
    Tsang, Ivor
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4360 - 4364
  • [50] Multi-task Transfer Learning for Bayesian Network Structures
    Benikhlef, Sarah
    Leray, Philippe
    Raschia, Guillaume
    Ben Messaoud, Montassar
    Sakly, Fayrouz
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2021, 2021, 12897 : 217 - 228