GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引:0
作者
Liu, Yunxiang [1 ]
Ma, Haili [1 ]
Zhu, Jianlin [1 ]
Zhang, Qiangbo [1 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期
关键词
Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;
D O I
10.32604/cmc.2024.053710
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.
引用
收藏
页码:2963 / 2978
页数:16
相关论文
共 50 条
  • [1] YOLOPX: Anchor-free multi-task learning network for panoptic driving perception
    Zhan, Jiao
    Luo, Yarong
    Guo, Chi
    Wu, Yejun
    Meng, Jiawei
    Liu, Jingnan
    PATTERN RECOGNITION, 2024, 148
  • [2] A Multi-Task Network Based on Dual-Neck Structure for Autonomous Driving Perception
    Tan, Guopeng
    Wang, Chao
    Li, Zhihua
    Zhang, Yuanbiao
    Li, Ruikai
    SENSORS, 2024, 24 (05)
  • [3] Learning cross-task relations for panoptic driving perception
    Song, Zhanjie
    Zhao, Linqing
    PATTERN RECOGNITION LETTERS, 2023, 176 : 89 - 95
  • [4] Optimal Configuration of Multi-Task Learning for Autonomous Driving
    Jun, Woomin
    Son, Minjun
    Yoo, Jisang
    Lee, Sungjin
    SENSORS, 2023, 23 (24)
  • [5] Multi-Task Environmental Perception Methods for Autonomous Driving
    Liu, Ri
    Yang, Shubin
    Tang, Wansha
    Yuan, Jie
    Chan, Qiqing
    Yang, Yunchuan
    SENSORS, 2024, 24 (17)
  • [6] Real-Time Multi-task Network for Autonomous Driving
    Dat, Vu Thanh
    Bao, Ngo Viet Hoai
    Hung, Phan Duy
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 207 - 218
  • [7] UF-Net: A unified network for panoptic driving perception with two-stage feature refinement
    Zhou, Zilong
    Liu, Ping
    Huang, Haibo
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
  • [8] UMD-Net: A Unified Multi-Task Assistive Driving Network Based on Multimodal Fusion
    Liu, Wenzhuo
    Qiao, Yicheng
    Li, Zhiwei
    Wang, Wenshuo
    Zhang, Wei
    Zhu, Jiayin
    Jiang, Yanhuan
    Wang, Li
    Wang, Hong
    Liu, Huaping
    Wang, Kunfeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [9] Ehsinet: Efficient High-Order Spatial Interaction Multi-task Network for Adaptive Autonomous Driving Perception
    Jianjun Yao
    Yingzhao Li
    Chongjun Liu
    Ruizhuo Tang
    Neural Processing Letters, 2023, 55 : 11353 - 11370
  • [10] Ehsinet: Efficient High-Order Spatial Interaction Multi-task Network for Adaptive Autonomous Driving Perception
    Yao, Jianjun
    Li, Yingzhao
    Liu, Chongjun
    Tang, Ruizhuo
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11353 - 11370