GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引：0

作者：

Liu, Yunxiang ^{[1
]}

Ma, Haili ^{[1
]}

Zhu, Jianlin ^{[1
]}

Zhang, Qiangbo ^{[1
]}

机构：

[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期

关键词：

Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;

D O I：

10.32604/cmc.2024.053710

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.

引用

页码：2963 / 2978

页数：16

共 50 条

[1] YOLOPX: Anchor-free multi-task learning network for panoptic driving perception
Zhan, Jiao
Luo, Yarong
Guo, Chi
Wu, Yejun
Meng, Jiawei
Liu, Jingnan
PATTERN RECOGNITION, 2024, 148
[2] A Multi-Task Network Based on Dual-Neck Structure for Autonomous Driving Perception
Tan, Guopeng
Wang, Chao
Li, Zhihua
Zhang, Yuanbiao
Li, Ruikai
SENSORS, 2024, 24 (05)
[3] Learning cross-task relations for panoptic driving perception
Song, Zhanjie
Zhao, Linqing
PATTERN RECOGNITION LETTERS, 2023, 176 : 89 - 95
[4] Optimal Configuration of Multi-Task Learning for Autonomous Driving
Jun, Woomin
Son, Minjun
Yoo, Jisang
Lee, Sungjin
SENSORS, 2023, 23 (24)
[5] Multi-Task Environmental Perception Methods for Autonomous Driving
Liu, Ri
Yang, Shubin
Tang, Wansha
Yuan, Jie
Chan, Qiqing
Yang, Yunchuan
SENSORS, 2024, 24 (17)
[6] Real-Time Multi-task Network for Autonomous Driving
Dat, Vu Thanh
Bao, Ngo Viet Hoai
Hung, Phan Duy
ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 : 207 - 218
[7] UF-Net: A unified network for panoptic driving perception with two-stage feature refinement
Zhou, Zilong
Liu, Ping
Huang, Haibo
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
[8] UMD-Net: A Unified Multi-Task Assistive Driving Network Based on Multimodal Fusion
Liu, Wenzhuo
Qiao, Yicheng
Li, Zhiwei
Wang, Wenshuo
Zhang, Wei
Zhu, Jiayin
Jiang, Yanhuan
Wang, Li
Wang, Hong
Liu, Huaping
Wang, Kunfeng
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
[9] Ehsinet: Efficient High-Order Spatial Interaction Multi-task Network for Adaptive Autonomous Driving Perception
Jianjun Yao
Yingzhao Li
Chongjun Liu
Ruizhuo Tang
Neural Processing Letters, 2023, 55 : 11353 - 11370
[10] Ehsinet: Efficient High-Order Spatial Interaction Multi-task Network for Adaptive Autonomous Driving Perception
Yao, Jianjun
Li, Yingzhao
Liu, Chongjun
Tang, Ruizhuo
NEURAL PROCESSING LETTERS, 2023, 55 (08) : 11353 - 11370

← 1 2 3 4 5 →