GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception

被引：0

作者：

Liu, Yunxiang ^{[1
]}

Ma, Haili ^{[1
]}

Zhu, Jianlin ^{[1
]}

Zhang, Qiangbo ^{[1
]}

机构：

[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 02期

关键词：

Autonomous driving; multitask learning; drivable area segmentation; lane detection; vehicle detection;

D O I：

10.32604/cmc.2024.053710

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To enhance the efficiency and accuracy of environmental perception for autonomous vehicles, we propose GDMNet, a unified multi-task perception network for autonomous driving, capable of performing drivable area segmentation, lane detection, and traffic object detection. Firstly, in the encoding stage, features are extracted, and Generalized Efficient Layer Aggregation Network (GELAN) is utilized to enhance feature extraction and gradient flow. Secondly, in the decoding stage, specialized detection heads are designed; the drivable area segmentation head employs DySample to expand feature maps, the lane detection head merges early-stage features and processes the output through the Focal Modulation Network (FMN). Lastly, the Minimum Point Distance IoU (MPDIoU) loss function is employed to compute the matching degree between traffic object detection boxes and predicted boxes, facilitating model training adjustments. Experimental results on the BDD100K dataset demonstrate that the proposed network achieves a drivable area segmentation mean intersection over union (mIoU) of 92.2%, lane detection accuracy and intersection over union (IoU) of 75.3% and 26.4%, respectively, and traffic object detection recall and mAP of 89.7% and 78.2%, respectively. The detection performance surpasses that of other single-task or multi-task algorithm models.

引用

页码：2963 / 2978

页数：16

共 50 条

[21] ConnectomeNet: A Unified Deep Neural Network Modeling Framework for Multi-Task Learning
Lim, Heechul
Chon, Kang-Wook
Kim, Min-Soo
IEEE ACCESS, 2023, 11 : 34297 - 34308
[22] Multi-task network embedding
Xu, Linchuan
Wei, Xiaokai
Cao, Jiannong
Yu, Philip S.
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2019, 8 (02) : 183 - 198
[23] A LiDAR-Based Dynamic Driving Scene Multi-task Segmentation Network
Wang, Hai
Li, Jianguo
Cai, Yingfeng
Chen, Long
Qiche Gongcheng/Automotive Engineering, 2024, 46 (09): : 1608 - 1616
[24] Multi-task learning for dangerous object detection in autonomous driving
Chen, Yaran
Zhao, Dongbin
Lv, Le
Zhang, Qichao
INFORMATION SCIENCES, 2018, 432 : 559 - 571
[25] LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles
Yan, Fuwu
Wang, Kewei
Zou, Bin
Tang, Luqi
Li, Wenbo
Lv, Chen
IEEE ACCESS, 2020, 8 : 86753 - 86764
[26] Unified Voice Embedding through Multi-task Learning
Rajenthiran, Jenarthanan
Sithamaparanathan, Lakshikka
Uthayakumar, Saranya
Thayasivam, Uthayasanker
2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 178 - 183
[27] Multi-Task Network Representation Learning
Xie, Yu
Jin, Peixuan
Gong, Maoguo
Zhang, Chen
Yu, Bin
FRONTIERS IN NEUROSCIENCE, 2020, 14
[28] Network Clustering for Multi-task Learning
Mu, Zhiying
Gao, Dehong
Guo, Sensen
NEURAL PROCESSING LETTERS, 2025, 57 (01)
[29] Multi-Task Assisted Driving Policy Learning Method for Autonomous Driving
Luo, Yutao
Xue, Zhicheng
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2024, 52 (10): : 31 - 40
[30] Scalable Parallel Task Scheduling for Autonomous Driving Using Multi-Task Deep Reinforcement Learning
Qi, Qi
Zhang, Lingxin
Wang, Jingyu
Sun, Haifeng
Zhuang, Zirui
Liao, Jianxin
Yu, F. Richard
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13861 - 13874

← 1 2 3 4 5 →