UF-Net: A unified network for panoptic driving perception with two-stage feature refinement

被引：0

作者：

Zhou, Zilong

Liu, Ping ^{[1
]}

Huang, Haibo ^{[1
]}

机构：

[1] Southwest Jiaotong Univ, Sch Mech Engn, Chengdu 610031, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 260卷

关键词：

Panoptic driving perception; Multi-task learning; Autonomous driving;

D O I：

10.1016/j.eswa.2024.125434

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The panoptic driving perception system stands as a pivotal element in autonomous driving, encapsulating the detection of traffic objects, the segmentation of drivable areas, and the identification of lane lines. In this work, we demonstrate the practicability of performing these perception tasks concurrently under heterogeneous dataset domains. We discern that these tasks, originating from diverse dataset domains, inherently possess both general and specific characteristics unique to each dataset. Inspired by this insight, we design UF-Net, a unified network for multiple perception tasks with a novel two-stage feature refinement strategy, meticulously engineered to investigate both task-universal and task-specific attributes. Specifically, at the first stage, by taking the images under various dataset domains as inputs, UF-Net learns the task-universal features and outputs coarse predictions, which serve as a foundational understanding of the commonalities that exist across various tasks. In addition, we propose a gradient homogenization surgery (GHS) to facilitate the optimization of task-shared parameters, thus mitigating the conflicting gradients stemming from the different dataset domains In the second stage, UF-Net implements an adaptive sharing scheme (ASS) to selectively expand task-specific parameters within the deep model, intelligently pinpointing and learning the optimal locations for this tailored expansion, thus fine-tuning the performance for each task. Benefiting from the proposed techniques, we acquire a unified yet efficient model architecture for multiple perception tasks in autonomous driving. Extensive experiments reveal that UF-Net surpasses current state-of-the-art methods in a variety of perception tasks with significantly reduced total storage requirements. In addition, we demonstrate that our proposed GHS and ASS are designed as generic modules that can be integrated into modern multi-task learning frameworks to enhance performance.

引用

页数：14

共 3 条

[1] GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception
Liu, Yunxiang
Ma, Haili
Zhu, Jianlin
Zhang, Qiangbo
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 2963 - 2978
[2] CO-Net plus plus : A Cohesive Network for Multiple Point Cloud Tasks at Once With Two-Stage Feature Rectification
Xie, Tao
Dai, Kun
Sun, Qihao
Jiang, Zhiqiang
Cao, Chuqing
Zhao, Lijun
Wang, Ke
Li, Ruifeng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10911 - 10928
[3] Efficient flexible voxel-based two-stage network for 3D object detection in autonomous driving
Sun, Fanyue
Tong, Guoxiang
Song, Yan
APPLIED SOFT COMPUTING, 2024, 162

← 1 →