UF-Net: A unified network for panoptic driving perception with two-stage feature refinement

被引:0
作者
Zhou, Zilong
Liu, Ping [1 ]
Huang, Haibo [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Mech Engn, Chengdu 610031, Peoples R China
关键词
Panoptic driving perception; Multi-task learning; Autonomous driving;
D O I
10.1016/j.eswa.2024.125434
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The panoptic driving perception system stands as a pivotal element in autonomous driving, encapsulating the detection of traffic objects, the segmentation of drivable areas, and the identification of lane lines. In this work, we demonstrate the practicability of performing these perception tasks concurrently under heterogeneous dataset domains. We discern that these tasks, originating from diverse dataset domains, inherently possess both general and specific characteristics unique to each dataset. Inspired by this insight, we design UF-Net, a unified network for multiple perception tasks with a novel two-stage feature refinement strategy, meticulously engineered to investigate both task-universal and task-specific attributes. Specifically, at the first stage, by taking the images under various dataset domains as inputs, UF-Net learns the task-universal features and outputs coarse predictions, which serve as a foundational understanding of the commonalities that exist across various tasks. In addition, we propose a gradient homogenization surgery (GHS) to facilitate the optimization of task-shared parameters, thus mitigating the conflicting gradients stemming from the different dataset domains In the second stage, UF-Net implements an adaptive sharing scheme (ASS) to selectively expand task-specific parameters within the deep model, intelligently pinpointing and learning the optimal locations for this tailored expansion, thus fine-tuning the performance for each task. Benefiting from the proposed techniques, we acquire a unified yet efficient model architecture for multiple perception tasks in autonomous driving. Extensive experiments reveal that UF-Net surpasses current state-of-the-art methods in a variety of perception tasks with significantly reduced total storage requirements. In addition, we demonstrate that our proposed GHS and ASS are designed as generic modules that can be integrated into modern multi-task learning frameworks to enhance performance.
引用
收藏
页数:14
相关论文
共 3 条
  • [1] GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception
    Liu, Yunxiang
    Ma, Haili
    Zhu, Jianlin
    Zhang, Qiangbo
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 2963 - 2978
  • [2] CO-Net plus plus : A Cohesive Network for Multiple Point Cloud Tasks at Once With Two-Stage Feature Rectification
    Xie, Tao
    Dai, Kun
    Sun, Qihao
    Jiang, Zhiqiang
    Cao, Chuqing
    Zhao, Lijun
    Wang, Ke
    Li, Ruifeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10911 - 10928
  • [3] Efficient flexible voxel-based two-stage network for 3D object detection in autonomous driving
    Sun, Fanyue
    Tong, Guoxiang
    Song, Yan
    APPLIED SOFT COMPUTING, 2024, 162