CO-Net plus plus : A Cohesive Network for Multiple Point Cloud Tasks at Once With Two-Stage Feature Rectification

被引:0
作者
Xie, Tao [1 ,2 ,3 ]
Dai, Kun [1 ]
Sun, Qihao [1 ]
Jiang, Zhiqiang [1 ]
Cao, Chuqing [3 ]
Zhao, Lijun [1 ,2 ,3 ]
Wang, Ke [1 ,2 ]
Li, Ruifeng [1 ]
机构
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
[2] Harbin Inst Technol, Zhengzhou Res Inst, Harbin 150006, Peoples R China
[3] Wuhu HIT Robot Ind Technol Res Inst Co Ltd, Wuhu 241000, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; multi-task learning; point cloud tasks; two-stage feature rectification;
D O I
10.1109/TPAMI.2024.3447008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present CO-Net++, a cohesive framework that optimizes multiple point cloud tasks collectively across heterogeneous dataset domains with a two-stage feature rectification strategy. The core of CO-Net++ lies in optimizing task-shared parameters to capture universal features across various tasks while discerning task-specific parameters tailored to encapsulate the unique characteristics of each task. Specifically, CO-Net++ develops a two-stage feature rectification strategy (TFRS) that distinctly separates the optimization processes for task-shared and task-specific parameters. At the first stage, TFRS configures all parameters in backbone as task-shared, which encourages CO-Net++ to thoroughly assimilate universal attributes pertinent to all tasks. In addition, TFRS introduces a sign-based gradient surgery to facilitate the optimization of task-shared parameters, thus alleviating conflicting gradients induced by various dataset domains. In the second stage, TFRS freezes task-shared parameters and flexibly integrates task-specific parameters into the network for encoding specific characteristics of each dataset domain. CO-Net++ prominently mitigates conflicting optimization caused by parameter entanglement, ensuring the sufficient identification of universal and specific features. Extensive experiments reveal that CO-Net++ realizes exceptional performances on both 3D object detection and 3D semantic segmentation tasks. Moreover, CO-Net++ delivers an impressive incremental learning capability and prevents catastrophic amnesia when generalizing to new point cloud tasks.
引用
收藏
页码:10911 / 10928
页数:18
相关论文
共 6 条
  • [1] Two-stage U-Net plus plus for Medical Image Segmentation
    Al Suman, Abdulla
    Sarda, Shubham
    Asikuzzaman, Md
    Webb, Alexandra Louise
    Diana, M. Perriman
    Tahtali, Murat
    Di Ieva, Antonio
    Pickering, Mark R.
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 260 - 265
  • [2] PCRMLP: A Two-Stage Network for Point Cloud Registration in Urban Scenes
    Liu, Jingyang
    Xu, Yucheng
    Zhou, Lu
    Sun, Lei
    SENSORS, 2023, 23 (12)
  • [3] Two-Stage Point Cloud Registration Framework Based on Graph Neural Network and Attention
    Zhang, Xiaoqian
    Li, Junlin
    Zhang, Wei
    Xu, Yansong
    Li, Feng
    ELECTRONICS, 2024, 13 (03)
  • [4] UF-Net: A unified network for panoptic driving perception with two-stage feature refinement
    Zhou, Zilong
    Liu, Ping
    Huang, Haibo
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
  • [5] Two-stage graph matching point cloud registration method based on graph attention network
    Guo, Jiacheng
    Liu, Xuejun
    Zhang, Shuo
    Yan, Yong
    Sha, Yun
    Jiang, Yinan
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
  • [6] PointAttentionVLAD: A Two-Stage Self-Attention Network for Point Cloud-Based Place Recognition Task
    Yi, Yanjiang
    Fu, Chuanmao
    Zhang, Weizhe
    Wang, Hongbo
    IEEE ACCESS, 2024, 12 : 65192 - 65201