CO-Net plus plus : A Cohesive Network for Multiple Point Cloud Tasks at Once With Two-Stage Feature Rectification

被引：0

作者：

Xie, Tao ^{[1
,2
,3
]}

Dai, Kun ^{[1
]}

Sun, Qihao ^{[1
]}

Jiang, Zhiqiang ^{[1
]}

Cao, Chuqing ^{[3
]}

Zhao, Lijun ^{[1
,2
,3
]}

Wang, Ke ^{[1
,2
]}

Li, Ruifeng ^{[1
]}

机构：

[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China

[2] Harbin Inst Technol, Zhengzhou Res Inst, Harbin 150006, Peoples R China

[3] Wuhu HIT Robot Ind Technol Res Inst Co Ltd, Wuhu 241000, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Deep learning; multi-task learning; point cloud tasks; two-stage feature rectification;

D O I：

10.1109/TPAMI.2024.3447008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present CO-Net++, a cohesive framework that optimizes multiple point cloud tasks collectively across heterogeneous dataset domains with a two-stage feature rectification strategy. The core of CO-Net++ lies in optimizing task-shared parameters to capture universal features across various tasks while discerning task-specific parameters tailored to encapsulate the unique characteristics of each task. Specifically, CO-Net++ develops a two-stage feature rectification strategy (TFRS) that distinctly separates the optimization processes for task-shared and task-specific parameters. At the first stage, TFRS configures all parameters in backbone as task-shared, which encourages CO-Net++ to thoroughly assimilate universal attributes pertinent to all tasks. In addition, TFRS introduces a sign-based gradient surgery to facilitate the optimization of task-shared parameters, thus alleviating conflicting gradients induced by various dataset domains. In the second stage, TFRS freezes task-shared parameters and flexibly integrates task-specific parameters into the network for encoding specific characteristics of each dataset domain. CO-Net++ prominently mitigates conflicting optimization caused by parameter entanglement, ensuring the sufficient identification of universal and specific features. Extensive experiments reveal that CO-Net++ realizes exceptional performances on both 3D object detection and 3D semantic segmentation tasks. Moreover, CO-Net++ delivers an impressive incremental learning capability and prevents catastrophic amnesia when generalizing to new point cloud tasks.

引用

页码：10911 / 10928

页数：18

共 6 条

[1] Two-stage U-Net plus plus for Medical Image Segmentation
Al Suman, Abdulla
Sarda, Shubham
Asikuzzaman, Md
Webb, Alexandra Louise
Diana, M. Perriman
Tahtali, Murat
Di Ieva, Antonio
Pickering, Mark R.
2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 260 - 265
[2] PCRMLP: A Two-Stage Network for Point Cloud Registration in Urban Scenes
Liu, Jingyang
Xu, Yucheng
Zhou, Lu
Sun, Lei
SENSORS, 2023, 23 (12)
[3] Two-Stage Point Cloud Registration Framework Based on Graph Neural Network and Attention
Zhang, Xiaoqian
Li, Junlin
Zhang, Wei
Xu, Yansong
Li, Feng
ELECTRONICS, 2024, 13 (03)
[4] UF-Net: A unified network for panoptic driving perception with two-stage feature refinement
Zhou, Zilong
Liu, Ping
Huang, Haibo
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 260
[5] Two-stage graph matching point cloud registration method based on graph attention network
Guo, Jiacheng
Liu, Xuejun
Zhang, Shuo
Yan, Yong
Sha, Yun
Jiang, Yinan
JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
[6] PointAttentionVLAD: A Two-Stage Self-Attention Network for Point Cloud-Based Place Recognition Task
Yi, Yanjiang
Fu, Chuanmao
Zhang, Weizhe
Wang, Hongbo
IEEE ACCESS, 2024, 12 : 65192 - 65201

← 1 →