Dynamic Convolution for 3D Point Cloud Instance Segmentation

Cited by: 7

Authors:

He, Tong [1]

Shen, Chunhua [2]

van den Hengel, Anton [3]

Affiliations:

[1] Univ Adelaide, Shanghai AI Lab, Adelaide, SA 5005, Australia

[2] Zhejiang Univ, Hangzhou 310027, Zhejiang, Peoples R China

[3] Univ Adelaide, Adelaide, SA 5005, Australia

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023, Vol. 45, Issue 05

Keywords:

Point cloud; instance segmentation; dynamic convolution; deep learning;

DOI:

10.1109/TPAMI.2022.3216926

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we propose a simple yet effective and robust approach for instance segmentation on 3D point clouds. Previous top-performing methods for this task adopt a bottom-up strategy, which often involves inefficient operations or complex pipelines, such as grouping over-segmented components, introducing heuristic post-processing steps, and designing complex loss functions. As a result, the inevitable variation in instance sizes makes these methods sensitive to the values of pre-defined hyper-parameters. We instead propose a novel pipeline that applies dynamic convolution to generate instance-aware parameters in response to the characteristics of each instance. The representation capability of the parameters is greatly improved by gathering homogeneous points that share the same semantic category and have close votes for the geometric centroid. Instances are then decoded via several simple convolution layers whose parameters are generated depending on the input. In addition, to introduce a large context while limiting computational overhead, a light-weight transformer is built upon the bottleneck layer to capture long-range dependencies. With non-maximum suppression (NMS) as the only post-processing step, we demonstrate a simpler and more robust approach that achieves promising performance on various datasets: ScanNetV2, S3DIS, and PartNet. The consistent improvements on both voxel- and point-based architectures imply the effectiveness of the proposed method. Code is available at: https://git.io/DyCo3D.
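
As a rough illustration of the dynamic-convolution idea in the abstract, the sketch below generates a flat parameter vector from the pooled features of one group of points and then uses it as the weights of two 1x1 (per-point) convolution layers to decode a mask. This is a minimal NumPy sketch under stated assumptions, not the authors' implementation (see the linked repository for that): the function names `generate_params` and `dynamic_conv_mask`, the mean pooling, the single linear generator, and the hidden width of 8 are all hypothetical choices made for illustration.

```python
import numpy as np


def generate_params(group_feats, gen_w, gen_b):
    """Map one group of points to a flat filter vector (hypothetical generator).

    group_feats: (M, C) features of points assumed to share a semantic
                 category and to vote for nearby centroids.
    gen_w, gen_b: (C, P) and (P,) weights of an assumed linear generator.
    Returns a (P,) vector of instance-specific convolution parameters.
    """
    pooled = group_feats.mean(axis=0)  # (C,) mean-pooled group descriptor
    return pooled @ gen_w + gen_b


def dynamic_conv_mask(point_feats, instance_params, hidden=8):
    """Decode one instance mask with per-point layers whose weights are inputs.

    point_feats: (N, C) features of all points in the scene.
    instance_params: flat vector of length C*hidden + hidden + hidden + 1.
    Returns (N,) mask probabilities in [0, 1].
    """
    n, c = point_feats.shape
    expected = c * hidden + hidden + hidden + 1
    if instance_params.shape != (expected,):
        raise ValueError(f"expected {expected} parameters, got {instance_params.shape}")

    # Slice the flat vector into the weights and biases of two layers.
    i = 0
    w1 = instance_params[i:i + c * hidden].reshape(c, hidden)
    i += c * hidden
    b1 = instance_params[i:i + hidden]
    i += hidden
    w2 = instance_params[i:i + hidden].reshape(hidden, 1)
    i += hidden
    b2 = instance_params[i:i + 1]

    h = np.maximum(point_feats @ w1 + b1, 0.0)  # layer 1 with ReLU
    logits = (h @ w2 + b2)[:, 0]                # layer 2, one logit per point
    return 1.0 / (1.0 + np.exp(-logits))        # sigmoid


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.normal(size=(100, 4))           # toy scene: 100 points, 4 channels
    n_params = 4 * 8 + 8 + 8 + 1
    gen_w = rng.normal(size=(4, n_params))
    gen_b = np.zeros(n_params)
    params = generate_params(feats[:10], gen_w, gen_b)  # first 10 points as a group
    mask = dynamic_conv_mask(feats, params)
    print(mask.shape)
```

Because the filter weights are an input rather than a fixed learned tensor, each candidate instance gets its own decoder; in a full pipeline the resulting masks would still need to be deduplicated, for example with NMS as the abstract describes.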

Citation

Pages: 5697-5711

Page count: 15

