Dynamic Convolution for 3D Point Cloud Instance Segmentation

Cited by: 7
Authors
He, Tong [1]
Shen, Chunhua [2]
van den Hengel, Anton [3]
Affiliations
[1] Univ Adelaide, Shanghai AI Lab, Adelaide, SA 5005, Australia
[2] Zhejiang Univ, Hangzhou 310027, Zhejiang, Peoples R China
[3] Univ Adelaide, Adelaide, SA 5005, Australia
Keywords
Point cloud; instance segmentation; dynamic convolution; deep learning;
DOI
10.1109/TPAMI.2022.3216926
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a simple yet effective and robust approach to instance segmentation on 3D point clouds. Previous top-performing methods for this task adopt a bottom-up strategy, which often involves inefficient operations or complex pipelines, such as grouping over-segmented components, introducing heuristic post-processing steps, and designing complex loss functions. As a result, the inevitable variation in instance sizes makes these methods sensitive to the values of pre-defined hyper-parameters. We instead propose a novel pipeline that applies dynamic convolution, generating instance-aware parameters in response to the characteristics of each instance. The representation capability of the parameters is greatly improved by gathering homogeneous points that share the same semantic category and cast close votes for the geometric centroid. Instances are then decoded via several simple convolution layers whose parameters are generated from the input. In addition, to introduce a large context while keeping the computational overhead limited, a light-weight transformer is built upon the bottleneck layer to capture long-range dependencies. With non-maximum suppression (NMS) as the only post-processing step, we demonstrate a simpler and more robust approach that achieves promising performance on various datasets: ScanNetV2, S3DIS, and PartNet. The consistent improvements on both voxel- and point-based architectures indicate the effectiveness of the proposed method. Code is available at: https://git.io/DyCo3D.
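The decoding step described in the abstract, where a few simple convolution layers use parameters generated per instance, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`num_params`, `split_params`, `dynamic_mask_head`), the layer sizes, and the flat parameter vector `theta` are all hypothetical, and the network that would generate `theta` from a cluster of homogeneous points is assumed rather than shown. A 1x1 convolution over per-point features is treated here as a per-point linear layer.

```python
import math
import random


def num_params(dims):
    """Number of weights and biases for 1x1 conv layers dims[i] -> dims[i + 1]."""
    return sum(c_in * c_out + c_out for c_in, c_out in zip(dims[:-1], dims[1:]))


def split_params(theta, dims):
    """Slice a flat, instance-specific vector into (weight, bias) pairs per layer."""
    layers, pos = [], 0
    for c_in, c_out in zip(dims[:-1], dims[1:]):
        weight = [theta[pos + o * c_in: pos + (o + 1) * c_in] for o in range(c_out)]
        pos += c_in * c_out
        bias = theta[pos: pos + c_out]
        pos += c_out
        layers.append((weight, bias))
    assert pos == len(theta), "theta length must match the layer dims"
    return layers


def dynamic_mask_head(point_feats, theta, dims):
    """Apply the instance-specific layers to every point; return mask probabilities."""
    layers = split_params(theta, dims)
    probs = []
    for x in point_feats:
        for i, (weight, bias) in enumerate(layers):
            x = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]
            if i < len(layers) - 1:
                x = [max(0.0, v) for v in x]  # ReLU between layers
        probs.append(1.0 / (1.0 + math.exp(-x[0])))  # sigmoid on the single output
    return probs


# Hypothetical usage: 5 points with 4-dim features, one hidden layer of width 8.
random.seed(0)
dims = [4, 8, 1]
feats = [[random.uniform(-1, 1) for _ in range(dims[0])] for _ in range(5)]
# Stand-in for the generated parameters of one instance (assumed, not learned here).
theta = [random.uniform(-0.5, 0.5) for _ in range(num_params(dims))]
mask = dynamic_mask_head(feats, theta, dims)
```

Each candidate instance would supply its own `theta`, so the same shared point features yield a different binary mask per instance; overlapping masks could then be pruned with NMS, as the abstract notes.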
Pages: 5697-5711
Page count: 15