Superpoint Transformer for 3D Scene Instance Segmentation

Cited by: 0
Authors
Sun, Jiahao [1]
Qing, Chunmei [1]
Tan, Junpeng [1]
Xu, Xiangmin [2]
Affiliations
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Peoples R China
[2] South China Univ Technol, Sch Future Technol, Guangzhou, Peoples R China
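Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract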
Most existing methods realize 3D instance segmentation by extending models designed for 3D object detection or 3D semantic segmentation. However, these indirect methods suffer from two drawbacks: 1) imprecise bounding boxes or unsatisfactory semantic predictions limit the performance of the overall 3D instance segmentation framework; 2) they require a time-consuming intermediate aggregation step. To address these issues, this paper proposes a novel end-to-end 3D instance segmentation method based on Superpoint Transformer, named SPFormer. It groups potential features from point clouds into superpoints and directly predicts instances through query vectors, without relying on the results of object detection or semantic segmentation. The key step in this framework is a novel transformer-based query decoder that captures instance information through a superpoint cross-attention mechanism and generates the superpoint masks of the instances. Through bipartite matching based on superpoint masks, SPFormer can be trained without the intermediate aggregation step, which accelerates the network. Extensive experiments on the ScanNetv2 and S3DIS benchmarks verify that the method is concise yet efficient. Notably, SPFormer outperforms the compared state-of-the-art methods by 4.3% mAP on the ScanNetv2 hidden test set while maintaining fast inference (247 ms per frame). Code is available at https://github.com/sunjiahao1999/SPFormer.
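The two ideas the abstract leans on, pooling point features into superpoints and matching predicted superpoint masks to ground-truth instances one-to-one, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation (see their repository for that). The function names `pool_superpoints`, `dice_cost`, and `match_masks` are hypothetical, mean pooling is an assumed pooling choice, the Dice-only matching cost is a simplifying assumption, and the exhaustive search stands in for a proper assignment solver.

```python
from itertools import permutations


def pool_superpoints(point_feats, superpoint_ids):
    """Mean-pool per-point feature vectors into one vector per superpoint.

    Assumption: mean pooling; the abstract only says features are grouped.
    """
    sums, counts = {}, {}
    for feat, sp in zip(point_feats, superpoint_ids):
        acc = sums.setdefault(sp, [0.0] * len(feat))
        for i, value in enumerate(feat):
            acc[i] += value
        counts[sp] = counts.get(sp, 0) + 1
    return {sp: [v / counts[sp] for v in acc] for sp, acc in sums.items()}


def dice_cost(pred, gt, eps=1e-6):
    """1 - Dice overlap between a soft predicted mask and a binary target mask.

    Both masks are lists with one entry per superpoint.
    """
    inter = sum(p * g for p, g in zip(pred, gt))
    return 1.0 - (2.0 * inter + eps) / (sum(pred) + sum(gt) + eps)


def match_masks(pred_masks, gt_masks):
    """Find the one-to-one assignment of ground-truth masks to queries
    that minimizes the total Dice cost.

    Uses exhaustive search (factorial time), so it is only suitable for
    tiny illustrative inputs. Returns a list of (query_index, gt_index).
    """
    n_q, n_gt = len(pred_masks), len(gt_masks)
    assert n_gt <= n_q, "need at least as many queries as instances"
    cost = [[dice_cost(p, g) for g in gt_masks] for p in pred_masks]
    best, best_total = (), float("inf")
    for queries in permutations(range(n_q), n_gt):
        total = sum(cost[q][j] for j, q in enumerate(queries))
        if total < best_total:
            best, best_total = queries, total
    return [(q, j) for j, q in enumerate(best)]


# Three points, two superpoints: points 0 and 1 share superpoint 0.
pooled = pool_superpoints([[1.0, 0.0], [3.0, 2.0], [0.0, 4.0]], [0, 0, 1])

# Three query masks over four superpoints, two ground-truth instances.
pred_masks = [
    [0.1, 0.1, 0.9, 0.8],  # query 0 fires on superpoints 2 and 3
    [0.9, 0.8, 0.1, 0.2],  # query 1 fires on superpoints 0 and 1
    [0.2, 0.1, 0.1, 0.1],  # query 2 fires on nothing in particular
]
gt_masks = [[1, 1, 0, 0], [0, 0, 1, 1]]
matches = match_masks(pred_masks, gt_masks)
```

Here query 1 is assigned to the first instance and query 0 to the second, while query 2 stays unmatched. Because each ground-truth instance is paired with exactly one query, the loss can be computed directly on the matched pairs. In practice, a polynomial-time solver such as `scipy.optimize.linear_sum_assignment` would replace the exhaustive search.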
Pages: 2393-2401
Page count: 9
Related papers
50 in total
  • [1] Efficient 3D Semantic Segmentation with Superpoint Transformer
    Robert, Damien
    Raguet, Hugo
    Landrieu, Loic
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17149 - 17158
  • [2] Learning Superpoint Graph Cut for 3D Instance Segmentation
    Hui, Le
    Tang, Linghua
    Shen, Yaqi
    Xie, Jin
    Yang, Jian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks
    Liang, Zhihao
    Li, Zhihao
    Xu, Songcen
    Tan, Mingkui
    Jia, Kui
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2763 - 2772
  • [4] Learning Inter-superpoint Affinity for Weakly Supervised 3D Instance Segmentation
    Tang, Linghua
    Hui, Le
    Xie, Jin
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 176 - 192
  • [5] Query Refinement Transformer for 3D Instance Segmentation
    Lu, Jiahao
    Deng, Jiacheng
    Wang, Chuxin
    He, Jianfeng
    Zhang, Tianzhu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18470 - 18480
  • [6] Semantic Instance Segmentation in a 3D Traffic Scene Reconstruction task
    Hadi, Shiqah
    Phon-Amnuaisuk, Somnuk
    Tan, Soon-Jiann
    2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 186 - 191
  • [7] Mask3D: Mask Transformer for 3D Semantic Instance Segmentation
    Schult, Jonas
    Engelmann, Francis
    Hermans, Alexander
    Litany, Or
    Tang, Siyu
    Leibe, Bastian
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8216 - 8223
  • [8] Mask-Attention-Free Transformer for 3D Instance Segmentation
    Lai, Xin
    Yuan, Yuhui
    Chu, Ruihang
    Chen, Yukang
    Hu, Han
    Jia, Jiaya
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3670 - 3680
  • [9] Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering
    Robert, Damien
    Raguet, Hugo
    Landrieu, Loic
    2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 179 - 189
  • [10] Transformer-based 3D Instance Segmentation With Auxiliary Denoising Learning
    Song S.-H.
    Kim I.
    Journal of Institute of Control, Robotics and Systems, 2023, 29 (12) : 954 - 965