Active self-training for weakly supervised 3D scene semantic segmentation

被引:2
作者
Liu, Gengxin [1 ]
van Kaick, Oliver [2 ]
Huang, Hui [1 ]
Hu, Ruizhen [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
semantic segmentation; weakly supervised; self-training; active learning; DEFORMATION TRANSFER;
D O I
10.1007/s41095-022-0311-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Since the preparation of labeled data for training semantic segmentation networks of point clouds is a time-consuming process, weakly supervised approaches have been introduced to learn from only a small fraction of data. These methods are typically based on learning with contrastive losses while automatically deriving per-point pseudo-labels from a sparse set of user-annotated labels. In this paper, our key observation is that the selection of which samples to annotate is as important as how these samples are used for training. Thus, we introduce a method for weakly supervised segmentation of 3D scenes that combines self-training with active learning. Active learning selects points for annotation that are likely to result in improvements to the trained model, while self-training makes efficient use of the user-provided labels for learning the model. We demonstrate that our approach leads to an effective method that provides improvements in scene segmentation over previous work and baselines, while requiring only a few user annotations.
引用
收藏
页码:1063 / 1078
页数:16
相关论文
共 39 条
  • [1] [Anonymous], 2021, ACM Transactions on Graphics, V40
  • [2] [Anonymous], 2016, ACM Transactions on Graphics, V35
  • [3] [Anonymous], 2018, ACM Transactions on Graphics, V37
  • [4] Armeni S., 2017, arXiv
  • [5] Cheng MM, 2021, AAAI CONF ARTIF INTE, V35, P1140
  • [6] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
    Choy, Christopher
    Gwak, JunYoung
    Savarese, Silvio
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3070 - 3079
  • [7] 3DMV: Joint 3D-Multi-view Prediction for 3D Semantic Scene Segmentation
    Dai, Angela
    Niessner, Matthias
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 458 - 474
  • [8] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [9] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
    Dai, Angela
    Chang, Angel X.
    Savva, Manolis
    Halber, Maciej
    Funkhouser, Thomas
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2432 - 2443
  • [10] Gadelha Matheus, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12355), P473, DOI 10.1007/978-3-030-58607-2_28