ALSO: Automotive Lidar Self-supervision by Occupancy estimation

被引:16
作者
Boulch, Alexandre [1 ]
Sautier, Corentin [1 ,2 ]
Michele, Bjorn [1 ,3 ]
Puy, Gilles [1 ]
Marlet, Renaud [1 ,2 ]
机构
[1] Valeo ai, Paris, France
[2] Univ Gustave Eiffel, CNRS, LIGM, Ecole Ponts, Marne La Vallee, France
[3] Univ Bretagne Sud, CNRS, IRISA, Vannes, France
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
D O I
10.1109/CVPR52729.2023.01293
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new self-supervised method for pre-training the backbone of deep perception models operating on point clouds. The core idea is to train the model on a pretext task which is the reconstruction of the surface on which the 3D points are sampled, and to use the underlying latent vectors as input to the perception head. The intuition is that if the network is able to reconstruct the scene surface, given only sparse input points, then it probably also captures some fragments of semantic information, that can be used to boost an actual perception task. This principle has a very simple formulation, which makes it both easy to implement and widely applicable to a large range of 3D sensors and deep networks performing semantic segmentation or object detection. In fact, it supports a single-stream pipeline, as opposed to most contrastive learning approaches, allowing training on limited resources. We conducted extensive experiments on various autonomous driving datasets, involving very different kinds of lidars, for both semantic segmentation and object detection. The results show the effectiveness of our method to learn useful representations without any annotation, compared to existing approaches. The code is available at github.com/valeoai/ALSO
引用
收藏
页码:13455 / 13465
页数:11
相关论文
共 93 条
  • [1] Achlioptas P., 2018, ICML
  • [2] [Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.01298
  • [3] [Anonymous], 2019, NEURIPS
  • [4] [Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00264
  • [5] [Anonymous], 2021, CVPR, DOI DOI 10.1109/CVPR46437.2021.00676
  • [6] Atzmon M., 2021, INT C LEARN REPR ICL
  • [7] Bao Hangbo, 2021, ICLR
  • [8] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [9] Boulch A., 2022, P IEEE CVF C COMP VI, P6302
  • [10] Boulch Alexandre, 2021, 3DV