Domain adaptation learning for 3D point clouds:A survey

被引:0
|
作者
Fan W. [1 ]
Lin X. [1 ]
Luo H. [1 ]
Guo W. [1 ]
Wang H. [2 ]
Dai C. [2 ]
机构
[1] College of Computer and Data Science, Fuzhou University, Fuzhou
[2] School of Surveying and Mapping, Information Engineering University, Zhengzhou
基金
中国国家自然科学基金;
关键词
3D point cloud; adversarial learning; cross-modal learning; data alignment; domain adaption learning; pseudo-label learning; remote sensing;
D O I
10.11834/jrs.20233140
中图分类号
学科分类号
摘要
Three-dimensional (3D) point cloud data have been widely used in many fields, such as autonomous driving, robotics, and high-precision mapping. At present, the state-of-the-art deep learning-based methods for 3D point cloud processing are mainly supervised learning methods. The performance of these methods depends heavily on large-scale, high-quality annotated datasets. However, annotating a large-scale, high-quality, category-diverse, and scenario-rich dataset is time-consuming and labor-intensive. In particular, obtaining sufficiently large numbers of samples for model optimization is also quite difficult in some special cases. In addition, 3D point cloud processing models trained on a single device in a special environment are difficult to generalize to different devices and environments. Their generalizability to various devices and environments is limited. Thus, how to reduce dependencies on high-quality annotated 3D point cloud datasets and how to improve the generalizability of current point cloud processing models are important research topics. In recent years, various kinds of impressive and elaborate technologies, such as meta-learning, few-shot learning, transfer learning, self-supervised learning, semisupervised learning, and weakly supervised learning, have been proposed to solve this problem. As an important research branch of transfer learning, domain adaptive learning aims to eliminate differences in feature distributions across domains and promote the generalization ability of deep learning models, thereby providing a novel solution to address this problem effectively. The academic community has conducted preliminary research on domain adaptive learning for point cloud processing. However, the domain adaptive learning field for point clouds still requires in-depth and effective exploration. Consequently, this study systematically summarizes and classifies recent 3D point cloud domain adaptive learning methods into five categories: adversarial learning, cross-modal learning, pseudo-label learning, data alignment, and other kinds of methods. First, we present the mathematical definition of the domain adaptive learning task and depict the chronological overview of the development of different domain adaptive learning methods to provide readers with a clear understanding. Second, we present the general solution for each category of domain adaptive learning methods and summarize the advantages and disadvantages of the current methods for each category. Third, we compare the performance of current methods on three-point cloud processing tasks, including 3D shape classification, 3D object detection, and 3D semantic segmentation. For each task, we also summarize the commonly used datasets and evaluation metrics for an intuitional comparison. Finally, we conclude the advantages and disadvantages of these five categories of methods and discuss future research directions about the 3D point cloud domain adaptive learning. © 2024 Science Press. All rights reserved.
引用
收藏
页码:825 / 842
页数:17
相关论文
共 79 条
  • [1] Achituve I, Maron H, Chechik G., Self-supervised learning for domain adaptation on point clouds, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 123-133, (2021)
  • [2] Alliegro A, Boscaini D, Tommasi T., Joint supervised and self-supervised learning for 3D real world challenges, 2020 25th International Conference on Pattern Recognition (ICPR), pp. 6718-6725, (2021)
  • [3] Behley J, Garbade M, Milioto A, Quenzel J, Behnke S, Stachniss C, Gall J., SemanticKITTI: a dataset for semantic scene understanding of LiDAR sequences, 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9296-9306, (2019)
  • [4] Bian Y K, Hui L, Qian J J, Xie J., Unsupervised domain adaptation for point cloud semantic segmentation via graph matching, 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 9899-9904, (2022)
  • [5] Caesar H, Bankiti V, Lang A H, Vora S, Liong V E, Xu Q, Krishnan A, Pan Y, Baldan G, Beijbom O., NuScenes: a multimodal dataset for autonomous driving, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11618-11628, (2020)
  • [6] Cardace A, Spezialetti R, Ramirez P Z, Salti S, Stefano L D., RefRec: pseudo-labels refinement via shape reconstruction for unsupervised 3D domain adaptation, 2021 International Conference on 3D Vision (3DV), pp. 331-341, (2021)
  • [7] Chang A X, Funkhouser T, Guibas L, Hanrahan P, Huang Q X, Li Z M, Savarese S, Savva M, Song S R, Su H, Xiao J X, Yi L, Yu F., ShapeNet: an information-rich 3D model repository, (2015)
  • [8] Charles R Q, Su H, Kaichun M, Guibas L J., PointNet: deep learning on point sets for 3D classification and segmentation, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 77-85, (2017)
  • [9] Chen C, Fu Z H, Chen Z H, Jin S., Cheng Z W, Jin X Y, Hua X S., Homm: Higher-order moment matching for unsupervised domain adaptation, Proceedings of the AAAI conference on artificial intelligence, 34, pp. 3422-3429, (2020)
  • [10] Chen J J, Kakillioglu B, Ren H, Velipasalar S., Why discard if you can recycle?: A recycling max pooling module for 3d point cloud analysis, 2022 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 559-567, (2022)