Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction

被引：21

作者：

Liang, Hanxue ^{[1
]}

Fan, Hehe ^{[2
]}

Fan, Zhiwen ^{[1
]}

Wang, Yi ^{[1
]}

Chen, Tianlong ^{[1
]}

Cheng, Yu ^{[3
]}

Wang, Zhangyang ^{[1
]}

机构：

[1] Univ Texas Austin, Austin, TX 78712 USA

[2] Natl Univ Singapore, Singapore, Singapore

[3] Microsoft Res Redmond, Redmond, WA USA

来源：

COMPUTER VISION - ECCV 2022, PT III | 2022年 / 13663卷

关键词：

Point cloud representation learning; Unsupervised domain adaptation; Shape classification; Semantic segmentation;

D O I：

10.1007/978-3-031-20062-5_10

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The superiority of deep learning based point cloud representations relies on large-scale labeled datasets, while the annotation of point clouds is notoriously expensive. One of the most effective solutions is to transfer the knowledge from existing labeled source data to unlabeled target data. However, domain bias typically hinders knowledge transfer and leads to accuracy degradation. In this paper, we propose a Masked Local Structure Prediction (MLSP) method to encode target data. Along with the supervised learning on the source domain, our method enables models to embed source and target data in a shared feature space. Specifically, we predict masked local structure via estimating point cardinality, position and normal. Our design philosophies lie in: 1) Point cardinality reflects basic structures (e.g., line, edge and plane) that are invariant to specific domains. 2) Predicting point positions in masked areas generalizes learned representations so that they are robust to incompletion-caused domain bias. 3) Point normal is generated by neighbors and thus robust to noise across domains. We conduct experiments on shape classification and semantic segmentation with different transfer permutations and the results demonstrate the effectiveness of our method. Code is available at https://github.com/VITA-Group/MLSP.

引用

页码：156 / 172

页数：17

共 47 条

[1] Self-Supervised Learning for Domain Adaptation on Point Clouds [J].

Achituve, Idan ;

Maron, Haggai ;

Chechik, Gal .

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :123-133

[2]

Chen XY, 2019, PR MACH LEARN RES, V97

[3] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].

Dai, Angela ;

Chang, Angel X. ;

Savva, Manolis ;

Halber, Maciej ;

Funkhouser, Thomas ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443

[4] Cluster Alignment with a Teacher for Unsupervised Domain Adaptation [J].

Deng, Zhijie ;

Luo, Yucen ;

Zhu, Jun .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9943-9952

[5]

Fan H., 2022, P IEEE CVF C COMP VI, P6377

[6]

Fan H., 2020, INT C LEARNING REPRE

[7] A Point Set Generation Network for 3D Object Reconstruction from a Single Image [J].

Fan, Haoqiang ;

Su, Hao ;

Guibas, Leonidas .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2463-2471

[8] Unsupervised Visual Representation Learning via Dual-Level Progressive Similar Instance Selection [J].

Fan, Hehe ;

Liu, Ping ;

Xu, Mingliang ;

Yang, Yi .

IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) :8851-8861

[9] Unsupervised Person Re-identification: Clustering and Fine-tuning [J].

Fan, Hehe ;

Zheng, Liang ;

Yan, Chenggang ;

Yang, Yi .

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (04)

[10]

Ganin Y, 2016, J MACH LEARN RES, V17

← 1 2 3 4 5 →