Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels

被引：41

作者：

Fan, Hehe ^{[1
]}

Chang, Xiaojun ^{[2
]}

Zhang, Wanyue ^{[3
,4
]}

Cheng, Yi ^{[4
]}

Sun, Ying ^{[4
]}

Kankanhalli, Mohan ^{[1
]}

机构：

[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore

[2] Univ Technol, ReLER Lab, AAII, Sydney, NSW, Australia

[3] Max Planck Inst Informat, Saarbrucken, Germany

[4] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore, Singapore

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.00627

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose an unsupervised domain adaptation method for deep point cloud representation learning. To model the internal structures in target point clouds, we first propose to learn the global representations of unlabeled data by scaling up or down point clouds and then predicting the scales. Second, to capture the local structure in a self-supervised manner, we propose to project a 3D local area onto a 2D plane and then learn to reconstruct the squeezed region. Moreover, to effectively transfer the knowledge from source domain, we propose to vote pseudo labels for target samples based on the labels of their nearest source neighbors in the shared feature space. To avoid the noise caused by incorrect pseudo labels, we only select reliable target samples, whose voting consistencies are high enough, for enhancing adaptation. The voting method is able to adaptively select more and more target samples during training, which in return facilitates adaptation because the amount of labeled target data increases. Experiments on PointDA (ModelNet-10, ShapeNet-10 and ScanNet-10) and Sim-to-Real (ModelNet-11, ScanObjectNN-11, ShapeNet-9 and ScanObjectNN-9) demonstrate the effectiveness of our method.

引用

页码：6367 / 6376

页数：10

共 43 条

[1]

[Anonymous], IEEE T PATTERN ANAL

[2]

[Anonymous], 2021, WACV, DOI DOI 10.1109/WACV48630.2021.00017

[3]

Chang Angel X., 2015, arXiv

[4] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[5] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].

Dai, Angela ;

Chang, Angel X. ;

Savva, Manolis ;

Halber, Maciej ;

Funkhouser, Thomas ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443

[6] Adaptive Exploration for Unsupervised Person Re-identification [J].

Ding, Yuhang ;

Fan, Hehe ;

Xu, Mingliang ;

Yang, Yi .

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)

[7] Unsupervised Visual Representation Learning by Context Prediction [J].

Doersch, Carl ;

Gupta, Abhinav ;

Efros, Alexei A. .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1422-1430

[8] Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks [J].

Dosovitskiy, Alexey ;

Fischer, Philipp ;

Springenberg, Jost Tobias ;

Riedmiller, Martin ;

Brox, Thomas .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (09) :1734-1747

[9]

Fan H., 2021, PROC IEEECVF C COMPU, P14204

[10] Heterogeneous Hypergraph Variational Autoencoder for Link Prediction [J].

Fan, Haoyi ;

Zhang, Fengbin ;

Wei, Yuxuan ;

Li, Zuoyong ;

Zou, Changqing ;

Gao, Yue ;

Dai, Qionghai .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (08) :4125-4138

← 1 2 3 4 5 →