Two-stage feature distribution rectification for few-shot point cloud semantic segmentation

被引：2

作者：

Wang, Tichao ^{[1
,2
]}

Hao, Fusheng ^{[1
,3
]}

Cui, Guosheng ^{[5
,6
]}

Wu, Fuxiang ^{[1
,3
]}

Yang, Mengjie ^{[4
]}

Zhang, Qieshi ^{[1
,2
,3
]}

Cheng, Jun ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, CAS Key Lab Human Machine Intelligence Synergy Sys, Shenzhen 518055, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[4] ShengYun Technol Co Ltd, Kunming, Peoples R China

[5] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[6] Joint Engn Res Ctr Hlth Big Data Intelligent Anal, Shenzhen 518055, Peoples R China

来源：

PATTERN RECOGNITION LETTERS | 2024年 / 177卷

基金：

中国国家自然科学基金;

关键词：

Few-shot learning; Point cloud semantic segmentation; Feature distribution rectification; NETWORK;

D O I：

10.1016/j.patrec.2023.12.008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot point cloud semantic segmentation segments new classes given few labeled examples and has attracted much attention recently. However, due to the scarcity of labeled data, there are biases between the ideal and the actual feature distributions. Addressing the above issues, we propose a two-stage feature distribution rectification method (TFDR) to reduce these biases. We define the biases in two aspects: interclass and intraclass distribution biases. Interclass distribution bias refers to the distribution shifting introduced by the difference between support data and query data. To reduce this bias, we design a novel feature alignment module (FAM). Intraclass distribution bias is defined as the bias between the ideal and the actual feature distribution of a class, which is introduced by the difference in local structures such as the seats and the legs of chairs. To mitigate the effects of intraclass distribution, we propose a distribution canonicalization module (DCM) rectifying the feature distributions of query data. The experimental results show that the proposed method outperforms several state-of-the-art methods with great significance on the S3DIS and ScanNet datasets, thus demonstrating the effectiveness of our model.

引用

页码：142 / 149

页数：8

共 38 条

[1] 3D Semantic Parsing of Large-Scale Indoor Spaces
Armeni, Iro
Sener, Ozan
Zamir, Amir R.
Jiang, Helen
Brilakis, Ioannis
Fischer, Martin
Savarese, Silvio
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1534 - 1543
[2] Unsupervised Domain Adaptation for Point Cloud Semantic Segmentation via Graph Matching
Bian, Yikai
Hui, Le
Qian, Jianjun
Xie, Jin
[J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9899 - 9904
[3] Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications
Cheng, Qing
Zeller, Niclas
Cremers, Daniel
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9235 - 9242
[4] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Dai, Angela
Chang, Angel X.
Savva, Manolis
Halber, Maciej
Funkhouser, Thomas
Niessner, Matthias
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2432 - 2443
[5] PCT: Point cloud transformer
Guo, Meng-Hao
Cai, Jun-Xiong
Liu, Zheng-Ning
Mu, Tai-Jiang
Martin, Ralph R.
Hu, Shi-Min
[J]. COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) : 187 - 199
[6] Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation
He, Shuting
Jiang, Xudong
Jiang, Wei
Ding, Henghui
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3199 - 3211
[7] Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
Huang, Xun
Belongie, Serge
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1510 - 1519
[8] Jinlu Liu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P741, DOI 10.1007/978-3-030-58452-8_43
[9] Stratified Transformer for 3D Point Cloud Segmentation
Lai, Xin
Liu, Jianhui
Jiang, Li
Wang, Liwei
Zhao, Hengshuang
Liu, Shu
Qi, Xiaojuan
Jia, Jiaya
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8490 - 8499
[10] Transductive distribution calibration for few-shot learning
Li, Gang
Zheng, Changwen
Su, Bing
[J]. NEUROCOMPUTING, 2022, 500 : 604 - 615

← 1 2 3 4 →