Two-stage feature distribution rectification for few-shot point cloud semantic segmentation

被引:2
作者
Wang, Tichao [1 ,2 ]
Hao, Fusheng [1 ,3 ]
Cui, Guosheng [5 ,6 ]
Wu, Fuxiang [1 ,3 ]
Yang, Mengjie [4 ]
Zhang, Qieshi [1 ,2 ,3 ]
Cheng, Jun [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, CAS Key Lab Human Machine Intelligence Synergy Sys, Shenzhen 518055, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] ShengYun Technol Co Ltd, Kunming, Peoples R China
[5] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[6] Joint Engn Res Ctr Hlth Big Data Intelligent Anal, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot learning; Point cloud semantic segmentation; Feature distribution rectification; NETWORK;
D O I
10.1016/j.patrec.2023.12.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot point cloud semantic segmentation segments new classes given few labeled examples and has attracted much attention recently. However, due to the scarcity of labeled data, there are biases between the ideal and the actual feature distributions. Addressing the above issues, we propose a two-stage feature distribution rectification method (TFDR) to reduce these biases. We define the biases in two aspects: interclass and intraclass distribution biases. Interclass distribution bias refers to the distribution shifting introduced by the difference between support data and query data. To reduce this bias, we design a novel feature alignment module (FAM). Intraclass distribution bias is defined as the bias between the ideal and the actual feature distribution of a class, which is introduced by the difference in local structures such as the seats and the legs of chairs. To mitigate the effects of intraclass distribution, we propose a distribution canonicalization module (DCM) rectifying the feature distributions of query data. The experimental results show that the proposed method outperforms several state-of-the-art methods with great significance on the S3DIS and ScanNet datasets, thus demonstrating the effectiveness of our model.
引用
收藏
页码:142 / 149
页数:8
相关论文
共 38 条
  • [1] 3D Semantic Parsing of Large-Scale Indoor Spaces
    Armeni, Iro
    Sener, Ozan
    Zamir, Amir R.
    Jiang, Helen
    Brilakis, Ioannis
    Fischer, Martin
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1534 - 1543
  • [2] Unsupervised Domain Adaptation for Point Cloud Semantic Segmentation via Graph Matching
    Bian, Yikai
    Hui, Le
    Qian, Jianjun
    Xie, Jin
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 9899 - 9904
  • [3] Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications
    Cheng, Qing
    Zeller, Niclas
    Cremers, Daniel
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9235 - 9242
  • [4] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
    Dai, Angela
    Chang, Angel X.
    Savva, Manolis
    Halber, Maciej
    Funkhouser, Thomas
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2432 - 2443
  • [5] PCT: Point cloud transformer
    Guo, Meng-Hao
    Cai, Jun-Xiong
    Liu, Zheng-Ning
    Mu, Tai-Jiang
    Martin, Ralph R.
    Hu, Shi-Min
    [J]. COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) : 187 - 199
  • [6] Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation
    He, Shuting
    Jiang, Xudong
    Jiang, Wei
    Ding, Henghui
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3199 - 3211
  • [7] Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
    Huang, Xun
    Belongie, Serge
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1510 - 1519
  • [8] Jinlu Liu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P741, DOI 10.1007/978-3-030-58452-8_43
  • [9] Stratified Transformer for 3D Point Cloud Segmentation
    Lai, Xin
    Liu, Jianhui
    Jiang, Li
    Wang, Liwei
    Zhao, Hengshuang
    Liu, Shu
    Qi, Xiaojuan
    Jia, Jiaya
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8490 - 8499
  • [10] Transductive distribution calibration for few-shot learning
    Li, Gang
    Zheng, Changwen
    Su, Bing
    [J]. NEUROCOMPUTING, 2022, 500 : 604 - 615