Uncertainty-Aware Superpoint Graph Transformer for Weakly Supervised 3-D Semantic Segmentation

被引：0

作者：

Fan, Yan ^{[1
,2
,3
]}

Wang, Yu ^{[1
,2
,3
]}

Zhu, Pengfei ^{[1
,2
,3
]}

Hui, Le ^{[4
,5
,6
]}

Xie, Jin ^{[7
,8
]}

Hu, Qinghua ^{[1
,2
,3
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China

[2] Tianjin Univ, Engn Res Ctr City Intelligence & Digital Governanc, Minist Educ Peoples Republic China, Tianjin 300350, Peoples R China

[3] Tianjin Univ, Haihe Lab ITAI, Tianjin 300350, Peoples R China

[4] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China

[5] Northwestern Polytech Univ, Shaanxi Key Lab Informat Acquisit & Proc, Xian 710072, Peoples R China

[6] Nanjing Univ Sci & Technol, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens Inf, Minist Educ, Xian 710072, Peoples R China

[7] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China

[8] Nanjing Univ, Sch Intelligence Sci & Technol, Suzhou 215163, Peoples R China

来源：

IEEE TRANSACTIONS ON FUZZY SYSTEMS | 2025年 / 33卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Annotations; Point cloud compression; Semantic segmentation; Three-dimensional displays; Transformers; Uncertainty; Reliability; Training; Feature extraction; Semantics; 3-D semantic segmentation; fuzzy set; transformer; uncertainty learning; weakly supervised learning; POINT; NET;

D O I：

10.1109/TFUZZ.2025.3543036

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised 3-D semantic segmentation has successfully mitigated the labor-intensive and time-consuming task of annotating 3-D point clouds. However, reliably utilizing the minimal point-wise annotations for unlabeled data in complex and large-scale scenes is still challenging, such as only 20 points labeled in 2 million points. To tackle this challenge, we propose a new Uncertainty-aware Superpoint Graph Transformer (UaSGT) framework that utilizes minimal annotations for unlabeled data learning through reliable long-range supervision propagation from labeled superpoints to unlabeled superpoints. First, we propose a superpoint graph transformer to achieve long-range supervision propagation along the attention-based fuzzy subsets defined on superpoints. The attention-based fuzzy subset measures the membership of unlabeled superpoints to clusters centered on labeled superpoints. Second, we employ an uncertainty-aware membership rectification technique on the fuzzy subset to ensure reliable propagation among superpoints within the same category. This technique integrates an uncertainty prediction module to mask the influence of unreliable membership and a spatial prior refinement module to reduce uncertainty in intraclass membership degrees. Finally, experimental results on two large-scale benchmarks S3DIS and ScanNet-V2 demonstrate the superiority of our approach compared to the state-of-the-art with at least 90% annotation reduction, and our method also achieves comparable performance to fully supervised methods with less than 0.1% labeled points.

引用

页码：1899 / 1912

页数：14

共 72 条

[1]

Armeni I., 2017, arXiv, DOI DOI 10.48550/ARXIV.1702.01105

[2]

Chen YL, 2024, Arxiv, DOI arXiv:2404.12861

[3]

Cheng MM, 2021, AAAI CONF ARTIF INTE, V35, P1140

[4] (AF)2-S3Net: Attentive Feature Fusion with Adaptive Feature Selection for Sparse Semantic Segmentation Network [J].

Cheng, Ran ;

Razani, Ryan ;

Taghavi, Ehsan ;

Li, Enxu ;

Liu, Bingbing .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12542-12551

[5] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks [J].

Choy, Christopher ;

Gwak, JunYoung ;

Savarese, Silvio .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3070-3079

[6]

Contreras J, 2019, INT GEOSCI REMOTE SE, P5236, DOI [10.1109/IGARSS.2019.8899303, 10.1109/igarss.2019.8899303]

[7] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[8] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].

Dai, Angela ;

Chang, Angel X. ;

Savva, Manolis ;

Halber, Maciej ;

Funkhouser, Thomas ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443

[9] PCL: Point Contrast and Labeling for Weakly Supervised Point Cloud Semantic Segmentation [J].

Du, Anan ;

Zhou, Tianfei ;

Pang, Shuchao ;

Wu, Qiang ;

Zhang, Jian .

IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :8902-8914

[10] Graph Regulation Network for Point Cloud Segmentation [J].

Du, Zijin ;

Liang, Jianqing ;

Liang, Jiye ;

Yao, Kaixuan ;

Cao, Feilong .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) :7940-7955

← 1 2 3 4 5 6 7 8 →