Hypergraph convolutional network based weakly supervised point cloud semantic segmentation with scene-level annotations

被引:0
作者
Lu, Zhuheng [1 ]
Zhang, Peng [2 ]
Dai, Yuewei [2 ]
Li, Weiqing [3 ]
Su, Zhiyong [2 ]
机构
[1] Wuxi Univ, Sch Automat, Wuxi, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Automat, Nanjing, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
基金
国家重点研发计划;
关键词
Weakly supervised segmentation; Hypergraph; Scene-level supervision; Semantic segmentation; Point cloud; IMAGE; RETRIEVAL; GRAPH;
D O I
10.1016/j.neucom.2024.129264
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point cloud segmentation with scene-level annotations is a promising but challenging task. Currently, the most popular way is to employ the class activation map (CAM) to locate discriminative regions and then generate point-level pseudo labels from scene-level annotations. However, these methods always suffer from the point imbalance among categories, as well as the sparse and incomplete supervision from CAM. In this paper, we propose a novel weighted hypergraph convolutional network-based method, called WHCN, to confront the challenges of learning point-wise labels from scene-level annotations. Firstly, in order to simultaneously overcome the point imbalance among different categories and reduce the model complexity, superpoints of a training point cloud are generated by exploiting the geometrically homogeneous partition. Then, a hypergraph is constructed based on the high-confidence superpoint-level seeds which are converted from scene-level annotations. Secondly, the WHCN takes the hypergraph as input and learns to predict high-precision point- level pseudo labels by label propagation. Besides the backbone network consisting of spectral hypergraph convolution blocks, a hyperedge attention module is learned to adjust the weights of hyperedges in the WHCN. Finally, a segmentation network is trained by these pseudo point cloud labels. Experimental results on the scanNet, S3DIS, Semantic3D, and ShapeNet Part benchmarks demonstrate that the proposed WHCN is effective to predict the point labels with scene annotations, which outperforms the state-of-the-art by 3.5% to 36.1% mIou. The source code is available at https://github.com/VCG-NJUST/WHCN.
引用
收藏
页数:12
相关论文
共 80 条
[1]  
Agarwal S., 2006, P 23 INT C MACHINE L, P17
[2]  
[Anonymous], NIPS, P5099, DOI [10.48550/arXiv.1706.02413, DOI 10.1109/CVPR.2017.16]
[3]  
Armeni I., 2017, arXiv, DOI [10.48550/arXiv.1702.01105, DOI 10.48550/ARXIV.1702.01105]
[4]   Hypergraph convolution and hypergraph attention [J].
Bai, Song ;
Zhang, Feihu ;
Torr, Philip H. S. .
PATTERN RECOGNITION, 2021, 110
[5]  
Bandyopadhyay S, 2020, Arxiv, DOI arXiv:2002.03392
[6]   Review: Deep Learning on 3D Point Clouds [J].
Bello, Saifullahi Aminu ;
Yu, Shangshu ;
Wang, Cheng ;
Adam, Jibril Muhmmad ;
Li, Jonathan .
REMOTE SENSING, 2020, 12 (11)
[7]  
Boulch A., 2017, 3DOR, P1, DOI [10.2312/3dor.20171047, DOI 10.2312/3DOR.20171047]
[8]   ConvPoint: Continuous convolutions for point cloud processing [J].
Boulch, Alexandre .
COMPUTERS & GRAPHICS-UK, 2020, 88 :24-34
[9]   SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks [J].
Boulch, Alexandre ;
Guerry, Yids ;
Le Saux, Bertrand ;
Audebert, Nicolas .
COMPUTERS & GRAPHICS-UK, 2018, 71 :189-198
[10]  
Chenfeng Xu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12373), P1, DOI 10.1007/978-3-030-58604-1_1