Structural Relation Modeling of 3D Point Clouds

被引:0
作者
Zheng, Yu [1 ]
Lu, Jiwen [1 ]
Duan, Yueqi [2 ]
Zhou, Jie [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Point cloud compression; Solid modeling; Feature extraction; Semantics; Aggregates; Deep learning; Structural modeling; relational learning; point cloud recognition; 3D deep learning; CLASSIFICATION; NETWORK;
D O I
10.1109/TIP.2024.3451940
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an effective plug-and-play module called structural relation network (SRN) to model structural dependencies in 3D point clouds for feature representation. Existing network architectures such as PointNet++ and RS-CNN capture local structures individually and ignore the inner interactions between different sub-clouds. Motivated by the fact that structural relation modeling plays critical roles for humans to understand 3D objects, our SRN exploits local information by modeling structural relations in 3D spaces. For a given sub-cloud of point sets, SRN firstly extracts its geometrical and locational relations with the other sub-clouds and maps them into the embedding space, then aggregates both relational features with the other sub-clouds. As the variation of semantics embedded in different sub-clouds is ignored by SRN, we further extend SRN to enable dynamic message passing between different sub-clouds. We propose a graph-based structural relation network (GSRN) where sub-clouds and their pairwise relations are modeled as nodes and edges respectively, so that the node features are updated by the messages along the edges. Since the node features might not be well preserved when acquiring the global representation, we propose a Combined Entropy Readout (CER) function to adaptively aggregate them into the holistic representation, so that GSRN simultaneously models the local-local and local-global region-wise interaction. The proposed SRN and GSRN modules are simple, interpretable, and do not require any additional supervision signals, which can be easily equipped with the existing networks. Experimental results on the benchmark datasets (ScanObjectNN, ModelNet40, ShapeNet Part, S3DIS, ScanNet and SUN-RGBD) indicate promising boosts on the tasks of 3D point cloud classification, segmentation and object detection.
引用
收藏
页码:4867 / 4881
页数:15
相关论文
共 81 条
[1]  
[Anonymous], 2015, 3 INT C LEARN REPR I
[2]   3D Scene Graph: A structure for unified semantics, 3D space, and camera [J].
Armeni, Iro ;
He, Zhi-Yang ;
Gwak, JunYoung ;
Zamir, Amir R. ;
Fischer, Martin ;
Malik, Jitendra ;
Savarese, Silvio .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5663-5672
[3]   3D Semantic Parsing of Large-Scale Indoor Spaces [J].
Armeni, Iro ;
Sener, Ozan ;
Zamir, Amir R. ;
Jiang, Helen ;
Brilakis, Ioannis ;
Fischer, Martin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543
[4]  
Atzmon M, 2018, Arxiv, DOI arXiv:1803.10091
[5]   3DmFV: Three-Dimensional Point Cloud Classification in Real-Time Using Convolutional Neural Networks [J].
Ben-Shabat, Yizhak ;
Lindenbaum, Michael ;
Fischer, Anath .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :3145-3152
[6]  
Chen MD, 2022, Arxiv, DOI [arXiv:2203.09065, DOI 10.48550/ARXIV.2203.09065,2203.09065, DOI 10.48550/ARXIV.2203.09065]
[7]   PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis [J].
Cheng, Silin ;
Chen, Xiwu ;
He, Xinwei ;
Liu, Zhe ;
Bai, Xiang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :4436-4448
[8]  
Cho K., 2014, ARXIV
[9]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[10]   ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].
Dai, Angela ;
Chang, Angel X. ;
Savva, Manolis ;
Halber, Maciej ;
Funkhouser, Thomas ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443