A new self-augment CNN for 3D point cloud classification and segmentation

被引:5
作者
Meng, Xinhong [1 ]
Lu, Xinyu [2 ]
Ye, Hailiang [2 ]
Yang, Bing [2 ]
Cao, Feilong [2 ]
机构
[1] Quanzhou Univ Informat Engn, Sch Elect & Commun Engn, Quanzhou 362000, Fujian, Peoples R China
[2] China Jiliang Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Point clouds; Self-augment; Classification; Segmentation;
D O I
10.1007/s13042-023-01940-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Point cloud classification and segmentation are challenging tasks due to the irregular structures, especially when there is translation variance in the point clouds. To overcome this barrier, this paper proposes a self-augment convolutional neural network (SACNN), which can not only extract more discriminative features from the points cloud but also alleviate the translation variance problem. Specifically, we first represent the point cloud through the dynamic graph, with the intention to keep the number of point clouds during the feature learning to avoid information loss. Benefiting from the dynamic graph, the global and local features of point clouds can be learned. Then, to reduce the translation variance in the dynamic graphs, a self-augment convolution (SAConv) module is designed to make points align their coordinates based on learned features. Finally, the local mixed aggregation module is proposed to combine the overview and the detailed descriptor of the neighbors. Experiments on several standard benchmarks verify the superiority of the SACNN over state-of-the-art methods in both 3D point cloud classification and segmentation tasks.
引用
收藏
页码:807 / 818
页数:12
相关论文
共 55 条
[1]   Point Convolutional Neural Networks by Extension Operators [J].
Atzmon, Matan ;
Maron, Haggai ;
Lipman, Yaron .
ACM TRANSACTIONS ON GRAPHICS, 2018, 37 (04)
[2]   3DmFV: Three-Dimensional Point Cloud Classification in Real-Time Using Convolutional Neural Networks [J].
Ben-Shabat, Yizhak ;
Lindenbaum, Michael ;
Fischer, Anath .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04) :3145-3152
[3]   Graph-Based Object Classification for Neuromorphic Vision Sensing [J].
Bi, Yin ;
Chadha, Aaron ;
Abbas, Alhabib ;
Bourtsoulatze, Eirina ;
Andreopoulos, Yiannis .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :491-501
[4]   ConvPoint: Continuous convolutions for point cloud processing [J].
Boulch, Alexandre .
COMPUTERS & GRAPHICS-UK, 2020, 88 :24-34
[5]  
Chang AX., 2015, Technical Report, V1512, P03012
[6]   3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction [J].
Choy, Christopher B. ;
Xu, Danfei ;
Gwak, Jun Young ;
Chen, Kevin ;
Savarese, Silvio .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :628-644
[7]  
Cicek O., 2016, Medical Image Computing and Computer-Assisted Intervention, V9901, P424, DOI [DOI 10.1007/978-3-319-46723-849, DOI 10.1007/978-3-319-46723-8_49, 10.1007/978-3-319-46723-8_49]
[8]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[9]   Multiresolution Tree Networks for 3D Point Cloud Processing [J].
Gadelha, Matheus ;
Wang, Rui ;
Maji, Subhransu .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :105-122
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587