RFNet: Convolutional Neural Network for 3D Point Cloud Classification

被引:0
作者
Shan X.-Y. [1 ]
Sun Z.-L. [2 ]
Zeng Z.-G. [3 ]
机构
[1] School of Electrical Engineering and Automation, Anhui University, Hefei
[2] School of Artificial Intelligence, Anhui University, Hefei
[3] School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan
来源
Zidonghua Xuebao/Acta Automatica Sinica | 2023年 / 49卷 / 11期
基金
中国国家自然科学基金;
关键词
3D point cloud; attention mechanism; Deep learning; loss function; point cloud classification;
D O I
10.16383/j.aas.c210532
中图分类号
TB18 [人体工程学]; Q98 [人类学];
学科分类号
030303 ; 1201 ;
摘要
Due to the unstructured and disordered nature of point cloud, the classification accuracy is still needed to improve for the currently existed point cloud classification approaches. In this paper, an effective point cloud classification network is proposed by considering local structure construction, global feature aggregation, and loss function improvement. Firstly, in view of the unstructured nature of point cloud, the local structure is constructed via assigning different weights to those irregular neighborhood points, which are computed via the relationship between the center point feature and its neighbor feature. Moreover, a strategy of weighted average pooling (WAP) is designed to learn the attention score of each high-dimensional feature through self-attention mechanization, which can effectively aggregate redundant high-dimensional features while coping with point cloud disorder. In addition, a joint loss function (JL) is presented by utilizing the complementary relationship between the cross-entropy loss and the center loss, which increases the inter-class distance while reduces the intro-class distance, and further improves the classification ability of the network. Compared to several state-of-the-art networks, the experimental results on the synthetic dataset ModelNet40 and ShapeNetCore and the real-world dataset ScanObjectNN demonstrate the superiority of the overall network structure. © 2023 Science Press. All rights reserved.
引用
收藏
页码:2350 / 2359
页数:9
相关论文
共 30 条
[1]  
Guo Y, Bennamoun M, Sohel F, Min L, Wan J., 3D object recognition in cluttered scenes with local surface features: A survey, IEEE Transactions on Pattern Analysis & Machine Intelligence, 36, 11, pp. 2270-2287, (2014)
[2]  
Tian Yong-Lin, Shen Yu, Li Qiang, Wang Fei-Yue, Parallel point clouds: Point clouds generation and 3D model evolution via virtual-real interaction, Acta Automatica Sinica, 46, 12, pp. 2572-2582, (2020)
[3]  
Qi C R, Su H, Mo K, Guibas L J., PointNet: Deep learning on point sets for 3D classification and segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 77-85, (2017)
[4]  
Guo Y, Wang H, Hu Q, Liu H, Bennamoun M., Deep learning for 3D point clouds: A survey, IEEE Transactions on Pattern Analysis and Machine Intelligence, 43, 12, pp. 4438-4364, (2020)
[5]  
Su H, Maji S, Kalogerakis E, Learned-Miller E., Multi-view convolutional neural networks for 3D shape recognition, Proceedings of the IEEE International Conference on Computer Vision, pp. 945-953, (2015)
[6]  
Yang Z, Wang L., Learning relationships for multi-view 3D object recognition, Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7504-7513, (2019)
[7]  
Xu Si-Yu, Zhu Ji-Hua, Tian Zhi-Qiang, Li Yao-Chen, Pang Shan-Min, Stepwise refinement approach for registration of multi-view point sets, Acta Automatica Sinica, 45, 8, pp. 1486-1494, (2019)
[8]  
Maturana D, Scherer S., VoxNet: A 3D convolutional neural network for real-time object recognition, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 922-928, (2015)
[9]  
Riegler G, Osman U A, Geiger A., OctNet: Learning deep 3D representations at high resolutions, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6620-6629, (2017)
[10]  
Qi C R, Yi L, Su H, Guibas L J., PointNet++: Deep hierarchical feature learning on point sets in a metric space, Proceedings of the Advances in Neural Information Processing Systems, pp. 5099-5108, (2017)