VGPCNet: viewport group point clouds network for 3D shape recognition

Cited by: 3
Authors
Zhang, Ziyu [1, 2]
Yu, Yi [1, 2]
Da, Feipeng [1, 2]
Affiliations
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
Keywords
3D shape recognition; Point clouds; Viewport group
DOI
10.1007/s10489-023-04498-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D point cloud recognition is a fundamental and popular task in visual perception systems such as autonomous driving, robotics, and virtual reality. Because point clouds are sparse and irregular, previous 3D point networks perform convolution on nearby points and ignore the long-range dependence on the global structure. To solve this problem, we propose the Viewport Group Point Cloud Network (VGPCNet) for 3D shape recognition, in which features are grouped according to viewports instead of local neighboring points in order to model the long-range global context. First, we use the viewport as a proxy to capture both local and global features from an outside view of the object. The related points are grouped effectively and efficiently by their visibility attribute, which captures the inner local geometric details and also obtains the global structure from the outside viewport. Second, we use a graph-based feature consolidation module to enhance the viewport features by modeling interactions between different viewports. Finally, to aggregate a global representation from multiple viewport features, we propose a novel attention-based feature aggregation module. We evaluate VGPCNet on three widely used benchmarks, ModelNet40/10, ScanObjectNN, and ShapeNetCore55, for shape classification and retrieval tasks. Extensive experiments demonstrate the effectiveness of our method and its superior performance (94.1% on ModelNet40) over state-of-the-art methods.
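The grouping and aggregation steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the six axis-aligned viewports, the half-space test used as a stand-in for visibility, the max-pooled coordinate descriptor, and the fixed attention query are all assumptions made for illustration, and the graph-based consolidation step is omitted.

```python
import numpy as np

def viewport_directions():
    """Six axis-aligned unit view directions (a hypothetical viewport layout)."""
    eye = np.eye(3)
    return np.concatenate([eye, -eye], axis=0)  # shape (6, 3)

def group_by_viewport(points, directions):
    """Assumed visibility proxy: a point is assigned to a viewport when it lies
    on the side of the centroid that faces that viewport (positive dot product)."""
    centered = points - points.mean(axis=0)
    return (centered @ directions.T) > 0.0  # boolean mask, shape (N, V)

def viewport_features(points, mask):
    """Stand-in per-viewport descriptor: max-pool the coordinates of grouped points."""
    feats = np.zeros((mask.shape[1], points.shape[1]))
    for v in range(mask.shape[1]):
        grouped = points[mask[:, v]]
        if len(grouped) > 0:
            feats[v] = grouped.max(axis=0)
    return feats

def attention_aggregate(feats, query):
    """Softmax attention pooling over viewports using a fixed (not learned) query."""
    scores = feats @ query
    scores = scores - scores.max()  # subtract the max for numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights @ feats, weights

rng = np.random.default_rng(0)
points = rng.normal(size=(1024, 3))      # synthetic point cloud
dirs = viewport_directions()
mask = group_by_viewport(points, dirs)   # (1024, 6) viewport membership
feats = viewport_features(points, mask)  # (6, 3) one descriptor per viewport
global_feat, weights = attention_aggregate(feats, query=np.ones(3))
print(global_feat.shape, weights.shape)
```

In a trained network the descriptor and the attention query would be learned; the sketch only shows how a viewport-wise grouping differs from a nearest-neighbor grouping and how per-viewport features can be pooled into one global vector.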
Pages: 19060-19073
Number of pages: 14