DualMLP: a two-stream fusion model for 3D point cloud classification

被引：4

作者：

Paul, Sneha ^{[1
]}

Patterson, Zachary ^{[1
]}

Bouguila, Nizar ^{[1
]}

机构：

[1] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 08期

基金：

英国科研创新办公室;

关键词：

Point cloud classification; 3D computer vision; Supervised learning; NEURAL-NETWORKS;

D O I：

10.1007/s00371-023-03114-3

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we present DualMLP, a novel 3D model that introduces the idea of a two-stream network for existing 3D models to handle the trade-off between the number of points and the computational overhead. Existing works on point clouds use a small subset of points sampled from the entire 3D object as input. Although increasing the number of input points can enhance scene understanding, it also incurs a higher computational cost for existing networks. To tackle this challenge, we propose a novel architecture called DualMLP, which effectively mitigates the linear increase in computational expense as the number of input points grows. While we evaluate this concept on PointMLP and demonstrate its effectiveness, the idea can be applied to other existing models with minimal adjustments. DualMLP consists of two branches: DenseNet and SparseNet. The SparseNet, a relatively larger network, samples a small number of points from the complete point cloud, while the DenseNet, a lightweight network, takes in a larger number of points as input. Extensive experiments on the ScanObjectNN and ModelNet40 datasets demonstrate the effectiveness of the proposed model, achieving a 1.00% and 0.81% improvement over PointMLP for ScanObjectNN and ModelNet40 while being computationally efficient than the original PointMLP. To ensure the reproducibility of our experimental results, the code for this work is publicly available at https://github.com/snehaputul/DualMLP.

引用

页码：5435 / 5449

页数：15

共 40 条

[1] DGCNN: A convolutional neural network over large-scale labeled graphs
Anh Viet Phan
Minh Le Nguyen
Yen Lam Hoang Nguyen
Lam Thu Bui
[J]. NEURAL NETWORKS, 2018, 108 : 533 - 543
[2] [Anonymous], 2015, P CVPR, DOI DOI 10.1109/CVPR.2015.7298801
[3] Bruna J., 2013, ABS13126203 CORR, P1
[4] PointMixer: MLP-Mixer for Point Cloud Understanding
Choe, Jaesung
Park, Chunghyun
Rameau, Francois
Park, Jaesik
Kweon, In So
[J]. COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 620 - 640
[5] Geometric attentional dynamic graph convolutional neural networks for point cloud analysis
Cui, Yiming
Liu, Xin
Liu, Hongmin
Zhang, Jiyong
Zare, Alina
Fan, Bin
[J]. NEUROCOMPUTING, 2021, 432 : 300 - 310
[6] Cui Yingqian, 2023, arXiv
[7] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
Dai, Angela
Qi, Charles Ruizhongtai
Niessner, Matthias
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
[8] Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
[9] Fang Y, 2015, PROC CVPR IEEE, P2319, DOI 10.1109/CVPR.2015.7298845
[10] SlowFast Networks for Video Recognition
Feichtenhofer, Christoph
Fan, Haoqi
Malik, Jitendra
He, Kaiming
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6201 - 6210

← 1 2 3 4 →