DualMLP: a two-stream fusion model for 3D point cloud classification

被引:4
作者
Paul, Sneha [1 ]
Patterson, Zachary [1 ]
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
基金
英国科研创新办公室;
关键词
Point cloud classification; 3D computer vision; Supervised learning; NEURAL-NETWORKS;
D O I
10.1007/s00371-023-03114-3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we present DualMLP, a novel 3D model that introduces the idea of a two-stream network for existing 3D models to handle the trade-off between the number of points and the computational overhead. Existing works on point clouds use a small subset of points sampled from the entire 3D object as input. Although increasing the number of input points can enhance scene understanding, it also incurs a higher computational cost for existing networks. To tackle this challenge, we propose a novel architecture called DualMLP, which effectively mitigates the linear increase in computational expense as the number of input points grows. While we evaluate this concept on PointMLP and demonstrate its effectiveness, the idea can be applied to other existing models with minimal adjustments. DualMLP consists of two branches: DenseNet and SparseNet. The SparseNet, a relatively larger network, samples a small number of points from the complete point cloud, while the DenseNet, a lightweight network, takes in a larger number of points as input. Extensive experiments on the ScanObjectNN and ModelNet40 datasets demonstrate the effectiveness of the proposed model, achieving a 1.00% and 0.81% improvement over PointMLP for ScanObjectNN and ModelNet40 while being computationally efficient than the original PointMLP. To ensure the reproducibility of our experimental results, the code for this work is publicly available at https://github.com/snehaputul/DualMLP.
引用
收藏
页码:5435 / 5449
页数:15
相关论文
共 40 条
  • [1] DGCNN: A convolutional neural network over large-scale labeled graphs
    Anh Viet Phan
    Minh Le Nguyen
    Yen Lam Hoang Nguyen
    Lam Thu Bui
    [J]. NEURAL NETWORKS, 2018, 108 : 533 - 543
  • [2] [Anonymous], 2015, P CVPR, DOI DOI 10.1109/CVPR.2015.7298801
  • [3] Bruna J., 2013, ABS13126203 CORR, P1
  • [4] PointMixer: MLP-Mixer for Point Cloud Understanding
    Choe, Jaesung
    Park, Chunghyun
    Rameau, Francois
    Park, Jaesik
    Kweon, In So
    [J]. COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 620 - 640
  • [5] Geometric attentional dynamic graph convolutional neural networks for point cloud analysis
    Cui, Yiming
    Liu, Xin
    Liu, Hongmin
    Zhang, Jiyong
    Zare, Alina
    Fan, Bin
    [J]. NEUROCOMPUTING, 2021, 432 : 300 - 310
  • [6] Cui Yingqian, 2023, arXiv
  • [7] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [8] Deng-Ping Fan, 2020, Medical Image Computing and Computer Assisted Intervention - MICCAI 2020. 23rd International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12266), P263, DOI 10.1007/978-3-030-59725-2_26
  • [9] Fang Y, 2015, PROC CVPR IEEE, P2319, DOI 10.1109/CVPR.2015.7298845
  • [10] SlowFast Networks for Video Recognition
    Feichtenhofer, Christoph
    Fan, Haoqi
    Malik, Jitendra
    He, Kaiming
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6201 - 6210