Deformable convolutional networks for multi-view 3D shape classification

被引:12
|
作者
Ma, Pengfei [1 ]
Ma, Jie [1 ]
Wang, Xujiao [1 ]
Yang, Lichuang [1 ]
Wang, Nannan [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
关键词
learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;
D O I
10.1049/el.2018.6851
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This Letter suggests a novel method for improving the robustness and the geometric transformation modelling capability in multi-view convolutional networks (MVCNNs). First, the deformable convolutional networks are used to learn more details and features related to the geometric transformation which the standard convolutional neural networks cannot handle. Then a view-pooling layer is specifically designed for combining the descriptors from multiple views as the final representations of the 3D shapes. The key idea is to insert the deformable convolutional layer between the input and convolutional layer, making it possible to solve deformable 3D shape classification problems, which was a challenging task for MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40) over previous methods by a significant margin.
引用
收藏
页码:1373 / 1374
页数:2
相关论文
共 50 条
  • [31] Generative Essential Graph Convolutional Network for Multi-View Semi-Supervised Classification
    Lu, Jielong
    Wu, Zhihao
    Zhong, Luying
    Chen, Zhaoliang
    Zhao, Hong
    Wang, Shiping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7987 - 7999
  • [32] Dilated 3D Convolutional Neural Networks for Brain MRI Data Classification
    Wang, Zijian
    Sun, Yaoru
    Shen, Qianzi
    Cao, Lei
    IEEE ACCESS, 2019, 7 : 134388 - 134398
  • [33] MVF-GNN: Multi-View Fusion With GNN for 3D Semantic Segmentation
    Du, Zhenxiang
    Ren, Minglun
    Chu, Wei
    Chen, Nengying
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (04): : 3262 - 3269
  • [34] Adaptive Multi-View and Temporal Fusing Transformer for 3D Human Pose Estimation
    Shuai, Hui
    Wu, Lele
    Liu, Qingshan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4122 - 4135
  • [35] FEATURE MATCHING OF MULTI-VIEW 3D MODELS BASED ON HASH BINARY ENCODING
    Li, H.
    Zhao, T.
    Li, N.
    Cai, Q.
    Du, J.
    NEURAL NETWORK WORLD, 2017, 27 (01) : 95 - 105
  • [36] PointMCD: Boosting Deep Point Cloud Encoders via Multi-View Cross-Modal Distillation for 3D Shape Recognition
    Zhang, Qijian
    Hou, Junhui
    Qian, Yue
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 754 - 767
  • [37] Disentangling 3D/4D Facial Affect Recognition With Faster Multi-View Transformer
    Behzad, Muzammil
    Li, Xiaobai
    Zhao, Guoying
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1913 - 1917
  • [38] Multi-View Graph Convolutional Network With Spectral Component Decompose for Remote Sensing Images Classification
    Cheng, Xijie
    He, Xiaohui
    Qiao, Mengjia
    Li, Panle
    Chang, Peng
    Zhang, Tianhao
    Guo, Xiaoyu
    Wang, Jinyong
    Tian, Zhihui
    Zhou, Guangsheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 3 - 18
  • [39] Scale-aware limited deformable convolutional neural networks for traffic sign detection and classification
    Liu, Zhanwen
    Shen, Chao
    Fan, Xing
    Zeng, Gaowen
    Zhao, Xiangmo
    IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (12) : 1712 - 1722
  • [40] Attention-driven multi-feature fusion for hyperspectral image classification via multi-criteria optimization and multi-view convolutional neural networks
    Abidi, Sofiene
    Sellami, Akrem
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138