Deformable convolutional networks for multi-view 3D shape classification

Times cited: 12
Authors
Ma, Pengfei [1]
Ma, Jie [1]
Wang, Xujiao [1]
Yang, Lichuang [1]
Wang, Nannan [1]
Affiliations
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
Keywords
learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;
DOI
10.1049/el.2018.6851
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract
This Letter proposes a novel method for improving the robustness and the geometric transformation modelling capability of multi-view convolutional networks (MVCNNs). First, deformable convolutional networks are used to learn details and features related to geometric transformations that standard convolutional neural networks cannot handle. Then, a view-pooling layer is specifically designed to combine the descriptors from multiple views into the final representation of the 3D shape. The key idea is to insert a deformable convolutional layer between the input and the convolutional layer, making it possible to solve deformable 3D shape classification problems, which were a challenging task for the MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40), outperforming previous methods by a significant margin.
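The two building blocks named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The function names (`bilinear_sample`, `deformable_conv2d`, `view_pool`) are hypothetical. The offsets are passed in by hand, whereas a deformable convolutional layer would normally learn them. View pooling is assumed to be an element-wise maximum across views, as in the original MVCNN; the abstract does not specify the Letter's own design.

```python
import math

# The nine sampling positions of a regular 3x3 kernel, relative to the centre.
TAPS = [(ky, kx) for ky in (-1, 0, 1) for kx in (-1, 0, 1)]


def bilinear_sample(image, y, x):
    """Sample a 2D list-of-lists image at a fractional (y, x) position.

    Positions outside the image contribute zero (zero padding).
    """
    h, w = len(image), len(image[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yi in (y0, y0 + 1):
        for xi in (x0, x0 + 1):
            if 0 <= yi < h and 0 <= xi < w:
                weight = (1 - abs(y - yi)) * (1 - abs(x - xi))
                value += weight * image[yi][xi]
    return value


def deformable_conv2d(image, kernel, offsets):
    """3x3 deformable convolution, stride 1, zero padding 1.

    offsets[i][j] holds nine (dy, dx) pairs, one per kernel tap. Here they
    are supplied by the caller (an assumption made for illustration).
    """
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for k, (ky, kx) in enumerate(TAPS):
                dy, dx = offsets[i][j][k]
                # Regular grid position plus a per-tap fractional offset.
                sample = bilinear_sample(image, i + ky + dy, j + kx + dx)
                acc += kernel[ky + 1][kx + 1] * sample
            out[i][j] = acc
    return out


def view_pool(descriptors):
    """Element-wise max over per-view descriptor vectors (assumed pooling)."""
    return [max(values) for values in zip(*descriptors)]


def constant_offsets(h, w, dy, dx):
    """Helper: the same (dy, dx) offset for every tap at every position."""
    return [[[(dy, dx)] * 9 for _ in range(w)] for _ in range(h)]


image = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
identity = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]

# With zero offsets the layer reduces to an ordinary convolution.
plain = deformable_conv2d(image, identity, constant_offsets(3, 3, 0.0, 0.0))
# A half-pixel horizontal offset samples between neighbouring pixels.
shifted = deformable_conv2d(image, identity, constant_offsets(3, 3, 0.0, 0.5))
# Merge two per-view descriptors into a single shape descriptor.
pooled = view_pool([[1.0, 5.0, 2.0], [3.0, 0.0, 4.0]])
```

With all offsets set to zero, `deformable_conv2d` behaves like a standard 3x3 convolution. This is why such a layer can be placed in front of an existing convolutional stack without changing its behaviour at initialisation.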
Pages: 1373-1374
Number of pages: 2