Deformable convolutional networks for multi-view 3D shape classification

被引：12

作者：

Ma, Pengfei ^{[1
]}

Ma, Jie ^{[1
]}

Wang, Xujiao ^{[1
]}

Yang, Lichuang ^{[1
]}

Wang, Nannan ^{[1
]}

机构：

[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China

来源：

ELECTRONICS LETTERS | 2018年 / 54卷 / 24期

关键词：

learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;

D O I：

10.1049/el.2018.6851

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This Letter suggests a novel method for improving the robustness and the geometric transformation modelling capability in multi-view convolutional networks (MVCNNs). First, the deformable convolutional networks are used to learn more details and features related to the geometric transformation which the standard convolutional neural networks cannot handle. Then a view-pooling layer is specifically designed for combining the descriptors from multiple views as the final representations of the 3D shapes. The key idea is to insert the deformable convolutional layer between the input and convolutional layer, making it possible to solve deformable 3D shape classification problems, which was a challenging task for MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40) over previous methods by a significant margin.

引用

页码：1373 / 1374

页数：2

共 50 条

[41] Multi-view Multi-task Feature Extraction for Web Image Classification
Zuo, Zhiqiang
Luo, Yong
Tao, Dacheng
Xu, Chao
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1137 - 1140
[42] Image Classification Via Multi-View Model
Cheng, Yanyun
Zhu, Songhao
Liang, Zhiwei
Xu, Guozheng
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3333 - 3337
[43] Multi-View Saliency Guided Deep Neural Network for 3-D Object Retrieval and Classification
Zhou, He-Yu
Liu, An-An
Nie, Wei-Zhi
Nie, Jie
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (06) : 1496 - 1506
[44] Study of 3D Finger Vein Biometrics on Imaging Device Design and Multi-View Verification
Song, Yizhuo
Zhao, Pengyang
Wang, Siqi
Liao, Qingmin
Yang, Wenming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 3043 - 3048
[45] Multi-View 3D Scene Abstraction From Drone-Captured RGB Images
Jeong, Wooseong
Kim, Jihun
Kweon, Hyeokjun
Yoon, Kuk-Jin
IEEE ACCESS, 2025, 13 : 27641 - 27656
[46] Exploring Recurrent Long-Term Temporal Fusion for Multi-View 3D Perception
Han, Chunrui
Yang, Jinrong
Sun, Jianjian
Ge, Zheng
Dong, Runpei
Zhou, Hongyu
Mao, Weixin
Peng, Yuang
Zhang, Xiangyu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (07): : 6544 - 6551
[47] Moving object recognition using multi-view three-dimensional convolutional neural networks
He, Tao
Mao, Hua
Yi, Zhang
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (12) : 3827 - 3835
[48] Saliency detection of textured 3D models based on multi-view information and texel descriptor
Zhang, Ya
Chen, Chunyi
Hu, Xiaojuan
Li, Ling
Li, Hailan
PEERJ COMPUTER SCIENCE, 2023, 9
[49] DEEP MULTI-VIEW MODELS FOR GLITCH CLASSIFICATION
Bahaadini, Sara
Rohani, Neda
Coughlin, Scott
Zevin, Michael
Kalogera, Vicky
Katsaggelos, Aggelos K.
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2931 - 2935
[50] Moving object recognition using multi-view three-dimensional convolutional neural networks
Tao He
Hua Mao
Zhang Yi
Neural Computing and Applications, 2017, 28 : 3827 - 3835

← 1 2 3 4 5 →