LIGHTWEIGHT MULTI-VIEW-GROUP NEURAL NETWORK FOR 3D SHAPE CLASSIFICATION

Cited by: 1
Authors
Sun, Jiaqi [1, 2]
Niu, Dongmei [1, 2]
Lv, Na [1, 2]
Dou, Wentao [1, 2]
Peng, Jingliang [1, 2]
Affiliations
[1] Jinan Univ, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China
[2] Jinan Univ, Sch Informat Sci & Engn, Jinan 250022, Peoples R China
Source
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023
Funding
National Natural Science Foundation of China;
Keywords
3D shape classification; lightweight; multi-view-group; neural network;
DOI
10.1109/ICIP49359.2023.10222295
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
In this work, we propose LiteMVGNet, a novel lightweight neural network for 3D shape classification. It operates on depth maps generated by multi-view rendering of the 3D model. LiteMVGNet is designed to be both lightweight and effective. First, the views and their corresponding depth maps are partitioned into groups. Next, depth map features for each group are extracted separately by an adapted MobileNetV2 block. Finally, the extracted group features are fused by an adapted MobileViT block. The views are partitioned according to their geometric semantics, and ECA-Net is used to facilitate the extraction of effective features. Experiments show that, compared with state-of-the-art benchmark models, the proposed model cuts the network parameter count by a third or more and reduces the floating-point operation count by as much as one to two orders of magnitude, while still yielding classification accuracies comparable to those of the benchmark models.
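As background for the parameter and operation savings reported in the abstract: MobileNetV2-style blocks are built on depthwise separable convolutions, which need far fewer weights than standard convolutions of the same kernel size. The sketch below works through that arithmetic for a single layer. The channel counts, kernel size, and feature-map size are illustrative assumptions, not figures from the paper, and the function names are hypothetical helpers rather than part of LiteMVGNet.

```python
def standard_conv_params(c_in, c_out, k):
    """Weights in one k x k standard convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in, c_out, k):
    """Weights in a k x k depthwise convolution followed by a
    1 x 1 pointwise convolution (bias omitted)."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


def conv_macs(params, out_h, out_w):
    """Multiply-accumulate count, assuming every weight is applied
    once at each of the out_h x out_w output positions."""
    return params * out_h * out_w


# Illustrative layer sizes (assumed for this example only).
c_in, c_out, k = 32, 64, 3
out_h, out_w = 56, 56

std = standard_conv_params(c_in, c_out, k)
sep = depthwise_separable_params(c_in, c_out, k)

print("standard conv weights:      ", std)
print("depthwise separable weights:", sep)
print("standard conv MACs:         ", conv_macs(std, out_h, out_w))
print("depthwise separable MACs:   ", conv_macs(sep, out_h, out_w))
```

For these assumed sizes the standard layer has 18432 weights and the separable one has 2336. The same ratio carries over to the multiply-accumulate count because both layers produce an output of the same spatial size. How much of the reduction reported in the abstract comes from this effect, as opposed to view grouping or the fusion block, is not stated in this record.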
Pages: 3409-3413
Page count: 5
Related papers
18 in total
[1]   [Anonymous], 2015, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
[2]   Chen Shuo, 2021, BRIT MACH VIS C
[3]   GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition [J].
Feng, Yifan ;
Zhang, Zizhao ;
Zhao, Xibin ;
Ji, Rongrong ;
Gao, Yue .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :264-272
[4]   3D2SeqViews: Aggregating Sequential Views for 3D Global Feature Learning by CNN With Hierarchical Attention Aggregation [J].
Han, Zhizhong ;
Lu, Honglei ;
Liu, Zhenbao ;
Vong, Chi-Man ;
Liu, Yu-Shen ;
Zwicker, Matthias ;
Han, Junwei ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) :3986-3999
[5]   SeqViews2SeqLabels: Learning 3D Global Features via Aggregating Sequential Views by RNN With Attention [J].
Han, Zhizhong ;
Shang, Mingyang ;
Liu, Zhenbao ;
Vong, Chi-Man ;
Liu, Yu-Shen ;
Zwicker, Matthias ;
Han, Junwei ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (02) :658-672
[6]   RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews from Unsupervised Viewpoints [J].
Kanezaki, Asako ;
Matsushita, Yasuyuki ;
Nishida, Yoshifumi .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5010-5019
[7]   Knopp J, 2010, LECT NOTES COMPUT SC, V6316, P589, DOI 10.1007/978-3-642-15567-3_43
[8]   Multi-view 3D object retrieval leveraging the aggregation of view and instance attentive features [J].
Lin, Dongyun ;
Li, Yiqun ;
Cheng, Yi ;
Prasad, Shitala ;
Nwe, Tin Lay ;
Dong, Sheng ;
Guo, Aiyuan .
KNOWLEDGE-BASED SYSTEMS, 2022, 247
[9]   Hierarchical multi-view context modelling for 3D object classification and retrieval [J].
Liu, An-An ;
Zhou, Heyu ;
Nie, Weizhi ;
Liu, Zhenguang ;
Liu, Wu ;
Xie, Hongtao ;
Mao, Zhendong ;
Li, Xuanya ;
Song, Dan .
INFORMATION SCIENCES, 2021, 547 :984-995
[10]   Mehta S, 2022, MobileViT: Light-weight, general-purpose, and mobile-friendly vision transformer