Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition

被引:8
作者
Zeng, Hui [1 ,2 ]
Zhao, Tianmeng [1 ]
Cheng, Ruting [1 ]
Wang, Fuzhou [1 ]
Liu, Jiwei [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing Engn Res Ctr Ind Spectrum Imaging, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan 528399, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷 / 09期
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Object recognition; Two dimensional displays; Neural networks; Feature extraction; Solid modeling; Convolutional neural networks; 3D object recognition; multi-view convolutional neural network; graph attention network; feature aggregation; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3059853
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For multi-view convolutional neural network based 3D object recognition, how to fuse the information of multiple views is a key factor affecting the recognition performance. Most traditional methods use max-pooling algorithm to obtain the final 3D object feature, which does not take into account the correlative information between different views. To make full use of the effective information of multiple views, this paper introduces the hierarchical graph attention based multi-view convolutional neural network for 3D object recognition. At first, the view selection module is proposed to reduce redundant view information in multiple views, which can select the projective views with more effective information. Then, the correlation weighted feature aggregation module is proposed to better fuse multiple view features. Finally, the hierarchical feature aggregation network structure is designed to further to make full use of the correlation information of multiple views. Extensive experimental results have validated the effectiveness of the proposed method.
引用
收藏
页码:33323 / 33335
页数:13
相关论文
共 60 条
[1]  
[Anonymous], 2016, NEURIPS 3D DEEP LEAR
[2]   Long short-term memory [J].
Hochreiter, S ;
Schmidhuber, J .
NEURAL COMPUTATION, 1997, 9 (08) :1735-1780
[3]  
[Anonymous], 2016, ADV NEURAL INFORM PR
[4]  
Atwood J., 2016, P 30 INT C NEUR INF, P1993
[5]   GIFT: Towards Scalable 3D Shape Retrieval [J].
Bai, Song ;
Bai, Xiang ;
Zhou, Zhichao ;
Zhang, Zhaoxiang ;
Tian, Qi ;
Latecki, Longin Jan .
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) :1257-1271
[6]   GIFT: A Real-time and Scalable 3D Shape Search Engine [J].
Bai, Song ;
Bai, Xiang ;
Zhou, Zhichao ;
Zhang, Zhaoxiang ;
Latecki, Longin Jan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5023-5032
[7]  
Bruna J., 2013, 2 INT C LEARN REPR I
[8]  
Chang A. X., 2015, P COMP VIS PATT REC
[9]  
Chat~eld K., 2014, ARXIV14053531
[10]   VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification [J].
Chen, Songle ;
Zheng, Lintao ;
Zhang, Yan ;
Sun, Zhixin ;
Xu, Kai .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (12) :3244-3257