3D model retrieval based on multi-view attentional convolutional neural network

被引:8
作者
Liu, An-An [1 ]
Zhou, He-Yu [1 ]
Li, Meng-Jie [1 ]
Nie, Wei-Zhi [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
3D model retrieval; Multi-view; CNN; LSTM; SHAPE DESCRIPTOR;
D O I
10.1007/s11042-019-7521-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a discriminative Multi-View Attentional Convolutional Neural Network, dubbed as MVA-CNN, which takes the multiple views of an shape as input and output the object category. Unlike previous view-based approaches that simply "compile" the view features into a compact 3D descriptors, our method can discover the context among multiple views in both the visual and spatial domain. First, we extract multiple rendered images from a 3D object by virtual cameras, and then we use Convolutional Neural Network (CNN) to abstract the information of the views. Second, we aggregate the visual views by two steps: 1). an element-wise maximum operation across the view features is adopted to discover discriminative features. 2). a soft attention mechanism is used to dynamically adjust the shape descriptors for better representing the spatial information. The entire network can be trained in an end-to-end way with the standard backpropagation. We verify the effectiveness of MVA-CNN on two widely used datasets: ModelNet10, ModelNet40 by comparing our method with state-of-the-art methods.
引用
收藏
页码:4699 / 4711
页数:13
相关论文
共 47 条
[1]  
[Anonymous], 2015, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2015.7298801, DOI 10.1109/CVPR.2015.7298801]
[2]  
[Anonymous], 2002, SMA '02, DOI 10.1145/566282.566322
[3]  
[Anonymous], 2016, ARXIV160306208
[4]  
[Anonymous], ARXIV180400586
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6]   GIFT: A Real-time and Scalable 3D Shape Search Engine [J].
Bai, Song ;
Bai, Xiang ;
Zhou, Zhichao ;
Zhang, Zhaoxiang ;
Latecki, Longin Jan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5023-5032
[7]   Automated retrieval of 3D CAD model objects in construction range images [J].
Bosche, F. ;
Haas, C. T. .
AUTOMATION IN CONSTRUCTION, 2008, 17 (04) :499-512
[8]  
Cheng Zhiyong, 2018, ARXIV181105318
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]   Event Classification in Microblogs via Social Tracking [J].
Gao, Yue ;
Zhang, Hanwang ;
Zhao, Xibin ;
Yan, Shuicheng .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (03)