3D model retrieval based on multi-view attentional convolutional neural network

被引：8

作者：

Liu, An-An ^{[1
]}

Zhou, He-Yu ^{[1
]}

Li, Meng-Jie ^{[1
]}

Nie, Wei-Zhi ^{[1
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2020年 / 79卷 / 7-8期

基金：

中国国家自然科学基金;

关键词：

3D model retrieval; Multi-view; CNN; LSTM; SHAPE DESCRIPTOR;

D O I：

10.1007/s11042-019-7521-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a discriminative Multi-View Attentional Convolutional Neural Network, dubbed as MVA-CNN, which takes the multiple views of an shape as input and output the object category. Unlike previous view-based approaches that simply "compile" the view features into a compact 3D descriptors, our method can discover the context among multiple views in both the visual and spatial domain. First, we extract multiple rendered images from a 3D object by virtual cameras, and then we use Convolutional Neural Network (CNN) to abstract the information of the views. Second, we aggregate the visual views by two steps: 1). an element-wise maximum operation across the view features is adopted to discover discriminative features. 2). a soft attention mechanism is used to dynamically adjust the shape descriptors for better representing the spatial information. The entire network can be trained in an end-to-end way with the standard backpropagation. We verify the effectiveness of MVA-CNN on two widely used datasets: ModelNet10, ModelNet40 by comparing our method with state-of-the-art methods.

引用

页码：4699 / 4711

页数：13

共 47 条

[1]

[Anonymous], 2015, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2015.7298801, DOI 10.1109/CVPR.2015.7298801]

[2]

[Anonymous], 2002, SMA '02, DOI 10.1145/566282.566322

[3]

[Anonymous], 2016, ARXIV160306208

[4]

[Anonymous], ARXIV180400586

[5]

Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]

[6] GIFT: A Real-time and Scalable 3D Shape Search Engine [J].

Bai, Song ;

Bai, Xiang ;

Zhou, Zhichao ;

Zhang, Zhaoxiang ;

Latecki, Longin Jan .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5023-5032

[7] Automated retrieval of 3D CAD model objects in construction range images [J].

Bosche, F. ;

Haas, C. T. .

AUTOMATION IN CONSTRUCTION, 2008, 17 (04) :499-512

[8]

Cheng Zhiyong, 2018, ARXIV181105318

[9] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[10] Event Classification in Microblogs via Social Tracking [J].

Gao, Yue ;

Zhang, Hanwang ;

Zhao, Xibin ;

Yan, Shuicheng .

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (03)

← 1 2 3 4 5 →