M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval

Cited by: 34

Authors:

Nie, Wei-Zhi [1]

Ren, Min-Jie [1]

Liu, An-An [1]

Mao, Zhendong [2]

Nie, Jie [3]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Univ Sci & Technol China, Sch Elect Informat Engn, Hefei 230052, Peoples R China

[3] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2021 / Vol. 23

Funding:

National Natural Science Foundation of China;

Keywords:

Three-dimensional displays; Solid modeling; Two dimensional displays; Computational modeling; Visualization; Feature extraction; Predictive models; Cross-domain retrieval; 3D model retrieval; multi-head attention; multiple graphs;

DOI:

10.1109/TMM.2020.3006371

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

2D image-based 3D model retrieval is a challenging research topic in the field of 3D model retrieval. The large gap between the two modalities, 2D images and 3D models, severely constrains retrieval performance. To address this problem, we propose a novel multi-branch graph convolution network (M-GCN). First, we compute the similarity between 2D images and 3D models from visual information to construct a cross-modality graph, which provides the initial relationship between images and 3D models. However, this relationship is inaccurate because of the differences between the modalities. We therefore employ a multi-head attention mechanism to generate a set of fully connected, edge-weighted graphs, which predict the hidden relationships between 2D images and 3D models and further strengthen the correlations used to generate node embeddings. Finally, we apply a max-pooling operation to fuse the information from the multiple graphs and generate fused node embeddings for retrieval. To validate our method, we evaluated M-GCN on the MI3DOR dataset, the SHREC 2018 track, and the SHREC 2014 track. The experimental results demonstrate the superiority of the proposed method over state-of-the-art methods.
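The pipeline outlined in the abstract (similarity graph, attention-generated graphs, per-graph convolution, max-pooling fusion) can be illustrated with a minimal NumPy sketch. It is built only from the abstract and is not the authors' implementation. Every function name (`cosine`, `attention_graphs`, `graph_conv`, `m_gcn_sketch`), the single-layer convolution, the row normalisation, the choice to treat the similarity graph as one extra branch, and the random features and weights (stand-ins for extracted visual features and learned parameters) are assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract the max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cosine(a, b, eps=1e-12):
    """Pairwise cosine similarity between the rows of a and the rows of b."""
    a = a / (np.linalg.norm(a, axis=1, keepdims=True) + eps)
    b = b / (np.linalg.norm(b, axis=1, keepdims=True) + eps)
    return a @ b.T

def attention_graphs(feats, w_q, w_k):
    """One fully connected, edge-weighted graph per attention head."""
    q = np.einsum("nd,hdk->hnk", feats, w_q)
    k = np.einsum("nd,hdk->hnk", feats, w_k)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(w_q.shape[-1])
    return softmax(scores, axis=-1)  # shape (heads, n, n); each row sums to 1

def graph_conv(adj, feats, w):
    """One graph-convolution step: row-normalised aggregation, linear map, ReLU."""
    adj = adj / adj.sum(axis=1, keepdims=True)
    return np.maximum(adj @ feats @ w, 0.0)

def m_gcn_sketch(feats, w_q, w_k, w_branch):
    # Initial cross-modality graph from visual similarity (assumption: cosine,
    # clipped at zero so that every edge weight is non-negative).
    sim_graph = np.maximum(cosine(feats, feats), 0.0)
    # Assumption: the similarity graph is used as one more branch next to the
    # attention-generated graphs.
    graphs = [sim_graph, *attention_graphs(feats, w_q, w_k)]
    branches = [graph_conv(a, feats, w) for a, w in zip(graphs, w_branch)]
    # Element-wise max-pooling across branches fuses the per-graph embeddings.
    return np.max(np.stack(branches), axis=0)

# Toy data: random vectors stand in for image and 3D-model features.
rng = np.random.default_rng(0)
n_images, n_models, dim, heads, out_dim = 4, 6, 16, 3, 8
feats = rng.normal(size=(n_images + n_models, dim))
w_q = rng.normal(size=(heads, dim, dim)) * 0.1
w_k = rng.normal(size=(heads, dim, dim)) * 0.1
w_branch = rng.normal(size=(heads + 1, dim, out_dim)) * 0.1

emb = m_gcn_sketch(feats, w_q, w_k, w_branch)
# Retrieval: rank the 3D models for the first query image by embedding similarity.
scores = cosine(emb[:n_images], emb[n_images:])
ranking = np.argsort(-scores[0])
print("3D models ranked for image 0:", ranking)
```

In this sketch the image and model nodes share a single graph, so retrieval reduces to comparing the fused embedding of a query image with the fused embeddings of the 3D model nodes. With untrained random weights the ranking carries no meaning; it only shows how the data flows through the branches.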


Pages: 1962-1976

Page count: 15
