M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval

被引:34
作者
Nie, Wei-Zhi [1 ]
Ren, Min-Jie [1 ]
Liu, An-An [1 ]
Mao, Zhendong [2 ]
Nie, Jie [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Univ Sci & Technol China, Sch Elect Informat Engn, Hefei 230052, Peoples R China
[3] Ocean Univ China, Coll Informat Sci & Engn, Qingdao 266100, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Solid modeling; Two dimensional displays; Computational modeling; Visualization; Feature extraction; Predictive models; Cross-domain retrieval; 3D model retrieval; multi-head attention; multiple graphs;
D O I
10.1109/TMM.2020.3006371
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
2D image based 3D model retrieval is a challenging research topic in the field of 3D model retrieval. The huge gap between two modalities - 2D image and 3D model, extremely constrains the retrieval performance. In order to handle this problem, we propose a novel multi-branch graph convolution network (M-GCN) to address the 2D image based 3D model retrieval problem. First, we compute the similarity between 2D image and 3D model based on visual information to construct one cross-modalities graph model, which can provide the original relationship between image and 3D model. However, this relationship is not accurate because of the difference of modalities. Thus, the multi-head attention mechanism is employed to generate a set of fully connected edge-weighted graphs, which can predict the hidden relationship between 2D image and 3D model to further strengthen the correlation for the embedding generation of nodes. Finally, we apply the max-pooling operation to fuse the multi-graphs information and generate the fusion embeddings of nodes for retrieval. To validate the performance of our method, we evaluated M-GCN on the MI3DOR dataset, Shrec 2018 track and Shrec 2014 track. The experimental results demonstrate the superiority of our proposed method over the state-of-the-art methods.
引用
收藏
页码:1962 / 1976
页数:15
相关论文
共 60 条
[1]  
Abdul-Rashid H., 2018, 11 EUROGRAPHICS WORK, P37
[2]  
Abdul-Rashid H., 2019, EUROGRAPHICSWORKSHOP
[3]  
[Anonymous], 2019, PROC 3DOR EUROGRAPHI
[4]  
[Anonymous], 2012, P 3DOR
[5]   Understanding deep features with computer-generated imagery [J].
Aubry, Mathieu ;
Russell, Bryan C. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2875-2883
[6]   Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models [J].
Aubry, Mathieu ;
Maturana, Daniel ;
Efros, Alexei A. ;
Russell, Bryan C. ;
Sivic, Josef .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3762-3769
[7]  
Battaglia Peter W, 2018, ARXIV180601261
[8]   Automated retrieval of 3D CAD model objects in construction range images [J].
Bosche, F. ;
Haas, C. T. .
AUTOMATION IN CONSTRUCTION, 2008, 17 (04) :499-512
[9]   A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications [J].
Cai, HongYun ;
Zheng, Vincent W. ;
Chang, Kevin Chen-Chuan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) :1616-1637
[10]   On visual similarity based 3D model retrieval [J].
Chen, DY ;
Tian, XP ;
Shen, YT ;
Ming, OY .
COMPUTER GRAPHICS FORUM, 2003, 22 (03) :223-232