3D shape knowledge graph for cross-domain 3D shape retrieval

被引：0

作者：

Chang, Rihao ^{[1
]}

Ma, Yongtao ^{[1
]}

Hao, Tong ^{[2
,5
]}

Wang, Weijie ^{[3
]}

Nie, Weizhi ^{[4
,6
]}

机构：

[1] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China

[2] Tianjin Normal Univ, Sch Life Sci, Tianjin, Peoples R China

[3] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy

[4] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China

[5] Tianjin Normal Univ, Sch Life Sci, Tianjin 300387, Peoples R China

[6] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2024年 / 9卷 / 05期

基金：

中国国家自然科学基金;

关键词：

3-D; multimedia; CONVOLUTIONAL NEURAL-NETWORKS; MODEL RETRIEVAL; OBJECT CATEGORIZATION;

D O I：

10.1049/cit2.12326

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The surge in 3D modelling has led to a pronounced research emphasis on the field of 3D shape retrieval. Numerous contemporary approaches have been put forth to tackle this intricate challenge. Nevertheless, effectively addressing the intricacies of cross-modal 3D shape retrieval remains a formidable undertaking, owing to inherent modality-based disparities. The authors present an innovative notion-termed "geometric words"-which functions as elemental constituents for representing entities through combinations. To establish the knowledge graph, the authors employ geometric words as nodes, connecting them via shape categories and geometry attributes. Subsequently, a unique graph embedding method for knowledge acquisition is devised. Finally, an effective similarity measure is introduced for retrieval purposes. Importantly, each 3D or 2D entity can anchor its geometric terms within the knowledge graph, thereby serving as a link between cross-domain data. As a result, the authors' approach facilitates multiple cross-domain 3D shape retrieval tasks. The authors evaluate the proposed method's performance on the ModelNet40 and ShapeNetCore55 datasets, encompassing scenarios related to 3D shape retrieval and cross-domain retrieval. Furthermore, the authors employ the established cross-modal dataset (MI3DOR) to assess cross-modal 3D shape retrieval. The resulting experimental outcomes, in conjunction with comparisons against state-of-the-art techniques, clearly highlight the superiority of our approach.

引用

页码：1199 / 1216

页数：18

共 78 条

[1] Ahmed A, 2020, INT BHURBAN C APPL S, P290, DOI [10.1109/ibcast47879.2020.9044545, 10.1109/IBCAST47879.2020.9044545]
[2] Allen M, 2008, 2008 INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, PROCEEDINGS, P371, DOI 10.1109/IPSN.2008.45
[3] [Anonymous], 2016, Comput Sci
[4] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298801
[5] On time-constant robust tuning of fractional order [proportional derivative] controllers
Badri, Vahid
Tavazoei, Mohammad Saleh
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (05) : 1179 - 1186
[6] GIFT: Towards Scalable 3D Shape Retrieval
Bai, Song
Bai, Xiang
Zhou, Zhichao
Zhang, Zhaoxiang
Tian, Qi
Latecki, Longin Jan
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) : 1257 - 1271
[7] Burgess Christopher P, 2019, ARXIV190111390
[8] On visual similarity based 3D model retrieval
Chen, DY
Tian, XP
Shen, YT
Ming, OY
[J]. COMPUTER GRAPHICS FORUM, 2003, 22 (03) : 223 - 232
[9] Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings
Chen, Kevin
Choy, Christopher B.
Savva, Manolis
Chang, Angel X.
Funkhouser, Thomas
Savarese, Silvio
[J]. COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 100 - 116
[10] Chen L.-C., 2017, Rethinking Atrous Convolution for Semantic Image Segmentation

← 1 2 3 4 5 6 7 8 →