3D shape knowledge graph for cross-domain 3D shape retrieval

被引:0
作者
Chang, Rihao [1 ]
Ma, Yongtao [1 ]
Hao, Tong [2 ,5 ]
Wang, Weijie [3 ]
Nie, Weizhi [4 ,6 ]
机构
[1] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China
[2] Tianjin Normal Univ, Sch Life Sci, Tianjin, Peoples R China
[3] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[4] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
[5] Tianjin Normal Univ, Sch Life Sci, Tianjin 300387, Peoples R China
[6] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
3-D; multimedia; CONVOLUTIONAL NEURAL-NETWORKS; MODEL RETRIEVAL; OBJECT CATEGORIZATION;
D O I
10.1049/cit2.12326
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The surge in 3D modelling has led to a pronounced research emphasis on the field of 3D shape retrieval. Numerous contemporary approaches have been put forth to tackle this intricate challenge. Nevertheless, effectively addressing the intricacies of cross-modal 3D shape retrieval remains a formidable undertaking, owing to inherent modality-based disparities. The authors present an innovative notion-termed "geometric words"-which functions as elemental constituents for representing entities through combinations. To establish the knowledge graph, the authors employ geometric words as nodes, connecting them via shape categories and geometry attributes. Subsequently, a unique graph embedding method for knowledge acquisition is devised. Finally, an effective similarity measure is introduced for retrieval purposes. Importantly, each 3D or 2D entity can anchor its geometric terms within the knowledge graph, thereby serving as a link between cross-domain data. As a result, the authors' approach facilitates multiple cross-domain 3D shape retrieval tasks. The authors evaluate the proposed method's performance on the ModelNet40 and ShapeNetCore55 datasets, encompassing scenarios related to 3D shape retrieval and cross-domain retrieval. Furthermore, the authors employ the established cross-modal dataset (MI3DOR) to assess cross-modal 3D shape retrieval. The resulting experimental outcomes, in conjunction with comparisons against state-of-the-art techniques, clearly highlight the superiority of our approach.
引用
收藏
页码:1199 / 1216
页数:18
相关论文
共 78 条
  • [1] Ahmed A, 2020, INT BHURBAN C APPL S, P290, DOI [10.1109/ibcast47879.2020.9044545, 10.1109/IBCAST47879.2020.9044545]
  • [2] Allen M, 2008, 2008 INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, PROCEEDINGS, P371, DOI 10.1109/IPSN.2008.45
  • [3] [Anonymous], 2016, Comput Sci
  • [4] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298801
  • [5] On time-constant robust tuning of fractional order [proportional derivative] controllers
    Badri, Vahid
    Tavazoei, Mohammad Saleh
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (05) : 1179 - 1186
  • [6] GIFT: Towards Scalable 3D Shape Retrieval
    Bai, Song
    Bai, Xiang
    Zhou, Zhichao
    Zhang, Zhaoxiang
    Tian, Qi
    Latecki, Longin Jan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) : 1257 - 1271
  • [7] Burgess Christopher P, 2019, ARXIV190111390
  • [8] On visual similarity based 3D model retrieval
    Chen, DY
    Tian, XP
    Shen, YT
    Ming, OY
    [J]. COMPUTER GRAPHICS FORUM, 2003, 22 (03) : 223 - 232
  • [9] Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings
    Chen, Kevin
    Choy, Christopher B.
    Savva, Manolis
    Chang, Angel X.
    Funkhouser, Thomas
    Savarese, Silvio
    [J]. COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 : 100 - 116
  • [10] Chen L.-C., 2017, Rethinking Atrous Convolution for Semantic Image Segmentation