Deep Correlated Joint Network for 2-D Image-Based 3-D Model Retrieval

被引:5
|
作者
Nie, Wei-Zhi [1 ]
Liu, An-An [1 ]
Zhao, Sicheng [2 ]
Gao, Yue [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[3] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Solid modeling; Shape; Feature extraction; Visualization; Correlation; Loss measurement; Benchmark testing; 3-D model retrieval; cross-domain learning; deep metric learning; OBJECT RETRIEVAL; 3D;
D O I
10.1109/TCYB.2020.2995415
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose a novel deep correlated joint network (DCJN) approach for 2-D image-based 3-D model retrieval. First, the proposed method can jointly learn two distinct deep neural networks, which are trained for individual modalities to learn two deep nonlinear transformations for visual feature extraction from the co-embedding feature space. Second, we propose the global loss function for the DCJN, consisting of a discriminative loss and a correlation loss. The discriminative loss aims to minimize the intraclass distance of the extracted features and maximize the interclass distance of such features to a large margin within each modality, while the correlation loss focuses on mitigating the distribution discrepancy across different modalities. Consequently, the proposed method can realize cross-modality feature extraction guided by the defined global loss function to benefit the similarity measure between 2-D images and 3-D models. For a comparison experiment, we contribute the current largest 2-D image-based 3-D model retrieval dataset. Moreover, the proposed method was further evaluated on three popular benchmarks, including the 3-D Shape Retrieval Contest 2014, 2016, and 2018 benchmarks. The extensive comparison experimental results demonstrate the superiority of this method over the state-of-the-art methods.
引用
收藏
页码:1862 / 1871
页数:10
相关论文
共 50 条
  • [1] Monocular Image-Based 3-D Model Retrieval: A Benchmark
    Song, Dan
    Nie, Wei-Zhi
    Li, Wen-Hui
    Kankanhalli, Mohan
    Liu, An-An
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8114 - 8127
  • [2] Adaptive semantic transfer network for unsupervised 2D image-based 3D model retrieval
    Song, Dan
    Yang, Yuanxiang
    Li, Wenhui
    Shao, Zhuang
    Nie, Weizhi
    Li, Xuanya
    Liu, An-An
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
  • [3] Supervised Deep-Autoencoder for Depth Image-based 3D Model Retrieval
    Siddiqua, Ayesha
    Fan, Guoliang
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 939 - 946
  • [4] Deep Collaborative Attention Network for Hyperspectral Image Classification by Combining 2-D CNN and 3-D CNN
    Guo, Hao
    Liu, Jianjun
    Yang, Jinlong
    Xiao, Zhiyong
    Wu, Zebin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 4789 - 4802
  • [5] Sphere Image for 3-D Model Retrieval
    Ding, Ke
    Liu, Yun-Hui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (05) : 1369 - 1376
  • [6] Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval
    Liang, Qi
    Li, Qiang
    Nie, Weizhi
    Liu, An-An
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3443 - 3455
  • [7] 3-D/2-D registration by integrating 2-D information in 3-D
    Tomazevic, D
    Likar, B
    Pernus, F
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2006, 25 (01) : 17 - 27
  • [8] Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval
    Zhou, Yaqian
    Liu, Yu
    Zhou, Heyu
    Cheng, Zhiyong
    Li, Xuanya
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 7147 - 7159
  • [9] 3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval
    Nie, Wei-Zhi
    Jia, Wen-Wu
    Li, Wen-Hui
    Liu, An-An
    Zhao, Si-Cheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 (23) : 1021 - 1034
  • [10] Learning Pairwise Neural Network Encoder for Depth Image-based 3D Model Retrieval
    Zhu, Jing
    Zhu, Fan
    Wong, Edward K.
    Fang, Yi
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1227 - 1230