A convolutional architecture for 3D model embedding using image views

Cited by: 1
Authors
Labrada, Arniel [1]
Bustos, Benjamin [1]
Sipiran, Ivan [1]
Affiliations
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
Source
VISUAL COMPUTER | 2024 / Vol. 40 / No. 03
Keywords
3D model; Deep learning; Convolutional neural network; Embedding; CLASSIFICATION;
DOI
10.1007/s00371-023-02872-4
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
In recent years, many advances have been made in tasks such as 3D model retrieval, classification, and segmentation. Typical 3D representations such as point clouds, voxels, and polygon meshes are mostly suited to rendering, while their use for cognitive processes (retrieval, classification, segmentation) is limited by their high redundancy and complexity. We propose a deep learning architecture that takes 3D models represented as sets of image views as input. The architecture combines standard components, such as Convolutional Neural Networks and autoencoders, to compute 3D model embeddings from sets of image views extracted from the 3D models, avoiding the view pooling layer commonly used in this setting. Our goal is to represent a 3D model as a vector with enough information that it can substitute for the 3D model in high-level tasks. Because this vector is a learned representation that tries to capture the relevant information of a 3D model, we show that the embedding conveys semantic information that helps with the similarity assessment of 3D objects. We compare our embedding technique with state-of-the-art techniques for 3D model retrieval on the ShapeNet and ModelNet datasets. We show that the embeddings obtained with our architecture achieve a high effectiveness score on both the normalized and perturbed versions of the ShapeNet dataset while improving training and inference times compared to standard state-of-the-art techniques.
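As a rough illustration of how an embedding vector can stand in for a 3D model in retrieval, the sketch below ranks stored models by similarity to a query embedding. It is not the authors' method: the cosine similarity measure, the function names `cosine_similarity` and `retrieve`, the model identifiers, and the three-dimensional toy vectors are all assumptions made for this example, and the abstract does not state which distance the paper uses.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve(query, database, k=2):
    """Return the k (name, score) pairs most similar to the query embedding."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in database.items()]
    # Highest similarity first.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]

# Hypothetical embeddings; real ones would come from a trained network.
database = {
    "chair_01": [0.9, 0.1, 0.0],
    "chair_02": [0.8, 0.2, 0.1],
    "table_01": [0.1, 0.9, 0.2],
}
query = [1.0, 0.0, 0.0]

results = retrieve(query, database, k=2)
print([name for name, _ in results])
```

In this toy setup the two chair embeddings rank above the table because they point in nearly the same direction as the query; once embeddings exist, retrieval reduces to a vector comparison and no longer touches the original meshes or views.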
Pages: 1601-1615
Number of pages: 15