A convolutional architecture for 3D model embedding using image views

被引:1
|
作者
Labrada, Arniel [1 ]
Bustos, Benjamin [1 ]
Sipiran, Ivan [1 ]
机构
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
来源
VISUAL COMPUTER | 2024年 / 40卷 / 03期
关键词
3D model; Deep learning; Convolutional neural network; Embedding; CLASSIFICATION;
D O I
10.1007/s00371-023-02872-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
During the last years, many advances have been made in tasks like 3D model retrieval, 3D model classification, and 3D model segmentation. The typical 3D representations such as point clouds, voxels, and polygon meshes are mostly suitable for rendering purposes, while their use for cognitive processes (retrieval, classification, segmentation) is limited due to their high redundancy and complexity. We propose a deep learning architecture to handle 3D models represented as sets of image views as input. Our proposed architecture combines other standard architectures, like Convolutional Neural Networks and autoencoders, for computing 3D model embeddings using sets of image views extracted from the 3D models, avoiding the common view pooling layer approach used in these cases. Our goal is to represent a 3D model as a vector with enough information so it can substitute the 3D model for high-level tasks. Since this vector is a learned representation which tries to capture the relevant information of a 3D model, we show that the embedding representation conveys semantic information that helps to deal with the similarity assessment of 3D objects. We compare our proposed embedding technique with state-of-the-art techniques for 3D Model Retrieval using the ShapeNet and ModelNet datasets. We show that the embeddings obtained with our proposed architecture allow us to obtain a high effectiveness score in both normalized and perturbed versions of the ShapeNet dataset while improving the training and inference times compared to the standard state-of-the-art techniques.
引用
收藏
页码:1601 / 1615
页数:15
相关论文
共 50 条
  • [21] An Effective 3D ResNet Architecture for Stereo Image Retrieval
    Ghodhbani, E.
    Kaaniche, M.
    Benazza-Benyahia, A.
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 380 - 387
  • [22] Improving efficiency in convolutional neural networks with 3D image filters
    Uyar, Kuebra
    Tasdemir, Sakir
    Ulker, Erkan
    Unlukal, Nejat
    Solmaz, Merve
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 74
  • [23] 3D Model Tools for Architecture and Archaeology Reconstruction
    Vlad, Ioan
    Herban, Ioan Sorin
    Stoian, Mircea
    Vilceanu, Clara-Beatrice
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2015 (ICNAAM-2015), 2016, 1738
  • [24] DEEP POINT CONVOLUTIONAL APPROACH FOR 3D MODEL RETRIEVAL
    Kuang, Zhenzhong
    Yu, Jun
    Fan, Jianping
    Tan, Min
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [25] CT-image of rock samples super resolution using 3D convolutional neural network
    Wang, Yukai
    Teng, Qizhi
    He, Xiaohai
    Feng, Junxi
    Zhang, Tingrong
    COMPUTERS & GEOSCIENCES, 2019, 133
  • [26] Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network
    Elbasiony, Reda
    Gomaa, Walid
    Ogata, Tetsuya
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 310 - 320
  • [27] Correction of color information of a 3D model using a range intensity image
    Shinozaki, Megumi
    Kusanagi, Masato
    Umeda, Kazunori
    Godin, Guy
    Rioux, Marc
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (11) : 1170 - 1179
  • [28] Hyperspectral image segmentation using 3D regularized subspace clustering model
    Hinojosa, Carlos
    Rojas, Fernando
    Castillo, Sergio
    Arguello, Henry
    JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (01)
  • [29] Monocular image based 3D model retrieval using triplet network
    Du Y.
    Li H.
    Yao C.
    Cai Q.
    Li, Haisheng (lihsh@btbu.edu.cn), 1691, Beijing University of Aeronautics and Astronautics (BUAA) (46): : 1691 - 1700
  • [30] KNOWLEDGE BASED 3D BUILDING MODEL RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS FROM LIDAR AND AERIAL IMAGERIES
    Alidoost, F.
    Arefi, H.
    XXIII ISPRS CONGRESS, COMMISSION III, 2016, 41 (B3): : 833 - 840