A convolutional architecture for 3D model embedding using image views

被引：1

作者：

Labrada, Arniel ^{[1
]}

Bustos, Benjamin ^{[1
]}

Sipiran, Ivan ^{[1
]}

机构：

[1] Univ Chile, Dept Comp Sci, Santiago, Chile

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 03期

关键词：

3D model; Deep learning; Convolutional neural network; Embedding; CLASSIFICATION;

D O I：

10.1007/s00371-023-02872-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

During the last years, many advances have been made in tasks like 3D model retrieval, 3D model classification, and 3D model segmentation. The typical 3D representations such as point clouds, voxels, and polygon meshes are mostly suitable for rendering purposes, while their use for cognitive processes (retrieval, classification, segmentation) is limited due to their high redundancy and complexity. We propose a deep learning architecture to handle 3D models represented as sets of image views as input. Our proposed architecture combines other standard architectures, like Convolutional Neural Networks and autoencoders, for computing 3D model embeddings using sets of image views extracted from the 3D models, avoiding the common view pooling layer approach used in these cases. Our goal is to represent a 3D model as a vector with enough information so it can substitute the 3D model for high-level tasks. Since this vector is a learned representation which tries to capture the relevant information of a 3D model, we show that the embedding representation conveys semantic information that helps to deal with the similarity assessment of 3D objects. We compare our proposed embedding technique with state-of-the-art techniques for 3D Model Retrieval using the ShapeNet and ModelNet datasets. We show that the embeddings obtained with our proposed architecture allow us to obtain a high effectiveness score in both normalized and perturbed versions of the ShapeNet dataset while improving the training and inference times compared to the standard state-of-the-art techniques.

引用

页码：1601 / 1615

页数：15

共 50 条

[21] An Effective 3D ResNet Architecture for Stereo Image Retrieval
Ghodhbani, E.
Kaaniche, M.
Benazza-Benyahia, A.
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 380 - 387
[22] Improving efficiency in convolutional neural networks with 3D image filters
Uyar, Kuebra
Tasdemir, Sakir
Ulker, Erkan
Unlukal, Nejat
Solmaz, Merve
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 74
[23] 3D Model Tools for Architecture and Archaeology Reconstruction
Vlad, Ioan
Herban, Ioan Sorin
Stoian, Mircea
Vilceanu, Clara-Beatrice
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2015 (ICNAAM-2015), 2016, 1738
[24] DEEP POINT CONVOLUTIONAL APPROACH FOR 3D MODEL RETRIEVAL
Kuang, Zhenzhong
Yu, Jun
Fan, Jianping
Tan, Min
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
[25] CT-image of rock samples super resolution using 3D convolutional neural network
Wang, Yukai
Teng, Qizhi
He, Xiaohai
Feng, Junxi
Zhang, Tingrong
COMPUTERS & GEOSCIENCES, 2019, 133
[26] Deep 3D Pose Dictionary: 3D Human Pose Estimation from Single RGB Image Using Deep Convolutional Neural Network
Elbasiony, Reda
Gomaa, Walid
Ogata, Tetsuya
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 310 - 320
[27] Correction of color information of a 3D model using a range intensity image
Shinozaki, Megumi
Kusanagi, Masato
Umeda, Kazunori
Godin, Guy
Rioux, Marc
COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (11) : 1170 - 1179
[28] Hyperspectral image segmentation using 3D regularized subspace clustering model
Hinojosa, Carlos
Rojas, Fernando
Castillo, Sergio
Arguello, Henry
JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (01)
[29] Monocular image based 3D model retrieval using triplet network
Du Y.
Li H.
Yao C.
Cai Q.
Li, Haisheng (lihsh@btbu.edu.cn), 1691, Beijing University of Aeronautics and Astronautics (BUAA) (46): : 1691 - 1700
[30] KNOWLEDGE BASED 3D BUILDING MODEL RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS FROM LIDAR AND AERIAL IMAGERIES
Alidoost, F.
Arefi, H.
XXIII ISPRS CONGRESS, COMMISSION III, 2016, 41 (B3): : 833 - 840

← 1 2 3 4 5 →