Rotation Prediction Based Representative View Locating Framework for 3D Object Recognition

被引：3

作者：

Jin, Xun ^{[1
]}

Li, De ^{[1
]}

机构：

[1] Yanbian Univ, Dept Comp Sci, Yanji, Peoples R China

来源：

COMPUTER-AIDED DESIGN | 2022年 / 150卷

基金：

中国国家自然科学基金;

关键词：

3D object recognition; Rendered image; Representative view; Reinforcement learning; Deep learning; RETRIEVAL; CLASSIFICATION; NETWORKS; CNN;

D O I：

10.1016/j.cad.2022.103279

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Finding representative views of 3D objects is a key problem in the field of 3D object analysis. We can obtain most of the crucial information of 3D objects from their representative views. In this paper, we propose a framework for learning the features of multi-view rendered images extracted from 3D objects in order to locate representative views of 3D objects. The learning method includes a reinforcement learning based rotation direction prediction (RDP) method and a deep learning based rotation angle prediction (RAP) method. The RDP uses a deep deterministic policy gradient (DDPG) algorithm to learn rotation policies. We improved DDPG to make RDP more applicable for learning 3D object rotation action. RAP uses a convolutional neural network to predict the rotation angles of representative views. We also propose a 3D object classification network. The network reconstructs the rendered images using an encoder-decoder based rendered image reconstruction method and trains the images composed of the original and reconstructed images. Finally, a series of experiments are conducted to validate the feasibility of the proposed methods. Experimental results show the competitive performance of the proposed framework. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页数：11

共 50 条

[21] Appearance-Based 3D Object Approach to Human Ears Recognition
Dimov, Dimo T.
Cantoni, Virginio
BIOMETRIC AUTHENTICATION (BIOMET 2014), 2014, 8897 : 121 - 135
[22] A Voxelized Fractal Descriptor for 3D Object Recognition
Domenech, Jose Francisco
Escalona, Felix
Gomez-Donoso, Francisco
Cazorla, Miguel
IEEE ACCESS, 2020, 8 : 161958 - 161968
[23] A Frustum-based probabilistic framework for 3D object detection by fusion of LiDAR and camera data
Gong, Zheng
Lin, Haojia
Zhang, Dedong
Luo, Zhipeng
Zelek, John
Chen, Yiping
Nurunnabi, Abdul
Wang, Cheng
Li, Jonathan
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 : 90 - 100
[24] MHFP: Multi-view based hierarchical fusion pooling method for 3D shape recognition
Liang, Qi
Li, Qiang
Zhang, Lihu
Mi, Haixiao
Nie, Weizhi
Li, Xuanya
PATTERN RECOGNITION LETTERS, 2021, 150 : 214 - 220
[25] Adaptive Interaction-Based Multi-view 3D Object Reconstruction
Miao, Jun
Zheng, Yilin
Yan, Jie
Li, Lei
Chu, Jun
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 51 - 64
[26] Efficient object detection by prediction in 3D space
Pang, Yanwei
Jiang, Xiaoheng
Li, Xuelong
Pan, Jing
SIGNAL PROCESSING, 2015, 112 : 64 - 73
[27] MVPointNet: Multi-View Network for 3D Object Based on Point Cloud
Zhou, Weiguo
Jiang, Xin
Liu, Yun-Hui
IEEE SENSORS JOURNAL, 2019, 19 (24) : 12145 - 12152
[28] Uncertainty Prediction for Monocular 3D Object Detection
Mun, Junghwan
Choi, Hyukdoo
SENSORS, 2023, 23 (12)
[29] Explainable Feature Extraction and Prediction Framework for 3D Image Recognition Applied to Pneumonia Detection
Pintelas, Emmanuel
Livieris, Ioannis E.
Pintelas, Panagiotis
ELECTRONICS, 2023, 12 (12)
[30] A Progressive Multi-View Learning Approach for Multi-Loss Optimization in 3D Object Recognition
Prasad, Shitala
Li, Yiqun
Lin, Dongyun
Dong, Sheng
Nwe, Ma Tin Lay
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 707 - 711

← 1 2 3 4 5 →