Rotation Prediction Based Representative View Locating Framework for 3D Object Recognition

被引：3

作者：

Jin, Xun ^{[1
]}

Li, De ^{[1
]}

机构：

[1] Yanbian Univ, Dept Comp Sci, Yanji, Peoples R China

来源：

COMPUTER-AIDED DESIGN | 2022年 / 150卷

基金：

中国国家自然科学基金;

关键词：

3D object recognition; Rendered image; Representative view; Reinforcement learning; Deep learning; RETRIEVAL; CLASSIFICATION; NETWORKS; CNN;

D O I：

10.1016/j.cad.2022.103279

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Finding representative views of 3D objects is a key problem in the field of 3D object analysis. We can obtain most of the crucial information of 3D objects from their representative views. In this paper, we propose a framework for learning the features of multi-view rendered images extracted from 3D objects in order to locate representative views of 3D objects. The learning method includes a reinforcement learning based rotation direction prediction (RDP) method and a deep learning based rotation angle prediction (RAP) method. The RDP uses a deep deterministic policy gradient (DDPG) algorithm to learn rotation policies. We improved DDPG to make RDP more applicable for learning 3D object rotation action. RAP uses a convolutional neural network to predict the rotation angles of representative views. We also propose a 3D object classification network. The network reconstructs the rendered images using an encoder-decoder based rendered image reconstruction method and trains the images composed of the original and reconstructed images. Finally, a series of experiments are conducted to validate the feasibility of the proposed methods. Experimental results show the competitive performance of the proposed framework. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页数：11

共 50 条

[31] Efficient 3D object recognition in mobile edge environment
Mofei Song
Qi Guo
Journal of Cloud Computing, 11
[32] CURVATURE AUGMENTED DEEP LEARNING FOR 3D OBJECT RECOGNITION
Braeger, Sarah
Foroosh, Hassan
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3648 - 3652
[33] 3D convolutional neural network for object recognition: a review
Rahul Dev Singh
Ajay Mittal
Rajesh K. Bhatia
Multimedia Tools and Applications, 2019, 78 : 15951 - 15995
[34] A Multimodal 3D Object Detection Method Based on Double-Fusion Framework
Ge T.-A.
Li H.
Guo Y.
Wang J.-Y.
Zhou D.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3100 - 3110
[35] A Survey and Technology Trends of 3D Features for Object Recognition
Hashimoto, Manabu
Akizuki, Shuichi
Takei, Shoichi
ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2017, 100 (11) : 31 - 42
[36] 3D-Net: Monocular 3D object recognition for traffic monitoring
Rezaei, Mahdi
Azarmi, Mohsen
Mir, Farzam Mohammad Pour
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
[37] MMDistill: Multi-Modal BEV Distillation Framework for Multi-View 3D Object Detection
Jiao, Tianzhe
Chen, Yuming
Zhang, Zhe
Guo, Chaopeng
Song, Jie
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4307 - 4325
[38] 3D Object Recognition Based on Volumetric Representation Using Convolutional Neural Networks
Xu, Xiaofan
Corrigan, David
Dehghani, Alireza
Caulfield, Sam
Moloney, David
ARTICULATED MOTION AND DEFORMABLE OBJECTS, 2016, 9756 : 147 - 156
[39] Efficient 3D object recognition in mobile edge environment
Song, Mofei
Guo, Qi
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
[40] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION
Li, Shaoshuai
Liu, Fuyan
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1648 - 1653

← 1 2 3 4 5 →