Rotation Prediction Based Representative View Locating Framework for 3D Object Recognition

被引:3
|
作者
Jin, Xun [1 ]
Li, De [1 ]
机构
[1] Yanbian Univ, Dept Comp Sci, Yanji, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Rendered image; Representative view; Reinforcement learning; Deep learning; RETRIEVAL; CLASSIFICATION; NETWORKS; CNN;
D O I
10.1016/j.cad.2022.103279
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Finding representative views of 3D objects is a key problem in the field of 3D object analysis. We can obtain most of the crucial information of 3D objects from their representative views. In this paper, we propose a framework for learning the features of multi-view rendered images extracted from 3D objects in order to locate representative views of 3D objects. The learning method includes a reinforcement learning based rotation direction prediction (RDP) method and a deep learning based rotation angle prediction (RAP) method. The RDP uses a deep deterministic policy gradient (DDPG) algorithm to learn rotation policies. We improved DDPG to make RDP more applicable for learning 3D object rotation action. RAP uses a convolutional neural network to predict the rotation angles of representative views. We also propose a 3D object classification network. The network reconstructs the rendered images using an encoder-decoder based rendered image reconstruction method and trains the images composed of the original and reconstructed images. Finally, a series of experiments are conducted to validate the feasibility of the proposed methods. Experimental results show the competitive performance of the proposed framework. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Efficient 3D object recognition in mobile edge environment
    Mofei Song
    Qi Guo
    Journal of Cloud Computing, 11
  • [32] CURVATURE AUGMENTED DEEP LEARNING FOR 3D OBJECT RECOGNITION
    Braeger, Sarah
    Foroosh, Hassan
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3648 - 3652
  • [33] 3D convolutional neural network for object recognition: a review
    Rahul Dev Singh
    Ajay Mittal
    Rajesh K. Bhatia
    Multimedia Tools and Applications, 2019, 78 : 15951 - 15995
  • [34] A Multimodal 3D Object Detection Method Based on Double-Fusion Framework
    Ge T.-A.
    Li H.
    Guo Y.
    Wang J.-Y.
    Zhou D.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3100 - 3110
  • [35] A Survey and Technology Trends of 3D Features for Object Recognition
    Hashimoto, Manabu
    Akizuki, Shuichi
    Takei, Shoichi
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2017, 100 (11) : 31 - 42
  • [36] 3D-Net: Monocular 3D object recognition for traffic monitoring
    Rezaei, Mahdi
    Azarmi, Mohsen
    Mir, Farzam Mohammad Pour
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
  • [37] MMDistill: Multi-Modal BEV Distillation Framework for Multi-View 3D Object Detection
    Jiao, Tianzhe
    Chen, Yuming
    Zhang, Zhe
    Guo, Chaopeng
    Song, Jie
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4307 - 4325
  • [38] 3D Object Recognition Based on Volumetric Representation Using Convolutional Neural Networks
    Xu, Xiaofan
    Corrigan, David
    Dehghani, Alireza
    Caulfield, Sam
    Moloney, David
    ARTICULATED MOTION AND DEFORMABLE OBJECTS, 2016, 9756 : 147 - 156
  • [39] Efficient 3D object recognition in mobile edge environment
    Song, Mofei
    Guo, Qi
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
  • [40] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION
    Li, Shaoshuai
    Liu, Fuyan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1648 - 1653