Rotation Prediction Based Representative View Locating Framework for 3D Object Recognition

被引:3
|
作者
Jin, Xun [1 ]
Li, De [1 ]
机构
[1] Yanbian Univ, Dept Comp Sci, Yanji, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Rendered image; Representative view; Reinforcement learning; Deep learning; RETRIEVAL; CLASSIFICATION; NETWORKS; CNN;
D O I
10.1016/j.cad.2022.103279
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Finding representative views of 3D objects is a key problem in the field of 3D object analysis. We can obtain most of the crucial information of 3D objects from their representative views. In this paper, we propose a framework for learning the features of multi-view rendered images extracted from 3D objects in order to locate representative views of 3D objects. The learning method includes a reinforcement learning based rotation direction prediction (RDP) method and a deep learning based rotation angle prediction (RAP) method. The RDP uses a deep deterministic policy gradient (DDPG) algorithm to learn rotation policies. We improved DDPG to make RDP more applicable for learning 3D object rotation action. RAP uses a convolutional neural network to predict the rotation angles of representative views. We also propose a 3D object classification network. The network reconstructs the rendered images using an encoder-decoder based rendered image reconstruction method and trains the images composed of the original and reconstructed images. Finally, a series of experiments are conducted to validate the feasibility of the proposed methods. Experimental results show the competitive performance of the proposed framework. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Appearance-Based 3D Object Approach to Human Ears Recognition
    Dimov, Dimo T.
    Cantoni, Virginio
    BIOMETRIC AUTHENTICATION (BIOMET 2014), 2014, 8897 : 121 - 135
  • [22] A Voxelized Fractal Descriptor for 3D Object Recognition
    Domenech, Jose Francisco
    Escalona, Felix
    Gomez-Donoso, Francisco
    Cazorla, Miguel
    IEEE ACCESS, 2020, 8 : 161958 - 161968
  • [23] A Frustum-based probabilistic framework for 3D object detection by fusion of LiDAR and camera data
    Gong, Zheng
    Lin, Haojia
    Zhang, Dedong
    Luo, Zhipeng
    Zelek, John
    Chen, Yiping
    Nurunnabi, Abdul
    Wang, Cheng
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 159 : 90 - 100
  • [24] MHFP: Multi-view based hierarchical fusion pooling method for 3D shape recognition
    Liang, Qi
    Li, Qiang
    Zhang, Lihu
    Mi, Haixiao
    Nie, Weizhi
    Li, Xuanya
    PATTERN RECOGNITION LETTERS, 2021, 150 : 214 - 220
  • [25] Adaptive Interaction-Based Multi-view 3D Object Reconstruction
    Miao, Jun
    Zheng, Yilin
    Yan, Jie
    Li, Lei
    Chu, Jun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 51 - 64
  • [26] Efficient object detection by prediction in 3D space
    Pang, Yanwei
    Jiang, Xiaoheng
    Li, Xuelong
    Pan, Jing
    SIGNAL PROCESSING, 2015, 112 : 64 - 73
  • [27] MVPointNet: Multi-View Network for 3D Object Based on Point Cloud
    Zhou, Weiguo
    Jiang, Xin
    Liu, Yun-Hui
    IEEE SENSORS JOURNAL, 2019, 19 (24) : 12145 - 12152
  • [28] Uncertainty Prediction for Monocular 3D Object Detection
    Mun, Junghwan
    Choi, Hyukdoo
    SENSORS, 2023, 23 (12)
  • [29] Explainable Feature Extraction and Prediction Framework for 3D Image Recognition Applied to Pneumonia Detection
    Pintelas, Emmanuel
    Livieris, Ioannis E.
    Pintelas, Panagiotis
    ELECTRONICS, 2023, 12 (12)
  • [30] A Progressive Multi-View Learning Approach for Multi-Loss Optimization in 3D Object Recognition
    Prasad, Shitala
    Li, Yiqun
    Lin, Dongyun
    Dong, Sheng
    Nwe, Ma Tin Lay
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 707 - 711