Rotation Prediction Based Representative View Locating Framework for 3D Object Recognition

被引:3
|
作者
Jin, Xun [1 ]
Li, De [1 ]
机构
[1] Yanbian Univ, Dept Comp Sci, Yanji, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Rendered image; Representative view; Reinforcement learning; Deep learning; RETRIEVAL; CLASSIFICATION; NETWORKS; CNN;
D O I
10.1016/j.cad.2022.103279
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Finding representative views of 3D objects is a key problem in the field of 3D object analysis. We can obtain most of the crucial information of 3D objects from their representative views. In this paper, we propose a framework for learning the features of multi-view rendered images extracted from 3D objects in order to locate representative views of 3D objects. The learning method includes a reinforcement learning based rotation direction prediction (RDP) method and a deep learning based rotation angle prediction (RAP) method. The RDP uses a deep deterministic policy gradient (DDPG) algorithm to learn rotation policies. We improved DDPG to make RDP more applicable for learning 3D object rotation action. RAP uses a convolutional neural network to predict the rotation angles of representative views. We also propose a 3D object classification network. The network reconstructs the rendered images using an encoder-decoder based rendered image reconstruction method and trains the images composed of the original and reconstructed images. Finally, a series of experiments are conducted to validate the feasibility of the proposed methods. Experimental results show the competitive performance of the proposed framework. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Review of multi-view 3D object recognition methods based on deep learning
    Qi, Shaohua
    Ning, Xin
    Yang, Guowei
    Zhang, Liping
    Long, Peng
    Cai, Weiwei
    Li, Weijun
    DISPLAYS, 2021, 69
  • [2] 3D object recognition based on pairwise Multi-view Convolutional Neural Networks
    Gao, Z.
    Wang, D. Y.
    Xue, Y. B.
    Xu, G. P.
    Zhang, H.
    Wang, Y. L.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 305 - 315
  • [3] Progressive conditional GAN-based augmentation for 3D object recognition
    Muzahid, A. A. M.
    Wanggen, Wan
    Sohel, Ferdous
    Bennamoun, Mohammed
    Hou, Li
    Ullah, Hidayat
    NEUROCOMPUTING, 2021, 460 : 20 - 30
  • [4] The method based on view-directional consistency constraints for robust 3D object recognition
    Shimamura, Jun
    Yoshida, Taiga
    Taniguchi, Yukinobu
    Yabushita, Hiroko
    Sudo, Kyoko
    Murasaki, Kazuhiko
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 455 - 458
  • [5] A NEW ROTATION-INVARIANT DEEP NETWORK FOR 3D OBJECT RECOGNITION
    Zhang, Yachi
    Lu, Zongqing
    Xue, Jing-Hao
    Liao, Qingmin
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1606 - 1611
  • [6] Multi-view ensemble manifold regularization for 3D object recognition
    Hong, Chaoqun
    Yu, Jun
    You, Jane
    Chen, Xuhui
    Tao, Dapeng
    INFORMATION SCIENCES, 2015, 320 : 395 - 405
  • [7] Multi-view dual attention network for 3D object recognition
    Wang, Wenju
    Cai, Yu
    Wang, Tao
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04) : 3201 - 3212
  • [8] Multi-view dual attention network for 3D object recognition
    Wenju Wang
    Yu Cai
    Tao Wang
    Neural Computing and Applications, 2022, 34 : 3201 - 3212
  • [9] Deep models for multi-view 3D object recognition: a review
    Alzahrani, Mona
    Usman, Muhammad
    Jarraya, Salma Kammoun
    Anwar, Saeed
    Helmy, Tarek
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (12)
  • [10] 3D object retrieval based on multi-view convolutional neural networks
    Xi-Xi Li
    Qun Cao
    Sha Wei
    Multimedia Tools and Applications, 2017, 76 : 20111 - 20124