Rotation Prediction Based Representative View Locating Framework for 3D Object Recognition

被引：3

作者：

Jin, Xun ^{[1
]}

Li, De ^{[1
]}

机构：

[1] Yanbian Univ, Dept Comp Sci, Yanji, Peoples R China

来源：

COMPUTER-AIDED DESIGN | 2022年 / 150卷

基金：

中国国家自然科学基金;

关键词：

3D object recognition; Rendered image; Representative view; Reinforcement learning; Deep learning; RETRIEVAL; CLASSIFICATION; NETWORKS; CNN;

D O I：

10.1016/j.cad.2022.103279

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Finding representative views of 3D objects is a key problem in the field of 3D object analysis. We can obtain most of the crucial information of 3D objects from their representative views. In this paper, we propose a framework for learning the features of multi-view rendered images extracted from 3D objects in order to locate representative views of 3D objects. The learning method includes a reinforcement learning based rotation direction prediction (RDP) method and a deep learning based rotation angle prediction (RAP) method. The RDP uses a deep deterministic policy gradient (DDPG) algorithm to learn rotation policies. We improved DDPG to make RDP more applicable for learning 3D object rotation action. RAP uses a convolutional neural network to predict the rotation angles of representative views. We also propose a 3D object classification network. The network reconstructs the rendered images using an encoder-decoder based rendered image reconstruction method and trains the images composed of the original and reconstructed images. Finally, a series of experiments are conducted to validate the feasibility of the proposed methods. Experimental results show the competitive performance of the proposed framework. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页数：11

共 50 条

[1] Review of multi-view 3D object recognition methods based on deep learning
Qi, Shaohua
Ning, Xin
Yang, Guowei
Zhang, Liping
Long, Peng
Cai, Weiwei
Li, Weijun
DISPLAYS, 2021, 69
[2] 3D object recognition based on pairwise Multi-view Convolutional Neural Networks
Gao, Z.
Wang, D. Y.
Xue, Y. B.
Xu, G. P.
Zhang, H.
Wang, Y. L.
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 305 - 315
[3] Progressive conditional GAN-based augmentation for 3D object recognition
Muzahid, A. A. M.
Wanggen, Wan
Sohel, Ferdous
Bennamoun, Mohammed
Hou, Li
Ullah, Hidayat
NEUROCOMPUTING, 2021, 460 : 20 - 30
[4] The method based on view-directional consistency constraints for robust 3D object recognition
Shimamura, Jun
Yoshida, Taiga
Taniguchi, Yukinobu
Yabushita, Hiroko
Sudo, Kyoko
Murasaki, Kazuhiko
2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 455 - 458
[5] A NEW ROTATION-INVARIANT DEEP NETWORK FOR 3D OBJECT RECOGNITION
Zhang, Yachi
Lu, Zongqing
Xue, Jing-Hao
Liao, Qingmin
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1606 - 1611
[6] Multi-view ensemble manifold regularization for 3D object recognition
Hong, Chaoqun
Yu, Jun
You, Jane
Chen, Xuhui
Tao, Dapeng
INFORMATION SCIENCES, 2015, 320 : 395 - 405
[7] Multi-view dual attention network for 3D object recognition
Wang, Wenju
Cai, Yu
Wang, Tao
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (04) : 3201 - 3212
[8] Multi-view dual attention network for 3D object recognition
Wenju Wang
Yu Cai
Tao Wang
Neural Computing and Applications, 2022, 34 : 3201 - 3212
[9] Deep models for multi-view 3D object recognition: a review
Alzahrani, Mona
Usman, Muhammad
Jarraya, Salma Kammoun
Anwar, Saeed
Helmy, Tarek
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (12)
[10] 3D object retrieval based on multi-view convolutional neural networks
Xi-Xi Li
Qun Cao
Sha Wei
Multimedia Tools and Applications, 2017, 76 : 20111 - 20124

← 1 2 3 4 5 →