View-relation constrained global representation learning for multi-view-based 3D object recognition

被引:4
作者
Xu, Ruchang [1 ]
Mi, Qing [1 ]
Ma, Wei [1 ]
Zha, Hongbin [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100020, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object recognition; Multi-views; View-relation constraints; 3D global representation;
D O I
10.1007/s10489-022-03949-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-view observations provide complementary clues for 3D object recognition, but also include redundant information that appears different across views due to view-dependent projection, light reflection and self-occlusions. This paper presents a view-relation constrained global representation network (VCGR-Net) for 3D object recognition that can mitigate the view interference problem at all phases, from view-level source feature generation to multi-view feature aggregation. Specifically, we determine inter-view relations via LSTM implicitly. Based on the relations, we construct a two-stage feature selection module to filter features at each view according to their importance to the global representation and their reliability as observations at specific views. The selected features are then aggregated by referring to intra- and inter-view spatial context to generate global representation for 3D object recognition. Experiments on the ModelNet40 and ModelNet10 datasets demonstrate that the proposed method can suppress view interference and therefore outperform state-of-the-art methods in 3D object recognition.
引用
收藏
页码:7741 / 7750
页数:10
相关论文
共 36 条
  • [1] On visual similarity based 3D model retrieval
    Chen, DY
    Tian, XP
    Shen, YT
    Ming, OY
    [J]. COMPUTER GRAPHICS FORUM, 2003, 22 (03) : 223 - 232
  • [2] MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views
    Chen, Ke
    Oldja, Ryan
    Smolyanskiy, Nikolai
    Birchfield, Stan
    Popov, Alexander
    Wehr, David
    Eden, Ibrahim
    Pehserl, Joachim
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2288 - 2294
  • [3] Group-pair deep feature learning for multi-view 3d model retrieval
    Chen, Xiuxiu
    Liu, Li
    Zhang, Long
    Zhang, Huaxiang
    Meng, Lili
    Liu, Dongmei
    [J]. APPLIED INTELLIGENCE, 2022, 52 (02) : 2013 - 2022
  • [4] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [5] GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition
    Feng, Yifan
    Zhang, Zizhao
    Zhao, Xibin
    Ji, Rongrong
    Gao, Yue
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 264 - 272
  • [6] Fujiwara K, 2020, PROC CVPR IEEE, P11731, DOI 10.1109/CVPR42600.2020.01175
  • [7] 3D2SeqViews: Aggregating Sequential Views for 3D Global Feature Learning by CNN With Hierarchical Attention Aggregation
    Han, Zhizhong
    Lu, Honglei
    Liu, Zhenbao
    Vong, Chi-Man
    Liu, Yu-Shen
    Zwicker, Matthias
    Han, Junwei
    Chen, C. L. Philip
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 3986 - 3999
  • [8] SeqViews2SeqLabels: Learning 3D Global Features via Aggregating Sequential Views by RNN With Attention
    Han, Zhizhong
    Shang, Mingyang
    Liu, Zhenbao
    Vong, Chi-Man
    Liu, Yu-Shen
    Zwicker, Matthias
    Han, Junwei
    Chen, C. L. Philip
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (02) : 658 - 672
  • [9] Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition
    Huang, Jingjia
    Yan, Wei
    Li, Thomas
    Liu, Shan
    Li, Ge
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 188 - 201
  • [10] Jiang JW, 2019, AAAI CONF ARTIF INTE, P8513