Deep models for multi-view 3D object recognition: a review

被引：9

作者：

Alzahrani, Mona ^{[1
,2
]}

Usman, Muhammad ^{[1
,3
,5
]}

Jarraya, Salma Kammoun ^{[4
]}

Anwar, Saeed ^{[1
,3
]}

Helmy, Tarek ^{[1
,5
]}

机构：

[1] KFUPM, Dept Informat & Comp Sci, Dhahran, Saudi Arabia

[2] Jouf Univ, Coll Comp & Informat Sci, Sakaka, Saudi Arabia

[3] KFUPM, SDAIA KFUPM Joint Res Ctr Artificial Intelligence, Dhahran, Saudi Arabia

[4] KAU, Fac Comp & Informat Technol, Comp Sci Dept, Jeddah 21589, Saudi Arabia

[5] KFUPM, Ctr Intelligent Secure Syst, Dhahran, Saudi Arabia

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 2024年 / 57卷 / 12期

关键词：

3D object recognition; Multi-view object recognition; Multi-view conventional neural network; 3D object classification; 3D object retrieval; CONVOLUTIONAL NEURAL-NETWORK; CLASSIFICATION; CONTACTLESS; IMAGES;

D O I：

10.1007/s10462-024-10941-w

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This review paper focuses on the progress of deep learning-based methods for multi-view 3D object recognition. It covers the state-of-the-art techniques in this field, specifically those that utilize 3D multi-view data as input representation. The paper provides a comprehensive analysis of the pipeline for deep learning-based multi-view 3D object recognition, including the various techniques employed at each stage. It also presents the latest developments in CNN-based and transformer-based models for multi-view 3D object recognition. The review discusses existing models in detail, including the datasets, camera configurations, view selection strategies, pre-trained CNN architectures, fusion strategies, and recognition performance. Additionally, it examines various computer vision applications that use multi-view classification. Finally, it highlights future directions, factors impacting recognition performance, and trends for the development of multi-view 3D object recognition method.

引用

页数：71

共 124 条

[11] Flora Capture: a citizen science application for collecting structured plant observations [J].

Boho, David ;

Rzanny, Michael ;

Waeldchen, Jana ;

Nitsche, Fabian ;

Deggelmann, Alice ;

Wittich, Hans Christian ;

Seeland, Marco ;

Maeder, Patrick .

BMC BIOINFORMATICS, 2020, 21 (01)

[12]

Brock A, 2016, Arxiv, DOI arXiv:1608.04236

[13] Feature-based similarity search in 3D object databases [J].

Bustos, B ;

Keim, DA ;

Saupe, D ;

Schreck, T ;

Vranic, DV .

ACM COMPUTING SURVEYS, 2005, 37 (04) :345-387

[14] Review of Pavement Defect Detection Methods [J].

Cao, Wenming ;

Liu, Qifan ;

He, Zhiquan .

IEEE ACCESS, 2020, 8 :14531-14544

[15]

Chatfield K, 2014, Arxiv, DOI arXiv:1405.3531

[16] Visibility-Aware Point-Based Multi-View Stereo Network [J].

Chen, Rui ;

Han, Songfang ;

Xu, Jing ;

Su, Hao .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) :3695-3708

[17]

Chen S, 2021, Arxiv, DOI arXiv:2110.13083

[18] VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification [J].

Chen, Songle ;

Zheng, Lintao ;

Zhang, Yan ;

Sun, Zhixin ;

Xu, Kai .

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (12) :3244-3257

[19] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[20]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

← 1 2 3 4 5 6 7 8 9 10 →