3D Point Cloud Generation to Understand Real Object Structure via Graph Convolutional Networks

被引:0
作者
Ashfaq, Hamid [1 ]
Alazeb, Abdulwahab [2 ]
Almakdi, Sultan [2 ]
Alshehri, Mohammed S. [2 ]
Almujally, Nouf Abdullah [3 ]
Rlotaibi, Sard S. [4 ]
Algarni, Asaad [5 ]
Jalal, Ahmad [1 ,6 ]
机构
[1] Air Univ, Dept Comp Sci, E-9, Islamabad 44000, Pakistan
[2] Najran Univ, Coll Comp Sci & Informat Syst, Dept Comp Sci, Najran 55461, Saudi Arabia
[3] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11671, Saudi Arabia
[4] King Saud Univ, Informat Technol Dept, Riyadh 24382, Saudi Arabia
[5] Northern Border Univ, Fac Comp & Informat Technol, Dept Comp Sci, Rafha 91911, Saudi Arabia
[6] Korea Univ, Coll Informat, Dept Comp Sci & Engn, Seoul 02841, South Korea
关键词
point cloud; 3D model reconstruction; generative adversarial network; graph; convolution network; real object structure; RECONSTRUCTION; STEREO; ACCURATE;
D O I
10.18280/ts.410613
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating and generating a three-dimensional (3D) model from a single image are challenging problems that have gained considerable attention from researchers in different fields of computer vision and artificial intelligence. Previously, there has been research work on single-angle and multi-view object use for 3D reconstruction. 3D data can be represented in many forms like meshes, voxels, and point clouds. This article presents 3D reconstruction using standard and state-of-the-art methods. Conventionally, to estimate the 3D many systems investigate multi-view images, stereo images, or object scanning with the support of additional sensors like Light Detection and Ranging (LiDAR) and depth sensors. The proposed semi-neural network system is the blend of neural network and also image processing filters and machine learning algorithms to extract features that have been used in the network. Three different types of features have been used in this paper that will help to estimate the 3D of the object from a single image. These features include semantic segmentation, depth of image, and surface normal. Semantic segmentation features have been extracted from the segmentation filter that has been exploited for extracting the object portion. Similarly, depth features have been used to estimate the object in the z-axis from NYUv2 dataset training using SENET-154 architecture. Finally surface normal features have been extracted based on estimated depth results using edge detection, and horizontal and vertical convolutional filters. Surface normal helps in determining the x, y and, z orientations of an object. The final representation of the object model has been in the form of a 3D point cloud. The resultant 3D point cloud has made it easy to analyze the model quality by points and distance representing intermodal and ground truth. In this article, three publicly available benchmark datasets have been used for system evaluation and experimental assessment including ShapeNetCore, ModelNet10 and ObjectNet3D datasets. The ShapeNetCore has archived an accuracy of 95.41% and chamfer distance of 0.00098, the ModelNet10 dataset has achieved an accuracy of 94.74% and chamfer distance of 0.00132 and finally, the ObjectNet3D dataset has achieved an accuracy of 95.53% and chamfer distance 0.00091. The results of many classes of the proposed system are outstanding at visualization as compared to standard methods.
引用
收藏
页码:2935 / 2946
页数:12
相关论文
共 50 条
[41]   Using 3D point cloud and graph-based neural networks to improve the estimation of pulmonary function tests from chest CT [J].
Jia, Jingnan ;
Yu, Bo ;
Mody, Prerak ;
Ninaber, Maarten K. ;
Schouffoer, Anne A. ;
de Vries-Bouwstra, Jeska K. ;
Kroft, Lucia J.M. ;
Staring, Marius ;
Stoel, Berend C. .
Computers in Biology and Medicine, 2024, 182
[42]   Point Cloud-Based 3D Object Classification With Non Local Attention and Lightweight Convolution Neural Networks [J].
Karthik, R. ;
Inamdar, Rohan ;
Sundarr, S. Kavin ;
Cho, Jaehyuk ;
Veerappampalayam Easwaramoorthy, Sathishkumar .
IEEE ACCESS, 2024, 12 :158530-158545
[43]   Joint embedding of structure and features via graph convolutional networks [J].
Lerique, Sebastien ;
Abitbol, Jacob Levy ;
Karsai, Marton .
APPLIED NETWORK SCIENCE, 2020, 5 (01)
[44]   Joint embedding of structure and features via graph convolutional networks [J].
Sébastien Lerique ;
Jacob Levy Abitbol ;
Márton Karsai .
Applied Network Science, 5
[45]   AUTOMATIC POLE-LIKE OBJECT MODELING VIA 3D PART-BASED ANALYSIS OF POINT CLOUD [J].
He, Liu ;
Yang, Haoxiang ;
Huang, Yuchun .
REMOTE SENSING TECHNOLOGIES AND APPLICATIONS IN URBAN ENVIRONMENTS, 2016, 10008
[46]   A Low-Power Graph Convolutional Network Processor With Sparse Grouping for 3D Point Cloud Semantic Segmentation in Mobile Devices [J].
Kim, Sangjin ;
Kim, Sangyeob ;
Lee, Juhyoung ;
Yoo, Hoi-Jun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (04) :1507-1518
[47]   PTA-Det: Point Transformer Associating Point Cloud and Image for 3D Object Detection [J].
Wan, Rui ;
Zhao, Tianyun ;
Zhao, Wei .
SENSORS, 2023, 23 (06)
[48]   A robust 3D point cloud watermarking method based on the graph Fourier transform [J].
Felipe A. B. S. Ferreira ;
Juliano B. Lima .
Multimedia Tools and Applications, 2020, 79 :1921-1950
[49]   Enhancing the Local Graph Semantic Feature for 3D Point Cloud Classification and Segmentation [J].
Wang, Yong ;
Tang, Xintong ;
Yue, Chenke .
IEEE ACCESS, 2022, 10 :74620-74628
[50]   A robust 3D point cloud watermarking method based on the graph Fourier transform [J].
Ferreira, Felipe A. B. S. ;
Lima, Juliano B. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) :1921-1950