An accurate volume estimation on single view object images by deep learning based depth map analysis and 3D reconstruction

被引：0

作者：

Radhamadhab Dalai

Nibedita Dalai

Kishore Kumar Senapati

机构：

[1] BIT Mesra,Computer Science & Engineering

[2] PMEC College,Civil Engineering

来源：

Multimedia Tools and Applications | 2023年 / 82卷

关键词：

Volume estimation; Pre-processing; Feature extraction; Depth map; 3DU- GNet; Single view; And deep learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The volume estimation of a rigid object from a single view object image is the important need in numerous automated vision based systems. The volume estimation on multiple view images are simple to estimate. But volume estimation on a single view object image is a difficult process and has significant importance in volume estimation. This work presents effective object volume estimation in both regular and irregular single view object images. Initially, the single view input images are pre-processed with Mean-median filtering. Afterwards, edge features are extracted by utilizing the Gaussian edge based laplacian operator and key points are extracted using the Scale invariant feature transform (SIFT) feature. The extracted features are considered for the shape analysis of the objects. Subsequently, VGG-ResNet framework is utilized for depth analysis based on the extracted features. The point clouds generation for the volume estimation is attained through the extracted features. Finally, the volume estimation on single view object is effectively attained through the hybrid 3 dimensional U-Net and graph neural network (Hybrid 3DU-GNet). This framework provides the 3D geometric creation for the accurate volume estimation. This provides the significant improvement on volume estimation. The presented methodology effectively estimates the volume on both regular and irregular single view object images. The presented approach is implemented in the working platform of MATLAB. The experimental results of the presented work is analysed with the different existing approaches and proved the significant improvement in performance metrics. The performance metrics are Accuracy (98.59%), precision (98.21%), recall (97.09%), computational time (3.2 seconds), R-squared (98.2%), (Mean absolute percentage error) MAPE (6.1%), and (Root mean squared error) RMSE (0.93).

引用

页码：28235 / 28258

页数：23

共 84 条

[1]

Chen P-H(2020)MVSNet++: learning depth-based attention pyramid features for multi-view stereo IEEE Trans Image Process 29 7261-7273

[2]

Yang H-C(2016)Two-view 3D reconstruction for food volume estimation IEEE Trans Multimed 19 1090-1099

[3]

Chen K-W(2016)Sufficient canonical correlation analysis IEEE Trans Image Process 6 610-2619

[4]

Chen Y-S(2021)SOSD-net: joint semantic object segmentation and depth estimation from monocular images Neurocomputing 440 251-263

[5]

Dehais J(2019)Volumetric estimation using 3D reconstruction method for grading of fruits Multimed Tools Appl 78 1613-1634

[6]

Anthimopoulos M(2020)Deep learning-based monocular depth estimation methods—a state-of-the-art review Sensors 20 2272-12

[7]

Shevchik S(2021)Flood depth mapping in street photos with image processing and deep neural networks Comput Environ Urban Syst 88 1-251

[8]

Mougiakakou S(2019)Maturity detection and volume estimation of apricot using image processing technique ScientiaHorticulturae 251 247-278

[9]

Guo Y(2021)Adaptive depth estimation for pyramid multi-view stereo Comput Graph 97 268-1819

[10]

Ding X(2019)Classification of tree species and stock volume estimation in ground forest images using deep learning Comput Electron Agric 166 105012-349

← 1 2 3 4 5 6 7 8 9 →