Lightweight monocular absolute depth estimation based on attention mechanism

Cited by: 1
Authors
Jin, Jiayu [1, 2]
Tao, Bo [1]
Qian, Xinbo [2, 3]
Hu, Jiaxin [3]
Li, Gongfa [4]
Affiliations
[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan, Peoples R China
[2] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan, Peoples R China
[3] Wuhan Univ Sci & Technol, Precis Mfg Inst, Wuhan, Peoples R China
[4] Wuhan Univ Sci & Technol, Res Ctr Biomimet Robot & Intelligent Measurement &, Wuhan, Peoples R China
Keywords
lightweight network; deep learning; monocular depth estimation; channel attention; self-supervised
DOI
10.1117/1.JEI.33.2.023010
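Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract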
To avoid gaining accuracy at the expense of redundant models, we propose a lightweight network architecture that retains the high-precision advantage of the transformer and effectively combines it with a convolutional neural network. By greatly reducing the number of training parameters, this approach achieves high precision while remaining well suited for deployment on edge devices. A detail highlight module (DHM) is added to effectively fuse information from multiple scales, making the predicted depth more accurate and clearer. A dense geometric constraints module is introduced to recover accurate scale factors in autonomous driving without additional sensors. Experimental results demonstrate that, compared with Monodepth2, our model improves the accuracy from 98.1% to 98.3% and reduces the model parameters by about 80%.
Pages: 13
Related papers
50 in total
[31]   Monocular Depth Estimation Based on Deep Learning: A Survey [J].
Ruan Xiaogang ;
Yan Wenjing ;
Huang Jing ;
Guo Peiyuan ;
Guo Wei .
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, :2436-2440
[32]   Self-Supervised Monocular Depth Estimation for Traffic Scenes Based on Dual Attention Mechanism and Adaptive Cost Volume [J].
Wu G. ;
Liu W. ;
Hu J. ;
Cheng S. ;
Yang W.-X. ;
Sun L.-K. .
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (05) :1670-1678
[33]   Monocular depth estimation based on deep learning: An overview [J].
ChaoQiang Zhao ;
QiYu Sun ;
ChongZhen Zhang ;
Yang Tang ;
Feng Qian .
Science China Technological Sciences, 2020, 63 :1612-1627
[34]   Monocular depth estimation based on deep learning: An overview [J].
Zhao, ChaoQiang ;
Sun, QiYu ;
Zhang, ChongZhen ;
Tang, Yang ;
Qian, Feng .
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (09) :1612-1627
[35]   RENA-Depth: toward recursion representation enhancement in neighborhood attention guided lightweight self-supervised monocular depth estimation [J].
Yang, Chaochao ;
Lu, Yuanyao ;
Qiu, Yongsheng ;
Wang, Yuantao .
OPTICAL ENGINEERING, 2024, 63 (08)
[36]   Lightweight monocular depth estimation using a fusion-improved transformer [J].
Sui, Xin ;
Gao, Song ;
Xu, Aigong ;
Zhang, Cong ;
Wang, Changqiang ;
Shi, Zhengxu .
SCIENTIFIC REPORTS, 2024, 14 (01)
[37]   LightDepthNet: Lightweight CNN Architecture for Monocular Depth Estimation on Edge Devices [J].
Liu, Qingliang ;
Zhou, Shuai .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (04) :2389-2393
[38]   Monocular Image Depth Estimation Based on Multi-Scale Attention Oriented Network [J].
Liu J. ;
Wen J. ;
Liang Y. .
Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (12) :52-62
[39]   Transformer-based monocular depth estimation with hybrid attention fusion and progressive regression [J].
Liu, Peng ;
Zhang, Zonghua ;
Meng, Zhaozong ;
Gao, Nan .
NEUROCOMPUTING, 2025, 620
[40]   URNet: An UNet-Based Model with Residual Mechanism for Monocular Depth Estimation [J].
Duong, Hoang-Thanh ;
Chen, Hsi-Min ;
Chang, Che-Cheng .
ELECTRONICS, 2023, 12 (06)