Joint Soft-Hard Attention for Self-Supervised Monocular Depth Estimation

被引：5

作者：

Fan, Chao ^{[1
,2
,3
]}

Yin, Zhenyu ^{[2
,3
]}

Xu, Fulong ^{[1
,2
,3
]}

Chai, Anying ^{[1
,2
,3
]}

Zhang, Feiqing ^{[1
,2
,3
]}

机构：

[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[2] Chinese Acad Sci, Shenyang Inst Comp Technol, Shenyang 110168, Peoples R China

[3] Liaoning Key Lab Domest Ind Control Platform Tech, Shenyang 110168, Peoples R China

来源：

SENSORS | 2021年 / 21卷 / 21期

关键词：

monocular depth estimation; self-supervised learning; attention mechanism; VISION;

D O I：

10.3390/s21216956

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

In recent years, self-supervised monocular depth estimation has gained popularity among researchers because it uses only a single camera at a much lower cost than the direct use of laser sensors to acquire depth. Although monocular self-supervised methods can obtain dense depths, the estimation accuracy needs to be further improved for better applications in scenarios such as autonomous driving and robot perception. In this paper, we innovatively combine soft attention and hard attention with two new ideas to improve self-supervised monocular depth estimation: (1) a soft attention module and (2) a hard attention strategy. We integrate the soft attention module in the model architecture to enhance feature extraction in both spatial and channel dimensions, adding only a small number of parameters. Unlike traditional fusion approaches, we use the hard attention strategy to enhance the fusion of generated multi-scale depth predictions. Further experiments demonstrate that our method can achieve the best self-supervised performance both on the standard KITTI benchmark and the Make3D dataset.

引用

页数：17

共 50 条

[1] Self-Supervised Monocular Depth Estimation Based on Channel Attention
Tao, Bo
Chen, Xinbo
Tong, Xiliang
Jiang, Du
Chen, Baojia
PHOTONICS, 2022, 9 (06)
[2] Self-supervised monocular depth estimation via joint attention and intelligent mask loss
Guo, Peng
Pan, Shuguo
Gao, Wang
Khoshelham, Kourosh
MACHINE VISION AND APPLICATIONS, 2025, 36 (01)
[3] Revisiting Self-supervised Monocular Depth Estimation
Kim, Ue-Hwan
Lee, Gyeong-Min
Kim, Jong-Hwan
ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 336 - 350
[4] Self-supervised monocular depth estimation in fog
Tao, Bo
Hu, Jiaxin
Jiang, Du
Li, Gongfa
Chen, Baojia
Qian, Xinbo
OPTICAL ENGINEERING, 2023, 62 (03)
[5] Dual-attention-based semantic-aware self-supervised monocular depth estimation
Xu, Jinze
Ye, Feng
Lai, Yizong
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 65579 - 65601
[6] Self-Supervised Monocular Depth Estimation With Multiscale Perception
Zhang, Yourun
Gong, Maoguo
Li, Jianzhao
Zhang, Mingyang
Jiang, Fenlong
Zhao, Hongyu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3251 - 3266
[7] Self-supervised monocular depth estimation for gastrointestinal endoscopy
Liu, Yuying
Zuo, Siyang
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 238
[8] Self-Supervised Monocular Depth Estimation With Extensive Pretraining
Choi, Hyukdoo
IEEE ACCESS, 2021, 9 : 157236 - 157246
[9] Self-supervised monocular depth estimation with large kernel attention and dynamic scene perception
Xiang, Xuezhi
Wang, Yao
Li, Xiaoheng
Zhang, Lei
Zhen, Xiantong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 108
[10] Enhanced self-supervised monocular depth estimation with self-attention and joint depth-pose loss for laparoscopic images
Li, Wenda
Hayashi, Yuichiro
Oda, Masahiro
Kitasaka, Takayuki
Misawa, Kazunari
Mori, Kensaku
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2025, : 775 - 785

← 1 2 3 4 5 →