SABV-Depth: A biologically inspired deep learning network for monocular depth estimation

Cited by: 12

Authors:

Wang, Junfan [1,2]

Chen, Yi [1,2]

Dong, Zhekang [1,2,3]

Gao, Mingyu [1,2]

Lin, Huipin [1,2]

Miao, Qiheng [4]

Affiliations:

[1] Hangzhou Dianzi Univ, Sch Elect Informat, Hangzhou 310018, Zhejiang, Peoples R China

[2] Zhejiang Prov Key Lab Equipment Elect, Hangzhou 310018, Zhejiang, Peoples R China

[3] Zhejiang Univ, Dept Elect Engn, Hangzhou 310027, Zhejiang, Peoples R China

[4] Zhejiang Huaruijie Technol Co Ltd, Hangzhou 310051, Zhejiang, Peoples R China

Source:

KNOWLEDGE-BASED SYSTEMS | 2023 / Vol. 263

Keywords:

Depth estimation; Biological vision; Mapping relationship; Self-attention mechanism; VISION; MODEL; CONSCIOUSNESS;

DOI:

10.1016/j.knosys.2023.110301

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Monocular depth estimation makes it possible for machines to perceive the real world. The prediction performance of deep-learning-based depth estimation networks is affected by the depth of the network and the locality of convolution operations. Imitating the biological visual system and its functional structure is becoming a research hotspot. In this paper, we study the interpretability relationship between the biological visual system and the monocular depth estimation network. By concretizing the attention mechanism in biological vision, we propose a monocular depth estimation network based on the self-attention mechanism, named SABV-Depth, which can improve prediction accuracy. Inspired by the biological visual interaction mechanism, we focus on information transfer between the modules of the network and improve its information retention ability, enabling the network to output depth maps rich in object and detail information. Further, a decoder module with an inner connection is proposed to recover depth maps with sharp edge contours. Our method is experimentally validated on the KITTI and NYU Depth V2 datasets. The results show that, compared with other works, the proposed method improves prediction accuracy. Its depth maps also contain more object and detail information and handle edges better. © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND


Pages: 14

References: 60 in total

