Real-time monocular depth estimation with adaptive receptive fields

被引:0
|
作者
Zhenyan Ji
Xiaojun Song
Xiaoxuan Guo
Fangshi Wang
José Enrique Armendáriz-Iñigo
机构
[1] Beijing Jiaotong University,School of Software Engineering
[2] Public University of Navarre,Department of Statistics, Computer Science and Mathematics
来源
关键词
Monocular depth estimation; Adaptive receptive field; Real-time performance; Convolutional neural network;
D O I
暂无
中图分类号
学科分类号
摘要
Monocular depth estimation is a popular research topic in the field of autonomous driving. Nowadays many models are leading in accuracy but performing poorly in a real-time scenario. To effectively increase the depth estimation efficiency, we propose a novel model combining a multi-scale pyramid architecture for depth estimation together with adaptive receptive fields. The pyramid architecture reduces the trainable parameters from dozens of mega to less than 10 mega. Adaptive receptive fields are more sensitive to objects at different depth/distances in images, leading to better accuracy. We have adopted stacked convolution kernels instead of raw kernels to compress the model. Thus, the model that we proposed performs well in both real-time performance and estimation accuracy. We provide a set of experiments where our model performs better in terms of Eigen split than other previously known models. Furthermore, we show that our model is also better in runtime performance in regard to the depth estimation to the rest of models but the Pyd-Net model. Finally, our model is a lightweight depth estimation model with state-of-the-art accuracy.
引用
收藏
页码:1369 / 1381
页数:12
相关论文
共 50 条
  • [41] Robust Scale Estimation in Real-Time Monocular SFM for Autonomous Driving
    Song, Shiyu
    Chandraker, Manmohan
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1566 - 1573
  • [42] \ Real time monocular depth from defocus
    Leroy, Jean-Vincent
    Simon, Thierry
    Deschenes, Francois
    IMAGE AND SIGNAL PROCESSING, 2008, 5099 : 103 - +
  • [43] BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
    Li, Zhenyu
    Wang, Xuyang
    Liu, Xianming
    Jiang, Junjun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3964 - 3976
  • [44] An Adaptive Unsupervised Learning Framework for Monocular Depth Estimation
    Yang, Delong
    Zhong, Xunyu
    Lin, Lixiong
    Peng, Xiafu
    IEEE ACCESS, 2019, 7 : 148142 - 148151
  • [45] Real-time camera pose estimation for sports fields
    Leonardo Citraro
    Pablo Márquez-Neila
    Stefano Savarè
    Vivek Jayaram
    Charles Dubout
    Félix Renaut
    Andrés Hasfura
    Horesh Ben Shitrit
    Pascal Fua
    Machine Vision and Applications, 2020, 31
  • [46] Real-time camera pose estimation for sports fields
    Citraro, Leonardo
    Marquez-Neila, Pablo
    Savare, Stefano
    Jayaram, Vivek
    Dubout, Charles
    Renaut, Felix
    Hasfura, Andres
    Ben Shitrit, Horesh
    Fua, Pascal
    MACHINE VISION AND APPLICATIONS, 2020, 31 (03)
  • [47] REAL-TIME ADAPTIVE FILTERS FOR TIME-DELAY ESTIMATION
    DOKIC, MV
    CLARKSON, PM
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 1992, 6 (05) : 403 - 418
  • [48] Depth-Guided Aggregation for Real-Time Binocular Depth Estimation Network
    Fu, Dongxin
    Zheng, Shaowu
    Xie, Pengcheng
    Li, Weihua
    IEEE MULTIMEDIA, 2024, 31 (02) : 36 - 47
  • [49] Real-time Decentralized Monocular SLAM
    Bresson, Guillaume
    Aufrere, Romuald
    Chapuis, Roland
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS & VISION (ICARCV), 2012, : 1018 - 1023
  • [50] Real-time monocular object SLAM
    Galvez-Lopez, Dorian
    Salas, Marta
    Tardos, Juan D.
    Montiel, J. M. M.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 : 435 - 449