Multi-scale, multi-dimensional binocular endoscopic image depth estimation network

被引:0
|
作者
Wang, Xiongzhi [1 ,2 ]
Nie, Yunfeng [3 ]
Ren, Wenqi [5 ]
Wei, Min [4 ]
Zhang, Jingang [1 ,2 ]
机构
[1] Univ Chinese Acad Sci, Sch Future Technol, Beijing 100039, Peoples R China
[2] Xidian Univ, Sch Aerosp Science&Technol, Xian 710071, Peoples R China
[3] Vrije Univ Brussel & Flanders Make, Dept Appl Phys & Photon, Brussel Photon, B-1050 Brussels, Belgium
[4] Chinese Acad Sci, State Key Lab Informat Secur, Inst Informat Engn, Beijing 100093, Peoples R China
[5] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 4, Dept Orthoped, Beijing 100853, Peoples R China
基金
中国国家自然科学基金;
关键词
Depth estimation; Endoscopic datasets; Convolutional neural network; Stereoscopic vision; STEREO; COLONOSCOPY; LESIONS;
D O I
10.1016/j.compbiomed.2023.107305
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
During invasive surgery, the use of deep learning techniques to acquire depth information from lesion sites in real-time is hindered by the lack of endoscopic environmental datasets. This work aims to develop a high-accuracy three-dimensional (3D) simulation model for generating image datasets and acquiring depth information in real-time. Here, we proposed an end-to-end multi-scale supervisory depth estimation network (MMDENet) model for the depth estimation of pairs of binocular images. The proposed MMDENet highlights a multi-scale feature extraction module incorporating contextual information to enhance the correspondence precision of poorly exposed regions. A multi-dimensional information-guidance refinement module is also proposed to refine the initial coarse disparity map. Statistical experimentation demonstrated a 3.14% reduction in endpoint error compared to state-of-the-art methods. With a processing time of approximately 30fps, satisfying the requirements of real-time operation applications. In order to validate the performance of the trained MMDENet in actual endoscopic images, we conduct both qualitative and quantitative analysis with 93.38% high precision, which holds great promise for applications in surgical navigation.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Binocular Depth Estimation Algorithm Based on Multi-Scale Attention Feature Fusion
    Yang Huitong
    Lei Lang
    Lin Yongchun
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (18)
  • [2] Multi-scale depth classification network for monocular depth estimation
    Yang, Yi
    Tian, Lihua
    Li, Chen
    Zhang, Botong
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [3] Deep Multi-scale Convolutional Neural Network Method for Depth Estimation from a Single Image
    Ma, Zhaowei
    Niu, Yifeng
    Hu, Jia
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3984 - 3988
  • [4] MANET: MULTI-SCALE AGGREGATED NETWORK FOR LIGHT FIELD DEPTH ESTIMATION
    Li, Yan
    Zhang, Lu
    Wang, Qiong
    Lafruit, Gauthier
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1998 - 2002
  • [5] Monocular Depth Estimation With Multi-Scale Feature Fusion
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 678 - 682
  • [6] Monocular depth estimation with multi-scale feature fusion
    Wang Q.
    Zhang S.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (05): : 7 - 12
  • [7] Hand pose estimation with multi-scale network
    Zhongxu Hu
    Youmin Hu
    Bo Wu
    Jie Liu
    Dongmin Han
    Thomas Kurfess
    Applied Intelligence, 2018, 48 : 2501 - 2515
  • [8] Hand pose estimation with multi-scale network
    Hu, Zhongxu
    Hu, Youmin
    Wu, Bo
    Liu, Jie
    Han, Dongmin
    Kurfess, Thomas
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2501 - 2515
  • [9] DEPTH ESTIMATION OF MULTI-MODAL SCENE BASED ON MULTI-SCALE MODULATION
    Wang, Anjie
    Fang, Zhijun
    Jiang, Xiaoyan
    Gao, Yongbin
    Cao, Gaofeng
    Ma, Siwei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2795 - 2799
  • [10] Multi-Scale Mutual Feature Convolutional Neural Network for Depth Image Denoise and Enhancement
    Liao, Xuan
    Zhang, Xin
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,