Multi-scale, multi-dimensional binocular endoscopic image depth estimation network

被引:0
|
作者
Wang, Xiongzhi [1 ,2 ]
Nie, Yunfeng [3 ]
Ren, Wenqi [5 ]
Wei, Min [4 ]
Zhang, Jingang [1 ,2 ]
机构
[1] Univ Chinese Acad Sci, Sch Future Technol, Beijing 100039, Peoples R China
[2] Xidian Univ, Sch Aerosp Science&Technol, Xian 710071, Peoples R China
[3] Vrije Univ Brussel & Flanders Make, Dept Appl Phys & Photon, Brussel Photon, B-1050 Brussels, Belgium
[4] Chinese Acad Sci, State Key Lab Informat Secur, Inst Informat Engn, Beijing 100093, Peoples R China
[5] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 4, Dept Orthoped, Beijing 100853, Peoples R China
基金
中国国家自然科学基金;
关键词
Depth estimation; Endoscopic datasets; Convolutional neural network; Stereoscopic vision; STEREO; COLONOSCOPY; LESIONS;
D O I
10.1016/j.compbiomed.2023.107305
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
During invasive surgery, the use of deep learning techniques to acquire depth information from lesion sites in real-time is hindered by the lack of endoscopic environmental datasets. This work aims to develop a high-accuracy three-dimensional (3D) simulation model for generating image datasets and acquiring depth information in real-time. Here, we proposed an end-to-end multi-scale supervisory depth estimation network (MMDENet) model for the depth estimation of pairs of binocular images. The proposed MMDENet highlights a multi-scale feature extraction module incorporating contextual information to enhance the correspondence precision of poorly exposed regions. A multi-dimensional information-guidance refinement module is also proposed to refine the initial coarse disparity map. Statistical experimentation demonstrated a 3.14% reduction in endpoint error compared to state-of-the-art methods. With a processing time of approximately 30fps, satisfying the requirements of real-time operation applications. In order to validate the performance of the trained MMDENet in actual endoscopic images, we conduct both qualitative and quantitative analysis with 93.38% high precision, which holds great promise for applications in surgical navigation.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Swin-Depth: Using Transformers and Multi-Scale Fusion for Monocular-Based Depth Estimation
    Cheng, Zeyu
    Zhang, Yi
    Tang, Chengkai
    IEEE SENSORS JOURNAL, 2021, 21 (23) : 26912 - 26920
  • [42] Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8811 - 8822
  • [43] Image Segmentation-Based Multi-Focus Image Fusion Through Multi-Scale Convolutional Neural Network
    Du, Chaoben
    Gao, Shesheng
    IEEE ACCESS, 2017, 5 : 15750 - 15761
  • [44] Multi-scale hybrid three-dimensional-two-dimensional-attention boosted convolutional neural network for hyperspectral image classification
    Fu, Ximeng
    Wang, Gaoyu
    Wang, Chenyu
    Xu, Huanhuan
    Li, Huiying
    JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (02)
  • [45] LMSN:a lightweight multi-scale network for single image super-resolution
    Yiye Zou
    Xiaomin Yang
    Marcelo Keese Albertini
    Farhan Hussain
    Multimedia Systems, 2021, 27 : 845 - 856
  • [46] Image dehazing algorithm based on multi-scale concat convolutional neural network
    Qiao Dan
    Zhang Chuang
    Zhu Chen-yu
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (10) : 1420 - 1429
  • [47] Remote Sensing Image Fusion Using Multi-Scale Convolutional Neural Network
    Wei Shi
    ChaoBen Du
    BingBing Gao
    JiNing Yan
    Journal of the Indian Society of Remote Sensing, 2021, 49 : 1677 - 1687
  • [48] LMSN:a lightweight multi-scale network for single image super-resolution
    Zou, Yiye
    Yang, Xiaomin
    Albertini, Marcelo Keese
    Hussain, Farhan
    MULTIMEDIA SYSTEMS, 2021, 27 (04) : 845 - 856
  • [49] Image Interpolation Using Multi-Scale Attention-Aware Inception Network
    Ji, Jiahuan
    Zhong, Baojiang
    Ma, Kai-Kuang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9413 - 9428
  • [50] Lightweight multi-scale distillation attention network for image super-resolution
    Tang, Yinggan
    Hu, Quanwei
    Bu, Chunning
    KNOWLEDGE-BASED SYSTEMS, 2025, 309