MSDE-Net: A Multi-Scale Dual-Encoding Network for Surgical Instrument Segmentation

被引:2
作者
Yang, Lei [1 ]
Gu, Yuge [1 ]
Bian, Guibin [1 ,2 ]
Liu, Yanhong [1 ]
机构
[1] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Surgical instrument segmentation; transformer; dual-branch encoder; feature fusion; IMAGE SEGMENTATION; SURGERY; COLOR; ROBOT;
D O I
10.1109/JBHI.2023.3344716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Minimally invasive surgery, which relies on surgical robots and microscopes, demands precise image segmentation to ensure safe and efficient procedures. Nevertheless, achieving accurate segmentation of surgical instruments remains challenging due to the complexity of the surgical environment. To tackle this issue, this paper introduces a novel multiscale dual-encoding segmentation network, termed MSDE-Net, designed to automatically and precisely segment surgical instruments. The proposed MSDE-Net leverages a dual-branch encoder comprising a convolutional neural network (CNN) branch and a transformer branch to effectively extract both local and global features. Moreover, an attention fusion block (AFB) is introduced to ensure effective information complementarity between the dual-branch encoding paths. Additionally, a multilayer context fusion block (MCF) is proposed to enhance the network's capacity to simultaneously extract global and local features. Finally, to extend the scope of global feature information under larger receptive fields, a multi-receptive field fusion (MRF) block is incorporated. Through comprehensive experimental evaluations on two publicly available datasets for surgical instrument segmentation, the proposed MSDE-Net demonstrates superior performance compared to existing methods.
引用
收藏
页码:4072 / 4083
页数:12
相关论文
共 62 条
  • [41] A survey of graph theoretical approaches to image segmentation
    Peng, Bo
    Zhang, Lei
    Zhang, David
    [J]. PATTERN RECOGNITION, 2013, 46 (03) : 1020 - 1038
  • [42] Ramesh K., 2021, EAI Endorsed Transactions on Pervasive Health and Technology, V7, P6, DOI [10.4108/eai.12-4-2021.169184, DOI 10.4108/EAI.12-4-2021.169184]
  • [43] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [44] Iris Segmentation Using Geodesic Active Contours
    Shah, Samir
    Ross, Arun
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2009, 4 (04) : 824 - 836
  • [45] Li-SegPNet: Encoder-Decoder Mode Lightweight Segmentation Network for Colorectal Polyps Analysis
    Sharma, Pallabi
    Gautam, Anmol
    Maji, Pallab
    Pachori, Ram Bilas
    Balabantaray, Bunil Kumar
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2023, 70 (04) : 1330 - 1339
  • [46] U-Net and Its Variants for Medical Image Segmentation: A Review of Theory and Applications
    Siddique, Nahian
    Paheding, Sidike
    Elkin, Colin P.
    Devabhaktuni, Vijay
    [J]. IEEE ACCESS, 2021, 9 : 82031 - 82057
  • [47] Multi-Scale Self-Guided Attention for Medical Image Segmentation
    Sinha, Ashish
    Dolz, Jose
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (01) : 121 - 130
  • [48] Srivastava A., 2021, P IEEE 4 INT C BIOEN, P1
  • [49] MSRF-Net: A Multi-Scale Residual Fusion Network for Biomedical Image Segmentation
    Srivastava, Abhishek
    Jha, Debesh
    Chanda, Sukalpa
    Pal, Umapada
    Johansen, Havard
    Johansen, Dag
    Riegler, Michael
    Ali, Sharib
    Halvorsen, Pal
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (05) : 2252 - 2263
  • [50] Touvron H, 2021, PR MACH LEARN RES, V139, P7358