S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

被引:1
|
作者
Montalbo, Francis Jesmar P. [1 ]
机构
[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines
关键词
Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;
D O I
10.1016/j.bspc.2024.106047
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Deep learning in medical imaging became a focal point in research, emphasizing techniques to automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of Videostroboscopy. While this technique offers visual oscillations of the vocal fold, manual examination of the glottis presents inherent challenges. In current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires substantial computational resource investment. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net incorporating separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-andexcitation blocks. These modifications tend to extract robust features while avoiding unnecessary parameter enlargement. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, boasting only 62 K parameters and operating at 0.21 G FLOPs. Impressively, it achieves a DSC of 88.73 % and IoU of 79.97 %. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] MADRU-Net: Multiscale Attention-Based Cardiac MRI Segmentation Using Deep Residual U-Net
    Singh, Kamal Raj
    Sharma, Ambalika
    Singh, Girish Kumar
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 (1-13) : 1 - 13
  • [42] CS U-NET: A Medical Image Segmentation Method Integrating Spatial and Contextual Attention Mechanisms Based on U-NET
    Zhang, Fanyang
    Fan, Zhang
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2025, 35 (02)
  • [43] Dilated multi-scale residual attention (DMRA) U-Net: three-dimensional (3D) dilated multi-scale residual attention U-Net for brain tumor segmentation
    Zhang, Lihong
    Li, Yuzhuo
    Liang, Yingbo
    Xu, Chongxin
    Liu, Tong
    Sun, Junding
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (10) : 7249 - 7264
  • [44] A Few-Shot Attention Recurrent Residual U-Net for Crack Segmentation
    Katsamenis, Iason
    Protopapadakis, Eftychios
    Bakalos, Nikolaos
    Varvarigos, Andreas
    Doulamis, Anastasios
    Doulamis, Nikolaos
    Voulodimos, Athanasios
    ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 199 - 209
  • [45] Segmentation for Athlete's Ankle Injury Image Using Residual Double Attention U-Net Model
    Zhang, Jing
    Zhou, Jian
    Huang, Ming
    Raj, Raja Soosaimarian Peter
    BRAZILIAN ARCHIVES OF BIOLOGY AND TECHNOLOGY, 2023, 66
  • [46] DAU-Net: a novel U-Net with dual attention for retinal vessel segmentation
    Jian, Muwei
    Xu, Wenjing
    Nie, Changqun
    Li, Shuo
    Yang, Songwen
    Li, Xiaoguang
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (02):
  • [47] A Global and Local Enhanced Residual U-Net for Accurate Retinal Vessel Segmentation
    Lian, Sheng
    Li, Lei
    Lian, Guiren
    Xiao, Xiao
    Luo, Zhiming
    Li, Shaozi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (03) : 852 - 862
  • [48] A Dense-Gated U-Net for Brain Lesion Segmentation
    Ji, Zhongyi
    Han, Xiao
    Lin, Tong
    Wang, Wenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 104 - 107
  • [49] Multi-Convolutional Channel Residual Spatial Attention U-Net for Industrial and Medical Image Segmentation
    Chen, Haoyu
    Kim, Kyungbaek
    IEEE ACCESS, 2024, 12 : 76089 - 76101
  • [50] RAU-Net: U-Net Model Based on Residual and Attention for Kidney and Kidney Tumor Segmentation
    Guo, Jingna
    Zeng, Wei
    Yu, Sen
    Xiao, Junqiu
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS AND COMPUTER ENGINEERING (ICCECE), 2021, : 353 - 356