S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

被引:1
|
作者
Montalbo, Francis Jesmar P. [1 ]
机构
[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines
关键词
Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;
D O I
10.1016/j.bspc.2024.106047
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Deep learning in medical imaging became a focal point in research, emphasizing techniques to automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of Videostroboscopy. While this technique offers visual oscillations of the vocal fold, manual examination of the glottis presents inherent challenges. In current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires substantial computational resource investment. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net incorporating separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-andexcitation blocks. These modifications tend to extract robust features while avoiding unnecessary parameter enlargement. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, boasting only 62 K parameters and operating at 0.21 G FLOPs. Impressively, it achieves a DSC of 88.73 % and IoU of 79.97 %. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Improved Segmentation by Adversarial U-Net
    Sriker, David
    Cohen, Dana
    Cahan, Noa
    Greenspan, Hayit
    MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [22] DRA U-Net: An Attention based U-Net Framework for 2D Medical Image Segmentation
    Zhang, Xian
    Feng, Ziyuan
    Zhong, Tianchi
    Shen, Sicheng
    Zhang, Ruolin
    Zhou, Lijie
    Zhang, Bo
    Wang, Wendong
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3936 - 3942
  • [23] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Kumar T. Rajamani
    Priya Rani
    Hanna Siebert
    Rajkumar ElagiriRamalingam
    Mattias P. Heinrich
    Signal, Image and Video Processing, 2023, 17 : 981 - 989
  • [24] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Rajamani, Kumar T.
    Rani, Priya
    Siebert, Hanna
    ElagiriRamalingam, Rajkumar
    Heinrich, Mattias P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 981 - 989
  • [25] OAU-net: Outlined Attention U-net for biomedical image segmentation
    Song, Haojie
    Wang, Yuefei
    Zeng, Shijie
    Guo, Xiaoyan
    Li, Zheheng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [26] Improved U-Net based on ResNet and SE-Net with dual attention mechanism for glottis semantic segmentation
    Ni, Jui-Chung
    Lee, Shih-Hsiung
    Shen, Yen-Cheng
    Yang, Chu-Sing
    MEDICAL ENGINEERING & PHYSICS, 2025, 136
  • [27] Dense and shuffle attention U-Net for automatic skin lesion segmentation
    Zhang, Guanzhong
    Wang, Shengsheng
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2022, 32 (06) : 2066 - 2079
  • [28] Cascaded hybrid residual U-Net for glioma segmentation
    Long, Jiaosong
    Ma, Guangzhi
    Liu, Hong
    Song, Enmin
    Hung, Chih-Cheng
    Xu, Xiangyang
    Jin, Renchao
    Zhuang, Yuzhou
    Liu, DaiYang
    Ma, Guangzhi
    Song, Enmin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (33-34) : 24929 - 24947
  • [29] Cascaded hybrid residual U-Net for glioma segmentation
    Jiaosong Long
    Guangzhi Ma
    Hong Liu
    Enmin Song
    Chih-Cheng Hung
    Xiangyang Xu
    Renchao Jin
    Yuzhou Zhuang
    DaiYang Liu
    Guangzhi Ma
    Enmin Song
    Multimedia Tools and Applications, 2020, 79 : 24929 - 24947
  • [30] Attention guided U-Net for accurate iris segmentation
    Lian, Sheng
    Luo, Zhiming
    Zhong, Zhun
    Lin, Xiang
    Su, Songzhi
    Li, Shaozi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 296 - 304