S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

Cited by: 1
Authors
Montalbo, Francis Jesmar P. [1]
Affiliations
[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines
Keywords
Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;
DOI
10.1016/j.bspc.2024.106047
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Deep learning in medical imaging has become a focal point of research, with an emphasis on techniques that automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of videostroboscopy. While this technique visualizes the oscillations of the vocal folds, manual examination of the glottis presents inherent challenges. In the current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires a substantial investment in computational resources. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net that incorporates separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-and-excitation blocks. These modifications help extract robust features while avoiding unnecessary parameter growth. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, with only 62K parameters and 0.21 GFLOPs. It achieves a Dice similarity coefficient (DSC) of 88.73% and an intersection over union (IoU) of 79.97%. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.
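As a rough illustration of two ideas in the abstract, the sketch below compares the parameter count of a standard convolution with that of a depthwise separable convolution, and computes DSC and IoU for a pair of binary masks. It is not the paper's implementation: the function names, the 3x3 kernel, and the 32-to-64 channel layer are hypothetical choices made only for this example, and the bias is assumed to sit on the output channels.

```python
# Minimal sketch, not the S3AR U-Net code. All names and layer sizes here
# are hypothetical and chosen only to illustrate the ideas in the abstract.

def conv_params(k, c_in, c_out, bias=True):
    """Parameters of a standard k x k convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)

def separable_conv_params(k, c_in, c_out, bias=True):
    """Parameters of a depthwise separable convolution:
    one k x k filter per input channel, then a 1 x 1 pointwise mix."""
    depthwise = k * k * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise + (c_out if bias else 0)

def dice_and_iou(pred, target):
    """DSC and IoU for two flat binary masks (sequences of 0/1)."""
    inter = sum(p & t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    union = total - inter
    if total == 0:
        # Both masks are empty: treat as a perfect match (an assumption).
        return 1.0, 1.0
    return 2 * inter / total, inter / union

if __name__ == "__main__":
    # Hypothetical layer: 3x3 kernel, 32 input channels, 64 output channels.
    print(conv_params(3, 32, 64))            # 18496
    print(separable_conv_params(3, 32, 64))  # 2400
    print(dice_and_iou([1, 1, 0, 0], [1, 0, 1, 0]))
```

For this hypothetical layer the separable form needs roughly one eighth of the parameters, which is the kind of saving a lightweight design relies on. For a single pair of masks, DSC and IoU are related by IoU = DSC / (2 - DSC); values averaged over a dataset, such as those reported in the abstract, need not satisfy this identity exactly.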
Pages: 20
Related papers
50 in total
  • [31] Composite Attention Residual U-Net for Rib Fracture Detection
    Wang, Xiaoming
    Wang, Yongxiong
    ENTROPY, 2023, 25 (03)
  • [32] U-Net with Attention Mechanism for Retinal Vessel Segmentation
    Si, Ze
    Fu, Dongmei
    Li, Jiahao
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 668 - 677
  • [33] Deep Learning Model Development with U-net Architecture for Glottis Segmentation
    Derdiman, Yasar Said
    Koc, Turgay
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [34] Attention Convolutional U-Net for Automatic Liver Tumor Segmentation
    Bibi, Asima
    Khan, Muhammad Salman
    2021 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT 2021), 2021, : 102 - 107
  • [35] CLA-U-Net: Convolutional Long-short-term-memory Attention-gated U-Net for Automatic Segmentation of the Left Ventricle in 2-D Echocardiograms
    Lin, Zihan
    Tsui, Po-Hsiang
    Zeng, Yan
    Bin, Guangyu
    Wu, Shuicai
    Zhou, Zhuhuang
    2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
  • [36] DARU-Net: A dual attention residual U-Net for uterine fibroids segmentation on MRI
    Zhang, Jian
    Liu, Yang
    Chen, Liping
    Ma, Si
    Zhong, Yuqing
    He, Zhimin
    Li, Chengwei
    Xiao, Zhibo
    Zheng, Yineng
    Lv, Fajin
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2023, 24 (06):
  • [37] Sharp dense U-Net: an enhanced dense U-Net architecture for nucleus segmentation
    Senapati, Pradip
    Basu, Anusua
    Deb, Mainak
    Dhal, Krishna Gopal
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (06) : 2079 - 2094
  • [38] Brain Tumor Segmentation with Attention-based U-Net
    Li, Tuofu
    Liu, Javin Jia
    Tai, Yintao
    Tian, Yuxuan
    SECOND IYSF ACADEMIC SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, 2021, 12079
  • [39] An attention based residual U-Net with swin transformer for brain MRI segmentation
    Angona, Tazkia Mim
    Mondal, M. Rubaiyat Hossain
    ARRAY, 2025, 25
  • [40] Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions
    Khan, Bakht Alam
    Jung, Jin-Woo
    APPLIED SCIENCES-BASEL, 2024, 14 (09):