S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

被引:1
|
作者
Montalbo, Francis Jesmar P. [1 ]
机构
[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines
关键词
Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;
D O I
10.1016/j.bspc.2024.106047
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Deep learning in medical imaging became a focal point in research, emphasizing techniques to automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of Videostroboscopy. While this technique offers visual oscillations of the vocal fold, manual examination of the glottis presents inherent challenges. In current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires substantial computational resource investment. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net incorporating separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-andexcitation blocks. These modifications tend to extract robust features while avoiding unnecessary parameter enlargement. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, boasting only 62 K parameters and operating at 0.21 G FLOPs. Impressively, it achieves a DSC of 88.73 % and IoU of 79.97 %. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Segmentation of the Thoracic Aorta using an Attention-Gated U-Net
    Zhong, Jiayang
    Bian, Zhangxing
    Hatt, Charles R.
    Burris, Nicholas S.
    MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [2] Chaining a U-Net With a Residual U-Net for Retinal Blood Vessels Segmentation
    Alfonso Francia, Gendry
    Pedraza, Carlos
    Aceves, Marco
    Tovar-Arriaga, Saul
    IEEE ACCESS, 2020, 8 : 38493 - 38500
  • [3] MSR U-Net: An Improved U-Net Model for Retinal Blood Vessel Segmentation
    Kande, Giri Babu
    Ravi, Logesh
    Kande, Nitya
    Nalluri, Madhusudana Rao
    Kotb, Hossam
    Aboras, Kareem M.
    Yousef, Amr
    Ghadi, Yazeed Yasin
    Sasikumar, A.
    IEEE ACCESS, 2024, 12 : 534 - 551
  • [4] Multimodal attention-gated cascaded U-Net model for automatic brain tumor detection and segmentation
    Chinnam, Siva Koteswara Rao
    Sistla, Venkatramaphanikumar
    Kolli, Venkata Krishna Kishore
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
  • [5] Recurrent residual U-Net for medical image segmentation
    Alom, Md Zahangir
    Yakopcic, Chris
    Hasan, Mahmudul
    Taha, Tarek M.
    Asari, Vijayan K.
    JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
  • [6] SDResU-Net: Separable and Dilated Residual U-Net for MRI Brain Tumor Segmentation
    Zhang, Jianxin
    Lv, Xiaogang
    Sun, Qiule
    Zhang, Qiang
    Wei, Xiaopeng
    Liu, Bin
    CURRENT MEDICAL IMAGING, 2020, 16 (06) : 720 - 728
  • [7] CHANNEL ATTENTION RESIDUAL U-NET FOR RETINAL VESSEL SEGMENTATION
    Guo, Changlu
    Szemenyei, Marton
    Hu, Yangtao
    Wang, Wenle
    Zhou, Wei
    Yi, Yugen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1185 - 1189
  • [8] AResU-Net: Attention Residual U-Net for Brain Tumor Segmentation
    Zhang, Jianxin
    Lv, Xiaogang
    Zhang, Hengbo
    Liu, Bin
    SYMMETRY-BASEL, 2020, 12 (05):
  • [9] A mixed attention-gated U-Net for continuous cuffless blood pressure estimation
    Zhong, Yiting
    Chen, Yongyi
    Zhang, Dan
    Xu, Yanghui
    Karimi, Hamid Reza
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4143 - 4151
  • [10] A mixed attention-gated U-Net for continuous cuffless blood pressure estimation
    Yiting Zhong
    Yongyi Chen
    Dan Zhang
    Yanghui Xu
    Hamid Reza Karimi
    Signal, Image and Video Processing, 2023, 17 : 4143 - 4151