S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

Cited by: 1

Authors:

Montalbo, Francis Jesmar P. [1]

Affiliations:

[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines

Source:

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024 / Volume 92

Keywords:

Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;

DOI:

10.1016/j.bspc.2024.106047

Chinese Library Classification (CLC) number:

R318 [Biomedical engineering];

Discipline classification code:

0831;

Abstract:

Deep learning in medical imaging has become a focal point in research, emphasizing techniques to automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of videostroboscopy. While this technique visualizes the oscillations of the vocal folds, manual examination of the glottis presents inherent challenges. In the current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires a substantial investment of computational resources. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net incorporating separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-and-excitation blocks. These modifications help extract robust features while avoiding unnecessary parameter enlargement. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, with only 62K parameters and 0.21 GFLOPs. It achieves a Dice similarity coefficient (DSC) of 88.73% and an intersection over union (IoU) of 79.97%. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.
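
The abstract credits separable depthwise convolutions for the small parameter count and reports results as DSC and IoU. The sketch below is illustrative only and is not taken from the paper: it compares the parameter count of a standard convolution with a depthwise-plus-pointwise pair, and computes DSC and IoU for two toy binary masks. The function names, the layer sizes (3 x 3 kernel, 32 input and 64 output channels), the choice to count bias once on the pointwise step, and the toy masks are all assumptions made for illustration; they do not describe the S3AR U-Net's actual layers or data.

```python
def conv_params(k, c_in, c_out, bias=True):
    """Parameter count of a standard k x k convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def separable_conv_params(k, c_in, c_out, bias=True):
    """Parameter count of a depthwise k x k convolution followed by a
    1 x 1 pointwise convolution. Assumption: bias is counted once, on
    the pointwise step."""
    depthwise = k * k * c_in            # one k x k filter per input channel
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


def dice_and_iou(pred, target):
    """DSC and IoU for two binary masks given as equal-sized nested
    lists of 0/1 values."""
    p = [v for row in pred for v in row]
    t = [v for row in target for v in row]
    inter = sum(a & b for a, b in zip(p, t))
    total = sum(p) + sum(t)
    if total == 0:                      # both masks empty: treat as perfect match
        return 1.0, 1.0
    union = total - inter
    return 2 * inter / total, inter / union


# Hypothetical layer sizes, chosen only to show the scale of the saving.
standard = conv_params(3, 32, 64)
separable = separable_conv_params(3, 32, 64)
print(standard, separable)              # 18496 2400

# Toy 2 x 3 masks, not real glottis data.
pred = [[0, 1, 1], [0, 1, 0]]
target = [[0, 1, 0], [0, 1, 1]]
dice, iou = dice_and_iou(pred, target)
print(round(dice, 4), round(iou, 4))    # 0.6667 0.5
```

For a single pair of masks the two metrics are linked by IoU = DSC / (2 - DSC). Scores averaged over many images, such as the dataset-level figures reported in the abstract, need not satisfy that identity exactly.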

Pages: 20
