Deep learning in medical imaging has become a focal point of research, with an emphasis on techniques that automatically detect pathologies in magnetic resonance imaging (MRI) and X-ray scans. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of videostroboscopy. While this technique visualizes the oscillations of the vocal folds, manual examination of the glottis presents inherent challenges. In the current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model on BAGLS requires a substantial investment of computational resources. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net built on depthwise separable convolutions. In addition to minimizing computational demands, this study enhances the compact architecture with attention gates, skip connections, and squeeze-and-excitation blocks. These modifications extract robust features while avoiding unnecessary growth in the parameter count. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, requiring only 62 K parameters and 0.21 G FLOPs. Impressively, it achieves a Dice similarity coefficient (DSC) of 88.73% and an intersection over union (IoU) of 79.97%. These performance metrics position the S3AR U-Net favorably against most existing U-Net solutions, showcasing its efficacy in automatically segmenting the glottis in medical image analysis.
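
As an illustrative aside (not part of the paper's own implementation), the following minimal PyTorch sketch shows why depthwise separable convolutions cut parameter counts relative to a standard convolution, which is the core mechanism behind the lightweight design described above. The module name, channel sizes, and kernel size here are assumptions chosen for illustration, not values taken from the S3AR U-Net.

```python
import torch.nn as nn

class DepthwiseSeparableConv(nn.Module):
    """A depthwise convolution (one 3x3 filter per input channel, via
    groups=in_ch) followed by a 1x1 pointwise convolution that mixes
    channels. This factorization replaces a dense k x k convolution."""

    def __init__(self, in_ch: int, out_ch: int, kernel_size: int = 3, padding: int = 1):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size,
                                   padding=padding, groups=in_ch, bias=False)
        self.pointwise = nn.Conv2d(in_ch, out_ch, kernel_size=1, bias=False)

    def forward(self, x):
        return self.pointwise(self.depthwise(x))

def count_params(m: nn.Module) -> int:
    return sum(p.numel() for p in m.parameters())

# Hypothetical channel sizes for comparison only.
standard = nn.Conv2d(64, 128, kernel_size=3, padding=1, bias=False)
separable = DepthwiseSeparableConv(64, 128)

print(count_params(standard))   # 64 * 128 * 3 * 3          = 73,728
print(count_params(separable))  # 64 * 3 * 3 + 64 * 128     =  8,768  (~8.4x fewer)
```

For a k x k convolution, the factorization reduces the parameter cost from in_ch * out_ch * k^2 to in_ch * k^2 + in_ch * out_ch, which is how architectures of this kind stay in the tens of thousands of parameters rather than millions.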