S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

被引：1

作者：

Montalbo, Francis Jesmar P. ^{[1
]}

机构：

[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024年 / 92卷

关键词：

Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;

D O I：

10.1016/j.bspc.2024.106047

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Deep learning in medical imaging became a focal point in research, emphasizing techniques to automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of Videostroboscopy. While this technique offers visual oscillations of the vocal fold, manual examination of the glottis presents inherent challenges. In current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires substantial computational resource investment. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net incorporating separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-andexcitation blocks. These modifications tend to extract robust features while avoiding unnecessary parameter enlargement. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, boasting only 62 K parameters and operating at 0.21 G FLOPs. Impressively, it achieves a DSC of 88.73 % and IoU of 79.97 %. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.

引用

页数：20

共 50 条

[31] Composite Attention Residual U-Net for Rib Fracture Detection
Wang, Xiaoming
Wang, Yongxiong
ENTROPY, 2023, 25 (03)
[32] U-Net with Attention Mechanism for Retinal Vessel Segmentation
Si, Ze
Fu, Dongmei
Li, Jiahao
IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 668 - 677
[33] Deep Learning Model Development with U-net Architecture for Glottis Segmentation
Derdiman, Yasar Said
Koc, Turgay
29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
[34] Attention Convolutional U-Net for Automatic Liver Tumor Segmentation
Bibi, Asima
Khan, Muhammad Salman
2021 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT 2021), 2021, : 102 - 107
[35] CLA-U-Net: Convolutional Long-short-term-memory Attention-gated U-Net for Automatic Segmentation of the Left Ventricle in 2-D Echocardiograms
Lin, Zihan
Tsui, Po-Hsiang
Zeng, Yan
Bin, Guangyu
Wu, Shuicai
Zhou, Zhuhuang
2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
[36] DARU-Net: A dual attention residual U-Net for uterine fibroids segmentation on MRI
Zhang, Jian
Liu, Yang
Chen, Liping
Ma, Si
Zhong, Yuqing
He, Zhimin
Li, Chengwei
Xiao, Zhibo
Zheng, Yineng
Lv, Fajin
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2023, 24 (06):
[37] Sharp dense U-Net: an enhanced dense U-Net architecture for nucleus segmentation
Senapati, Pradip
Basu, Anusua
Deb, Mainak
Dhal, Krishna Gopal
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (06) : 2079 - 2094
[38] Brain Tumor Segmentation with Attention-based U-Net
Li, Tuofu
Liu, Javin Jia
Tai, Yintao
Tian, Yuxuan
SECOND IYSF ACADEMIC SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, 2021, 12079
[39] An attention based residual U-Net with swin transformer for brain MRI segmentation
Angona, Tazkia Mim
Mondal, M. Rubaiyat Hossain
ARRAY, 2025, 25
[40] Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions
Khan, Bakht Alam
Jung, Jin-Woo
APPLIED SCIENCES-BASEL, 2024, 14 (09):

← 1 2 3 4 5 →