S3AR U-Net: A separable squeezed similarity attention-gated residual U-Net for glottis segmentation

被引：1

作者：

Montalbo, Francis Jesmar P. ^{[1
]}

机构：

[1] Batangas State Univ, Rizal Ave,Extens, Batangas 4200, Batangas, Philippines

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024年 / 92卷

关键词：

Biomedical imaging; Convolutional neural networks; Laryngeal video endoscopy; Lightweight attention mechanisms; Otolaryngology; Throat disease; IMAGE SEGMENTATION; ARCHITECTURE; NETWORK; TUMOR; UNET;

D O I：

10.1016/j.bspc.2024.106047

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Deep learning in medical imaging became a focal point in research, emphasizing techniques to automatically detect ailments in magnetic resonance images (MRI) and X-rays. Beyond these imaging modalities, there is a recognized need to apply soft computing solutions to Laryngeal Video Endoscopy (LVE), specifically in the context of Videostroboscopy. While this technique offers visual oscillations of the vocal fold, manual examination of the glottis presents inherent challenges. In current literature, various proposals favor segmentation models such as U-Net, trained with labeled data, to address this diagnostic challenge. Fortunately, the Benchmark for Automatic Glottis Segmentation (BAGLS) dataset, comprising 55,750 meticulously labeled images, has become available. However, training a U-Net model with BAGLS requires substantial computational resource investment. Therefore, this study proposes an approach to reduce computational costs by designing a lightweight U-Net incorporating separable depthwise convolutions. In addition to minimizing computational demands, this study enhances the compact architecture by incorporating attention gates, skip connections, and squeeze-andexcitation blocks. These modifications tend to extract robust features while avoiding unnecessary parameter enlargement. Despite its compact nature, the proposed S3AR U-Net includes a feature similarity module that enriches its spectrum of features. Overall, the S3AR U-Net introduced in this study proves to be a cost-efficient solution, boasting only 62 K parameters and operating at 0.21 G FLOPs. Impressively, it achieves a DSC of 88.73 % and IoU of 79.97 %. These performance metrics position the S3AR U-Net favorably compared to most existing U-Net solutions, showcasing its efficacy in automatically segmenting a person's glottis during medical imaging analysis.

引用

页数：20

共 50 条

[1] Segmentation of the Thoracic Aorta using an Attention-Gated U-Net
Zhong, Jiayang
Bian, Zhangxing
Hatt, Charles R.
Burris, Nicholas S.
MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
[2] Chaining a U-Net With a Residual U-Net for Retinal Blood Vessels Segmentation
Alfonso Francia, Gendry
Pedraza, Carlos
Aceves, Marco
Tovar-Arriaga, Saul
IEEE ACCESS, 2020, 8 : 38493 - 38500
[3] MSR U-Net: An Improved U-Net Model for Retinal Blood Vessel Segmentation
Kande, Giri Babu
Ravi, Logesh
Kande, Nitya
Nalluri, Madhusudana Rao
Kotb, Hossam
Aboras, Kareem M.
Yousef, Amr
Ghadi, Yazeed Yasin
Sasikumar, A.
IEEE ACCESS, 2024, 12 : 534 - 551
[4] Multimodal attention-gated cascaded U-Net model for automatic brain tumor detection and segmentation
Chinnam, Siva Koteswara Rao
Sistla, Venkatramaphanikumar
Kolli, Venkata Krishna Kishore
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78
[5] Recurrent residual U-Net for medical image segmentation
Alom, Md Zahangir
Yakopcic, Chris
Hasan, Mahmudul
Taha, Tarek M.
Asari, Vijayan K.
JOURNAL OF MEDICAL IMAGING, 2019, 6 (01)
[6] SDResU-Net: Separable and Dilated Residual U-Net for MRI Brain Tumor Segmentation
Zhang, Jianxin
Lv, Xiaogang
Sun, Qiule
Zhang, Qiang
Wei, Xiaopeng
Liu, Bin
CURRENT MEDICAL IMAGING, 2020, 16 (06) : 720 - 728
[7] CHANNEL ATTENTION RESIDUAL U-NET FOR RETINAL VESSEL SEGMENTATION
Guo, Changlu
Szemenyei, Marton
Hu, Yangtao
Wang, Wenle
Zhou, Wei
Yi, Yugen
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1185 - 1189
[8] AResU-Net: Attention Residual U-Net for Brain Tumor Segmentation
Zhang, Jianxin
Lv, Xiaogang
Zhang, Hengbo
Liu, Bin
SYMMETRY-BASEL, 2020, 12 (05):
[9] A mixed attention-gated U-Net for continuous cuffless blood pressure estimation
Zhong, Yiting
Chen, Yongyi
Zhang, Dan
Xu, Yanghui
Karimi, Hamid Reza
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4143 - 4151
[10] A mixed attention-gated U-Net for continuous cuffless blood pressure estimation
Yiting Zhong
Yongyi Chen
Dan Zhang
Yanghui Xu
Hamid Reza Karimi
Signal, Image and Video Processing, 2023, 17 : 4143 - 4151

← 1 2 3 4 5 →