SwinUNeLCsT: Global-local spatial representation learning with hybrid CNN-transformer for efficient tuberculosis lung cavity weakly supervised semantic segmentation

被引：8

作者：

Tan, Zhuoyi ^{[1
]}

Madzin, Hizmawati ^{[1
]}

Norafida, Bahari ^{[2
]}

Rahmat, Rahmita Wirza O. K. ^{[1
]}

Khalid, Fatimah ^{[1
]}

Sulaiman, Puteri Suhaiza

机构：

[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Serdang 43400, Malaysia

[2] Dept Radiol, Univ Putra Malaysia, Serdang 43400, Selangor, Malaysia

来源：

JOURNAL OF KING SAUD UNIVERSITY COMPUTER AND INFORMATION SCIENCES | 2024年 / 36卷 / 04期

关键词：

Deep learning; Classification; Semantic segmentation; Weakly-supervised learning; CT tuberculosis imaging; IMAGE;

D O I：

10.1016/j.jksuci.2024.102012

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Radiological diagnosis of lung cavities (LCs) is the key to identifying tuberculosis (TB). Conventional deep learning methods rely on a large amount of accurate pixel -level data to segment LCs. This process is timeconsuming and laborious, especially for those subtle LCs. To address such challenges, firstly, we introduce a novel 3D TB LCs imaging convolutional neural network (CNN) -transformer hybrid model (SwinUNeLCsT). The core idea of SwinUNeLCsT is to combine local details and global dependencies for TB CT scan image feature representation to effectively improve the recognition ability of LCs. Secondly, to reduce the dependence on accurate pixel -level annotations, we design an end -to -end LCs weakly supervised semantic segmentation (WSSS) framework. Through this framework, radiologists need only to classify the number and the approximate location (e.g., left lung, right lung, or both) of LCs in the CT scan to achieve efficient segmentation of the LCs. This process eliminates the need for meticulously drawing boundaries, greatly reducing the cost of annotation. Extensive experimental results show that SwinUNeLCsT outperforms currently popular medical 3D segmentation methods in the supervised semantic segmentation paradigm. Meanwhile, our WSSS framework based on SwinUNeLCsT also performs best among the existing state-of-the-art medical 3D WSSS methods.

引用

页数：15

共 73 条

[1] Improving tuberculosis severity assessment in computed tomography images using novel DAvoU-Net segmentation and deep learning framework [J].

Alebiosu, David Olayemi ;

Dharmaratne, Anuja ;

Lim, Chern Hong .

EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213

[2]

Amodei D, 2016, PR MACH LEARN RES, V48

[3]

[Anonymous], 2020, Int. J. Adv. Trends Comput. Sci. Eng., V9, DOI 10.30534/ijatcse/2020/175942020

[4] Deep semantic segmentation of natural and medical images: a review [J].

Asgari Taghanaki, Saeid ;

Abhishek, Kumar ;

Cohen, Joseph Paul ;

Cohen-Adad, Julien ;

Hamarneh, Ghassan .

ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178

[5]

Berthelot D, 2019, ADV NEUR IN, V32

[6] An Empirical Study of Training Self-Supervised Vision Transformers [J].

Chen, Xinlei ;

Xie, Saining ;

He, Kaiming .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9620-9629

[7] C-CAM: Causal CAM for Weakly Supervised Semantic Segmentation on Medical Image [J].

Chen, Zhang ;

Tian, Zhiqiang ;

Zhu, Jihua ;

Li, Ce ;

Du, Shaoyi .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11666-11675

[8] Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation [J].

Chen, Zhaozheng ;

Wang, Tan ;

Wu, Xiongwei ;

Hua, Xian-Sheng ;

Zhang, Hanwang ;

Sun, Qianru .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :959-968

[9] Xception: Deep Learning with Depthwise Separable Convolutions [J].

Chollet, Francois .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807

[10] Anti-tuberculosis treatment strategies and drug development: challenges and priorities [J].

Dartois, Veronique A. ;

Rubin, Eric J. .

NATURE REVIEWS MICROBIOLOGY, 2022, 20 (11) :685-701

← 1 2 3 4 5 6 7 8 →