3D multi-scale vision transformer for lung nodule detection in chest CT images

被引:15
作者
Mkindu, Hassan [1 ]
Wu, Longwen [1 ]
Zhao, Yaqin [1 ]
机构
[1] Harbin Inst Technol, Sch Elect & Informat Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer-aided diagnosis; Computed tomography; Vision transformer; Lung nodule; 3D-MSViT; FALSE-POSITIVE REDUCTION;
D O I
10.1007/s11760-022-02464-0
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Lung cancer becomes the most prominent cause of cancer-related death in society. Normally, radiologists use computed tomography (CT) to diagnose lung nodules in lung cancer patients. A single CT scan for a patient produces hundreds of images that are manually analyzed by radiologists which is a big burden and sometimes leads to inaccuracy. Recently, many computer-aided diagnosis (CAD) systems integrated with deep learning architectures have been proposed to assist radiologists. This study proposes the CAD scheme based on a 3D multi-scale vision transformer (3D-MSViT) to enhance multi-scale feature extraction and improves lung nodule prediction efficiency from 3D CT images. The 3D-MSViT architecture adopted a local-global transformer block structure whereby the local transformer stage individually processes each scale patch and forwards it to the global transformer level for merging multi-scale features. The transformer blocks fully relied on the attention mechanism without the inclusion of the convolutional neural network to reduce the network parameters. The proposed CAD scheme was validated on 888 CT images of the Lung Nodule Analysis 2016 (LUNA16) public dataset. Free-response receiver operating characteristics analysis was adopted to evaluate the proposed method. The 3D-MSViT algorithm obtained the highest sensitivity of 97.81% and competition performance metrics of 0.911. Therefore, the 3D-MSViT scheme obtained comparable results with low network complexity related to the counterpart deep learning approaches in prior studies.
引用
收藏
页码:2473 / 2480
页数:8
相关论文
共 35 条
[1]  
De Moura J., 2018, IEEE T MED IMAGING, V7, P1, DOI [10.1117/12.2285954, DOI 10.1117/12.2285954]
[2]   Multilevel Contextual 3-D CNNs for False Positive Reduction in Pulmonary Nodule Detection [J].
Dou, Qi ;
Chen, Hao ;
Yu, Lequan ;
Qin, Jing ;
Heng, Pheng-Ann .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2017, 64 (07) :1558-1567
[3]   LNCDS: A 2D-3D cascaded CNN approach for lung nodule classification, detection and segmentation [J].
Dutande, Prasad ;
Baid, Ujjwal ;
Talbar, Sanjay .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 67
[4]   Automated pulmonary nodule detection in CT images using 3D deep squeeze-and-excitation networks [J].
Gong, Li ;
Jiang, Shan ;
Yang, Zhiyong ;
Zhang, Guobin ;
Wang, Lu .
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (11) :1969-1979
[5]   Automatic lung nodule detection using multi-scale dot nodule-enhancement filter and weighted support vector machines in chest computed tomography [J].
Gu, Yu ;
Lu, Xiaoqi ;
Zhang, Baohua ;
Zhao, Ying ;
Yu, Dahua ;
Gao, Lixin ;
Cui, Guimei ;
Wu, Liang ;
Zhou, Tao .
PLOS ONE, 2019, 14 (01)
[6]   One-stage pulmonary nodule detection using 3-D DCNN with feature fusion and attention mechanism in CT image [J].
Huang, Yao-Sian ;
Chou, Ping-Ru ;
Chen, Hsin-Ming ;
Chang, Yeun-Chung ;
Chang, Ruey-Feng .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 220
[7]  
Janocha K., 2017, Schedae Informaticae, V25, P49, DOI [10.4467/20838476SI.16.004.6185, DOI 10.4467/20838476SI.16.004.6185]
[8]   Cancer statistics, 2007 [J].
Jemal, Ahmedin ;
Siegel, Rebecca ;
Ward, Elizabeth ;
Murray, Taylor ;
Xu, Jiaquan ;
Thun, Michael J. .
CA-A CANCER JOURNAL FOR CLINICIANS, 2007, 57 (01) :43-66
[9]   An Automatic Detection System of Lung Nodule Based on Multigroup Patch-Based Deep Learning Network [J].
Jiang, Hongyang ;
Ma, He ;
Qian, Wei ;
Gao, Mengdi ;
Li, Yan .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (04) :1227-1237
[10]  
KEKEKE, 2020, J. Mach. Learn. Res., V21, P1