Computer-Aided Diagnosis of Spinal Tuberculosis From CT Images Based on Deep Learning With Multimodal Feature Fusion

Cited by: 13

Authors:

Li, Zhaotong [1,2]

Wu, Fengliang [3,4]

Hong, Fengze [5]

Gai, Xiaoyan [6]

Cao, Wenli [7]

Zhang, Zeru [1]

Yang, Timin [4]

Wang, Jiu [4]

Gao, Song [1]

Peng, Chao [4]

Affiliations:

[1] Peking Univ Hlth Sci Ctr, Inst Med Technol, Beijing, Peoples R China

[2] Peking Univ, Sch Hlth Human, Beijing, Peoples R China

[3] Peking Univ Third Hosp, Beijing Key Lab Spinal Dis Res, Dept Orthoped, Engn Res Ctr Bone & Joint Precis Med, Beijing, Peoples R China

[4] Peoples Hosp Tibet Autonomous Region, Dept Orthoped, Lhasa, Peoples R China

[5] Tibet Univ, Coll Med, Lhasa, Peoples R China

[6] Peking Univ Third Hosp, Dept Resp & Crit Care Med, Beijing, Peoples R China

[7] Beijing Geriatr Hosp, TB Dept, Beijing, Peoples R China

Source:

FRONTIERS IN MICROBIOLOGY | 2022 / Volume 13

Funding:

National Natural Science Foundation of China;

Keywords:

computer-aided diagnosis; spinal tuberculosis; computed tomography; feature fusion; deep learning; RADIOMICS; CANCER; CLASSIFICATION; INFORMATION; CHALLENGES; MANAGEMENT; MODEL;

DOI:

10.3389/fmicb.2022.823324

Chinese Library Classification (CLC) number:

Q93 [Microbiology];

Discipline classification code:

071005 ; 100705 ;

Abstract:

Background: Spinal tuberculosis (TB) has the highest incidence in remote plateau areas, particularly in Tibet, China, due to inadequate local healthcare services, which not only facilitates the transmission of TB bacteria but also increases the burden on grassroots hospitals. Computer-aided diagnosis (CAD) is urgently required to improve the efficiency of clinical diagnosis of TB using computed tomography (CT) images. However, classical machine learning with handcrafted features generally has low accuracy, and deep learning with self-extracting features relies heavily on the size of medical datasets. Therefore, CAD, which effectively fuses multimodal features, is an alternative solution for spinal TB detection.

Methods: A new deep learning method is proposed that fuses four elaborate image features, specifically three handcrafted features and one convolutional neural network (CNN) feature. Spinal TB CT images were collected from 197 patients with spinal TB, from 2013 to 2020, in the People's Hospital of Tibet Autonomous Region, China; 3,000 effective lumbar spine CT images were randomly screened to our dataset, from which two sets of 1,500 images each were classified as tuberculosis (positive) and health (negative). In addition, virtual data augmentation is proposed to enlarge the handcrafted features of the TB dataset. Essentially, the proposed multimodal feature fusion CNN consists of four main sections: matching network, backbone (ResNet-18/50, VGG-11/16, DenseNet-121/161), fallen network, and gated information fusion network. Detailed performance analyses were conducted based on the multimodal features, proposed augmentation, model stability, and model-focused heatmap.

Results: Experimental results showed that the proposed model with VGG-11 and virtual data augmentation exhibited optimal performance in terms of accuracy, specificity, sensitivity, and area under curve. In addition, an inverse relationship existed between the model size and test accuracy. The model-focused heatmap also shifted from the irrelevant region to the bone destruction caused by TB.

Conclusion: The proposed augmentation effectively simulated the real data distribution in the feature space. More importantly, all the evaluation metrics and analyses demonstrated that the proposed deep learning model exhibits efficient feature fusion for multimodal features. Our study provides a profound insight into the preliminary auxiliary diagnosis of spinal TB from CT images applicable to the Tibetan area.
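The abstract reports accuracy, specificity, sensitivity, and area under curve for a binary task (tuberculosis = positive, healthy = negative). As a reading aid only, the sketch below shows one common way these four quantities can be computed. It is not the authors' evaluation code: the function names `binary_metrics` and `auc`, the 0.5 decision threshold, and the six toy labels and scores are all assumptions made up for illustration and do not come from the paper.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity and specificity for labels 1 (TB) and 0 (healthy).

    Assumes both classes are present, otherwise a denominator would be zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # TB correctly flagged
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # healthy correctly cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # healthy flagged as TB
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # TB missed
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
    }


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise ranking.

    Equals the probability that a random positive scores higher than a
    random negative, with ties counted as one half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical, not from the paper): 3 TB images, 3 healthy images.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]          # model's predicted TB probability
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = binary_metrics(y_true, y_pred)
metrics["auc"] = auc(y_true, scores)
print(metrics)
```

Sensitivity and specificity depend on the chosen threshold, whereas the area under curve summarizes ranking quality across all thresholds, which is presumably why the abstract lists all four together.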

Pages: 18

References: 66 in total
