A fusion of VGG-16 and ViT models for improving bone tumor classification in computed tomography

被引：4

作者：

Chen, Weimin ^{[1
]}

Ayoub, Muhammad ^{[2
]}

Liao, Mengyun ^{[2
]}

Shi, Ruizheng ^{[3
]}

Zhang, Mu ^{[4
]}

Su, Feng ^{[4
]}

Huang, Zhiguo ^{[4
]}

Li, Yuanzhe ^{[5
]}

Wang, Yi ^{[5
]}

Wong, Kevin K. L. ^{[1
,6
]}

机构：

[1] Hunan City Univ, Sch Informat & Elect, Yiyang 413000, Peoples R China

[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China

[3] Cent South Univ, Xiangya Hosp, Natl Clin Res Ctr Geriatr Disorders, Changsha 410008, Hunan, Peoples R China

[4] Cent South Univ, Xiangya Hosp, Dept Emergency, Changsha 410008, Hunan, Peoples R China

[5] Fujian Med Univ, Affiliated Hosp 2, Dept CT MRI, Quanzhou 362000, Peoples R China

[6] Univ Saskatchewan, Coll Engn, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada

来源：

JOURNAL OF BONE ONCOLOGY | 2023年 / 43卷

关键词：

Vision Transformer; VGG-16; ViT; Bone tumors diagnosis; Deep learning; Orthopedics image classification;

D O I：

10.1016/j.jbo.2023.100508

中图分类号：

R73 [肿瘤学];

学科分类号：

100214 ;

摘要：

Background and Objective: Bone tumors present significant challenges in orthopedic medicine due to variations in clinical treatment approaches for different tumor types, which includes benign, malignant, and intermediate cases. Convolutional Neural Networks (CNNs) have emerged as prominent models for tumor classification. However, their limited perception ability hinders the acquisition of global structural information, potentially affecting classification accuracy. To address this limitation, we propose an optimized deep learning algorithm for precise classification of diverse bone tumors. Materials and Methods: Our dataset comprises 786 computed tomography (CT) images of bone tumors, featuring sections from two distinct bone species, namely the tibia and femur. Sourced from The Second Affiliated Hospital of Fujian Medical University, the dataset was meticulously preprocessed with noise reduction techniques. We introduce a novel fusion model, VGG16-ViT, leveraging the advantages of the VGG-16 network and the Vision Transformer (ViT) model. Specifically, we select 27 features from the third layer of VGG-16 and input them into the Vision Transformer encoder for comprehensive training. Furthermore, we evaluate the impact of secondary migration using CT images from Xiangya Hospital for validation. Results: The proposed fusion model demonstrates notable improvements in classification performance. It effectively reduces the training time while achieving an impressive classification accuracy rate of 97.6%, marking a significant enhancement of 8% in sensitivity and specificity optimization. Furthermore, the investigation into secondary migration's effects on experimental outcomes across the three models reveals its potential to enhance system performance. Conclusion: Our novel VGG-16 and Vision Transformer joint network exhibits robust classification performance on bone tumor datasets. The integration of these models enables precise and efficient classification,

引用

页数：11

共 28 条

[1] Al-Dhabyani W, 2019, INT J ADV COMPUT SC, V10, P618
[2] Bienrmann J.S., J. Natl. Compr. Canc. Netw., P1540
[3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[4] Radiography in the Initial Diagnosis of Primary Bone Tumors
Costelloe, Colleen M.
Madewell, John E.
[J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2013, 200 (01) : 3 - 7
[5] TransMed: Transformers Advance Multi-Modal Medical Image Classification
Dai, Yin
Gao, Yifan
Liu, Fayu
[J]. DIAGNOSTICS, 2021, 11 (08)
[6] A Survey of Convolutional Neural Networks for Image Classification: Models and Datasets
Deng, Tiancan
[J]. 2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 746 - 749
[7] A two-stage gene selection scheme utilizing MRMR filter and GA wrapper
El Akadi, Ali
Amine, Aouatif
El Ouardighi, Abdeljalil
Aboutajdine, Driss
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 26 (03) : 487 - 500
[8] Fritzsche H, 2017, ORTHOPADE, V46, P484, DOI 10.1007/s00132-017-3429-z
[9] Artificial intelligence in medical imaging
Gore, John C.
[J]. MAGNETIC RESONANCE IMAGING, 2020, 68 : A1 - A4
[10] Deep convolutional neural network VGG-16 model for differential diagnosing of papillary thyroid carcinomas in cytological images: a pilot study
Guari, Qing
Wang, Yunjun
Ping, Bo
Li, Duanshu
Du, Jiajun
Qin, Yu
Lu, Hongtao
Wan, Xiaochun
Xiang, Jun
[J]. JOURNAL OF CANCER, 2019, 10 (20): : 4876 - 4882

← 1 2 3 →