A fusion of VGG-16 and ViT models for improving bone tumor classification in computed tomography

被引:4
作者
Chen, Weimin [1 ]
Ayoub, Muhammad [2 ]
Liao, Mengyun [2 ]
Shi, Ruizheng [3 ]
Zhang, Mu [4 ]
Su, Feng [4 ]
Huang, Zhiguo [4 ]
Li, Yuanzhe [5 ]
Wang, Yi [5 ]
Wong, Kevin K. L. [1 ,6 ]
机构
[1] Hunan City Univ, Sch Informat & Elect, Yiyang 413000, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Hunan, Peoples R China
[3] Cent South Univ, Xiangya Hosp, Natl Clin Res Ctr Geriatr Disorders, Changsha 410008, Hunan, Peoples R China
[4] Cent South Univ, Xiangya Hosp, Dept Emergency, Changsha 410008, Hunan, Peoples R China
[5] Fujian Med Univ, Affiliated Hosp 2, Dept CT MRI, Quanzhou 362000, Peoples R China
[6] Univ Saskatchewan, Coll Engn, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada
关键词
Vision Transformer; VGG-16; ViT; Bone tumors diagnosis; Deep learning; Orthopedics image classification;
D O I
10.1016/j.jbo.2023.100508
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background and Objective: Bone tumors present significant challenges in orthopedic medicine due to variations in clinical treatment approaches for different tumor types, which includes benign, malignant, and intermediate cases. Convolutional Neural Networks (CNNs) have emerged as prominent models for tumor classification. However, their limited perception ability hinders the acquisition of global structural information, potentially affecting classification accuracy. To address this limitation, we propose an optimized deep learning algorithm for precise classification of diverse bone tumors. Materials and Methods: Our dataset comprises 786 computed tomography (CT) images of bone tumors, featuring sections from two distinct bone species, namely the tibia and femur. Sourced from The Second Affiliated Hospital of Fujian Medical University, the dataset was meticulously preprocessed with noise reduction techniques. We introduce a novel fusion model, VGG16-ViT, leveraging the advantages of the VGG-16 network and the Vision Transformer (ViT) model. Specifically, we select 27 features from the third layer of VGG-16 and input them into the Vision Transformer encoder for comprehensive training. Furthermore, we evaluate the impact of secondary migration using CT images from Xiangya Hospital for validation. Results: The proposed fusion model demonstrates notable improvements in classification performance. It effectively reduces the training time while achieving an impressive classification accuracy rate of 97.6%, marking a significant enhancement of 8% in sensitivity and specificity optimization. Furthermore, the investigation into secondary migration's effects on experimental outcomes across the three models reveals its potential to enhance system performance. Conclusion: Our novel VGG-16 and Vision Transformer joint network exhibits robust classification performance on bone tumor datasets. The integration of these models enables precise and efficient classification,
引用
收藏
页数:11
相关论文
共 28 条
  • [1] Al-Dhabyani W, 2019, INT J ADV COMPUT SC, V10, P618
  • [2] Bienrmann J.S., J. Natl. Compr. Canc. Netw., P1540
  • [3] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [4] Radiography in the Initial Diagnosis of Primary Bone Tumors
    Costelloe, Colleen M.
    Madewell, John E.
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2013, 200 (01) : 3 - 7
  • [5] TransMed: Transformers Advance Multi-Modal Medical Image Classification
    Dai, Yin
    Gao, Yifan
    Liu, Fayu
    [J]. DIAGNOSTICS, 2021, 11 (08)
  • [6] A Survey of Convolutional Neural Networks for Image Classification: Models and Datasets
    Deng, Tiancan
    [J]. 2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 746 - 749
  • [7] A two-stage gene selection scheme utilizing MRMR filter and GA wrapper
    El Akadi, Ali
    Amine, Aouatif
    El Ouardighi, Abdeljalil
    Aboutajdine, Driss
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 26 (03) : 487 - 500
  • [8] Fritzsche H, 2017, ORTHOPADE, V46, P484, DOI 10.1007/s00132-017-3429-z
  • [9] Artificial intelligence in medical imaging
    Gore, John C.
    [J]. MAGNETIC RESONANCE IMAGING, 2020, 68 : A1 - A4
  • [10] Deep convolutional neural network VGG-16 model for differential diagnosing of papillary thyroid carcinomas in cytological images: a pilot study
    Guari, Qing
    Wang, Yunjun
    Ping, Bo
    Li, Duanshu
    Du, Jiajun
    Qin, Yu
    Lu, Hongtao
    Wan, Xiaochun
    Xiang, Jun
    [J]. JOURNAL OF CANCER, 2019, 10 (20): : 4876 - 4882