Preliminary Study on Patch Sizes in Vision Transformers (ViT) for COVID-19 and Diseased Lungs Classification

被引:11
作者
Than, Joel C. M. [1 ]
Liang, Pun [1 ]
Rijal, Omar Mohd [2 ]
Kassim, Rosminah M. [3 ]
Yunus, Ashari [4 ]
Noor, Norliza M. [5 ]
Then, Patrick [1 ]
机构
[1] Swinburne Univ, Fac Engn Comp & Sci, Technol Sarawak Campus, Kuching, Malaysia
[2] Univ Malaya, Inst Math Sci, Kuala Lumpur, Malaysia
[3] Kuala Lumpur Hosp, Dept Diagnost Imaging, Kuala Lumpur, Malaysia
[4] Kuala Lumpur Hosp, Inst Resp Med, Kuala Lumpur, Malaysia
[5] Univ Teknol Malaysia, Razak Fac Technol & Informat, Kuala Lumpur Campus, Kuala Lumpur, Malaysia
来源
1ST NATIONAL BIOMEDICAL ENGINEERING CONFERENCE (NBEC 2021): ADVANCED TECHNOLOGY FOR MODERN HEALTHCARE | 2021年
关键词
deep learning; lung; COVID-19; transformer; classification;
D O I
10.1109/NBEC53282.2021.9618751
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
COVID-19 and lung diseases have been the major focus of research currently due to the pandemic's reach and effect. Deep Learning (DL) is playing a large role today in various fields from disease classification to drug response identification. The conventional DL method used for images is the Convolutional Neural Network (CNN). A potential method that will replace the usage of CNNs is Transformer specifically Vision Transformers (ViT). This study is a preliminary exploration to determine the performance of using ViT on diseased lungs, COVID-19 infected lungs, and normal lungs. This study was performed on two datasets. The first dataset was a publicly accessible dataset from Iran that has a large cohort of patients. The second dataset was a Malaysian dataset. These images were utilized to verify the usage of ViT and its effectiveness. Images were segregated into several sized patches (16x16, 32x32, 64x64, 128x128, 256x256) pixels. To determine the performance of ViT method, performance metrics of accuracy, sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV) and F1-score. From the results of this study, ViT is a promising method with a peak accuracy of 95.36%.
引用
收藏
页码:146 / 150
页数:5
相关论文
共 19 条
  • [1] Two-phase multi-model automatic brain tumour diagnosis system from magnetic resonance images using convolutional neural networks
    Abd-Ellah, Mahmoud Khaled
    Awad, Ali Ismail
    Khalaf, Ashraf A. M.
    Hamed, Hesham F. A.
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
  • [2] Performance of Radiologists in Differentiating COVID-19 from Non-COVID-19 Viral Pneumonia at Chest CT
    Bai, Harrison X.
    Hsieh, Ben
    Xiong, Zeng
    Halsey, Kasey
    Choi, Ji Whae
    Tran, Thi My Linh
    Pan, Ian
    Shi, Lin-Bo
    Wang, Dong-Cui
    Mei, Ji
    Jiang, Xiao-Long
    Zeng, Qiu-Hua
    Egglin, Thomas K.
    Hu, Ping-Feng
    Agarwal, Saurabh
    Xie, Fang-Fang
    Li, Sha
    Healey, Terrance
    Atalay, Michael K.
    Liao, Wei-Hua
    [J]. RADIOLOGY, 2020, 296 (02) : E46 - E54
  • [3] d'Ascoli S, 2021, Arxiv, DOI arXiv:2103.10697
  • [4] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [5] Gao XH, 2021, Arxiv, DOI arXiv:2107.01682
  • [6] Automated Identification of Diabetic Retinopathy Using Deep Learning
    Gargeya, Rishab
    Leng, Theodore
    [J]. OPHTHALMOLOGY, 2017, 124 (07) : 962 - 969
  • [7] Thin-section CT obtained at 10-mm increments versus limited three-level thin-section CT for idiopathic pulmonary fibrosis: Correlation with pathologic scoring
    Kazerooni, EA
    Martinez, FJ
    Flint, A
    Jamadar, DA
    Gross, BH
    Spizarny, DL
    Cascade, PN
    Whyte, RI
    Lynch, JP
    Toews, G
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 1997, 169 (04) : 977 - 983
  • [8] Lung Disease Classification using GLCM and Deep Features from Different Deep Learning Architectures with Principal Component Analysis
    Ming, Joel Than Chia
    Noor, Norliza Mohd
    Rijal, Omar Mohd
    Kassim, Rosminah M.
    Yunus, Ashari
    [J]. INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2018, 10 (07): : 76 - 89
  • [9] Mondal A. K., 2021, XVITCOS EXPLAINABLE, DOI [10.36227/techrxiv.14912367.v1, DOI 10.36227/TECHRXIV.14912367.V1]
  • [10] Noor N. M., 2015, PERFORMANCE EVALUATI