PBVit: A Patch-Based Vision Transformer for Enhanced Brain Tumor Detection

被引:1
|
作者
Chauhan, Pratikkumar [1 ]
Lunagaria, Munindra [1 ]
Verma, Deepak Kumar [1 ]
Vaghela, Krunal [1 ]
Tejani, Ghanshyam G. [2 ,3 ]
Sharma, Sunil Kumar [4 ]
Khan, Ahmad Raza [5 ]
机构
[1] Marwadi Univ, Dept Comp Engn, Rajkot 360003, Gujarat, India
[2] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan, Taiwan
[3] Appl Sci Private Univ, Appl Sci Res Ctr, Amman 11937, Jordan
[4] Majmaah Univ, Coll Comp & Informat Sci, Dept Informat Syst, Majmaah 11952, Saudi Arabia
[5] Majmaah Univ, Coll Comp & Informat Sci, Dept Informat Technol, Majmaah 11952, Saudi Arabia
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Brain tumors; Brain modeling; Accuracy; Transformers; Computational modeling; Training; Medical diagnostic imaging; Computer vision; Magnetic resonance imaging; Computer architecture; Brain tumor detection; vision transformer; healthcare brain tumor detection; CNN PBvit; DETIR; CLASSIFICATION; CNN;
D O I
10.1109/ACCESS.2024.3521002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Brain Tumor holds a significant holds in human health, classified into three primary types: glioma, meningioma, and pituitary tumors. Early detection and accurate classification are vital for effective diagnosis and lowering healthcare costs. In PBvit we presents a novel brain tumor detection framework, the Patch Base Vision Transformer (PBVit). PBVit adopts a patch-based approach where input tumor images are divided into fixed-size patches, with each patch treated as a token. These image patches are linearly projected into lower-dimensional token embeddings, and positional encodings are added to help the model understand spatial relationships within the image. PBVit enhances the detection of intricate patterns and anomalies in brain scans, improving diagnostic accuracy. We trained PBVit using the Figshare brain tumor dataset and observed notable performance improvements compared to traditional CNN-based models. The PBVit reached an accuracy of 95.8%, a precision of 95.3%, a recall of 93.2%, and an F1-score of 92%, indicating its robustness in identifying brain tumors. The promising results demonstrate that PBVit can play a important role in facilitating early-stage diagnosis, reducing unnecessary biopsies, and ultimately enhancing patient care, while also showcasing the potential of transformer-based architectures in medical imaging.
引用
收藏
页码:13015 / 13029
页数:15
相关论文
共 50 条
  • [1] Patch-Based Separable Transformer for Visual Recognition
    Sun, Shuyang
    Yue, Xiaoyu
    Zhao, Hengshuang
    Torr, Philip H. S.
    Bai, Song
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 9241 - 9247
  • [2] Tumor ViT-GRU-XAI: Advanced Brain Tumor Diagnosis Framework: Vision Transformer and GRU Integration for Improved MRI Analysis: A Case Study of Egypt
    Aly, Mohammed
    Ghallab, Abdullatif
    Fathi, Islam S.
    IEEE ACCESS, 2024, 12 : 184726 - 184754
  • [3] Efficient and Adaptable Patch-Based Crack Detection
    Guo, Jing-Ming
    Markoni, Herleeyandi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21885 - 21896
  • [5] IntelPVT: intelligent patch-based pyramid vision transformers for object detection and classification
    Nimma, Divya
    Zhou, Zhaoxian
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1767 - 1778
  • [6] IntelPVT: intelligent patch-based pyramid vision transformers for object detection and classification
    Divya Nimma
    Zhaoxian Zhou
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1767 - 1778
  • [7] SleepViTransformer: Patch-based sleep spectrogram transformer for automatic sleep staging
    Peng, Li
    Ren, Yanzhen
    Luan, Zhiheng
    Chen, Xiong
    Yang, Xiuping
    Tu, Weiping
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [8] DPT: Deformable Patch-based Transformer for Visual Recognition
    Chen, Zhiyang
    Zhu, Yousong
    Zhao, Chaoyang
    Hu, Guosheng
    Zeng, Wei
    Wang, Jinqiao
    Tang, Ming
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2899 - 2907
  • [9] RI-ViT: A Multi-Scale Hybrid Method Based on Vision Transformer for Breast Cancer Detection in Histopathological Images
    Monjezi, Ehsan
    Akbarizadeh, Gholamreza
    Ansari-Asl, Karim
    IEEE ACCESS, 2024, 12 : 186074 - 186086
  • [10] A Vision Transformer Enhanced with Patch Encoding for Malware Classification
    Park, Kyoung-Won
    Cho, Sung-Bae
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2022, 2022, 13756 : 289 - 299