PBVit: A Patch-Based Vision Transformer for Enhanced Brain Tumor Detection

被引：1

作者：

Chauhan, Pratikkumar ^{[1
]}

Lunagaria, Munindra ^{[1
]}

Verma, Deepak Kumar ^{[1
]}

Vaghela, Krunal ^{[1
]}

Tejani, Ghanshyam G. ^{[2
,3
]}

Sharma, Sunil Kumar ^{[4
]}

Khan, Ahmad Raza ^{[5
]}

机构：

[1] Marwadi Univ, Dept Comp Engn, Rajkot 360003, Gujarat, India

[2] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan, Taiwan

[3] Appl Sci Private Univ, Appl Sci Res Ctr, Amman 11937, Jordan

[4] Majmaah Univ, Coll Comp & Informat Sci, Dept Informat Syst, Majmaah 11952, Saudi Arabia

[5] Majmaah Univ, Coll Comp & Informat Sci, Dept Informat Technol, Majmaah 11952, Saudi Arabia

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Brain tumors; Brain modeling; Accuracy; Transformers; Computational modeling; Training; Medical diagnostic imaging; Computer vision; Magnetic resonance imaging; Computer architecture; Brain tumor detection; vision transformer; healthcare brain tumor detection; CNN PBvit; DETIR; CLASSIFICATION; CNN;

D O I：

10.1109/ACCESS.2024.3521002

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Brain Tumor holds a significant holds in human health, classified into three primary types: glioma, meningioma, and pituitary tumors. Early detection and accurate classification are vital for effective diagnosis and lowering healthcare costs. In PBvit we presents a novel brain tumor detection framework, the Patch Base Vision Transformer (PBVit). PBVit adopts a patch-based approach where input tumor images are divided into fixed-size patches, with each patch treated as a token. These image patches are linearly projected into lower-dimensional token embeddings, and positional encodings are added to help the model understand spatial relationships within the image. PBVit enhances the detection of intricate patterns and anomalies in brain scans, improving diagnostic accuracy. We trained PBVit using the Figshare brain tumor dataset and observed notable performance improvements compared to traditional CNN-based models. The PBVit reached an accuracy of 95.8%, a precision of 95.3%, a recall of 93.2%, and an F1-score of 92%, indicating its robustness in identifying brain tumors. The promising results demonstrate that PBVit can play a important role in facilitating early-stage diagnosis, reducing unnecessary biopsies, and ultimately enhancing patient care, while also showcasing the potential of transformer-based architectures in medical imaging.

引用

页码：13015 / 13029

页数：15

共 50 条

[1] Patch-Based Separable Transformer for Visual Recognition
Sun, Shuyang
Yue, Xiaoyu
Zhao, Hengshuang
Torr, Philip H. S.
Bai, Song
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 9241 - 9247
[2] Tumor ViT-GRU-XAI: Advanced Brain Tumor Diagnosis Framework: Vision Transformer and GRU Integration for Improved MRI Analysis: A Case Study of Egypt
Aly, Mohammed
Ghallab, Abdullatif
Fathi, Islam S.
IEEE ACCESS, 2024, 12 : 184726 - 184754
[3] Efficient and Adaptable Patch-Based Crack Detection
Guo, Jing-Ming
Markoni, Herleeyandi
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21885 - 21896
[4] Brain tumor image pixel segmentation and detection using an aggregation of GAN models with vision transformer
Datta, Priyanka
Rohilla, Rajesh
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (01)
[5] IntelPVT: intelligent patch-based pyramid vision transformers for object detection and classification
Nimma, Divya
Zhou, Zhaoxian
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1767 - 1778
[6] IntelPVT: intelligent patch-based pyramid vision transformers for object detection and classification
Divya Nimma
Zhaoxian Zhou
International Journal of Machine Learning and Cybernetics, 2024, 15 : 1767 - 1778
[7] SleepViTransformer: Patch-based sleep spectrogram transformer for automatic sleep staging
Peng, Li
Ren, Yanzhen
Luan, Zhiheng
Chen, Xiong
Yang, Xiuping
Tu, Weiping
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
[8] DPT: Deformable Patch-based Transformer for Visual Recognition
Chen, Zhiyang
Zhu, Yousong
Zhao, Chaoyang
Hu, Guosheng
Zeng, Wei
Wang, Jinqiao
Tang, Ming
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2899 - 2907
[9] RI-ViT: A Multi-Scale Hybrid Method Based on Vision Transformer for Breast Cancer Detection in Histopathological Images
Monjezi, Ehsan
Akbarizadeh, Gholamreza
Ansari-Asl, Karim
IEEE ACCESS, 2024, 12 : 186074 - 186086
[10] A Vision Transformer Enhanced with Patch Encoding for Malware Classification
Park, Kyoung-Won
Cho, Sung-Bae
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2022, 2022, 13756 : 289 - 299

← 1 2 3 4 5 →