RanMerFormer: Randomized vision transformer with token merging for brain tumor classification

Cited by: 18
Authors
Wang, Jian [1]
Lu, Si-Yuan [2]
Wang, Shui-Hua [1,3]
Zhang, Yu-Dong [1,4]
Affiliations
[1] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
[2] Nanjing Univ Posts & Telecommun, Sch Commun & Informat Engn, Nanjing 210003, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Dept Biol Sci, Suzhou 215123, Jiangsu, Peoples R China
[4] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Jiangsu, Peoples R China
Funding
UK Biotechnology and Biological Sciences Research Council (BBSRC)
Keywords
Brain tumor; Magnetic resonance image; Computer-aided diagnosis; Vision transformer; Token merging; Randomized vector functional-link; Convolutional neural network; Segmentation
DOI
10.1016/j.neucom.2023.127216
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The brain is the control center of the human nervous system, and brain tumors are among the deadliest diseases. Magnetic resonance imaging (MRI) is currently the most effective modality for early detection of brain tumors in clinical diagnosis because of its superior imaging quality for soft tissues. Manual analysis of brain MRI is error-prone, as it depends heavily on the radiologist's experience and level of fatigue. Computer-aided diagnosis (CAD) systems are increasingly impactful because they can provide accurate predictions from medical images using advanced computer vision techniques. This paper therefore presents a novel CAD method for brain tumor classification named RanMerFormer. A pre-trained vision transformer serves as the backbone model. A merging mechanism is then proposed to remove redundant tokens in the vision transformer, which substantially improves computational efficiency. Finally, a randomized vector functional-link serves as the head of RanMerFormer and can be trained swiftly. Simulation results on two public benchmark datasets show that RanMerFormer achieves state-of-the-art performance for brain tumor classification. The trained RanMerFormer can be applied in real-world scenarios to assist in brain tumor diagnosis.
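The abstract names two components without detailing them: a token-merging step and a randomized vector functional-link (RVFL) head. The following is a minimal NumPy sketch of both ideas under stated assumptions, not the authors' implementation. `merge_tokens` and `RVFLHead` are hypothetical names; the merging rule shown (averaging the most similar pairs from an even/odd split by cosine similarity) and the ridge-regression fit are common choices assumed here for illustration.

```python
import numpy as np

def merge_tokens(tokens, r):
    """Merge r tokens into their most similar partners (illustrative rule only).

    tokens: (n, d) array. Tokens are split into even-indexed set A and
    odd-indexed set B; the r tokens of A most similar to some token of B
    are averaged into that token. Returns (n - r, d); token order is not kept.
    """
    a, b = tokens[0::2], tokens[1::2].copy()
    a_n = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_n = b / np.linalg.norm(b, axis=1, keepdims=True)
    sim = a_n @ b_n.T                    # cosine similarity, shape (|A|, |B|)
    best = sim.argmax(axis=1)            # best partner in B for each A token
    order = np.argsort(-sim.max(axis=1)) # A tokens, most similar first
    counts = np.ones(len(b))             # how many tokens each B slot holds
    for i in order[:r]:
        j = best[i]
        b[j] = (b[j] * counts[j] + a[i]) / (counts[j] + 1)  # running mean
        counts[j] += 1
    return np.concatenate([a[order[r:]], b], axis=0)

class RVFLHead:
    """RVFL classifier: fixed random hidden layer plus direct input links.

    Only the output weights are trained, in closed form via ridge regression,
    which is why such a head can be fitted quickly.
    """
    def __init__(self, n_hidden=64, reg=1e-2, seed=0):
        self.n_hidden, self.reg, self.seed = n_hidden, reg, seed

    def _features(self, X):
        H = np.tanh(X @ self.W + self.b)  # random, untrained hidden features
        return np.hstack([X, H])          # direct link concatenated with H

    def fit(self, X, y, n_classes):
        rng = np.random.default_rng(self.seed)
        self.W = rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = rng.normal(size=self.n_hidden)
        D = self._features(X)
        Y = np.eye(n_classes)[y]          # one-hot targets
        A = D.T @ D + self.reg * np.eye(D.shape[1])
        self.beta = np.linalg.solve(A, D.T @ Y)  # ridge solution
        return self

    def predict(self, X):
        return (self._features(X) @ self.beta).argmax(axis=1)
```

In a pipeline like the one the abstract describes, the backbone's pooled feature vector for each image would play the role of `X`, and `merge_tokens` would be applied to the token sequence between transformer blocks; both placements are assumptions here.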
Pages: 12
Related papers
50 in total
  • [1] ATMformer: An Adaptive Token Merging Vision Transformer for Remote Sensing Image Scene Classification
    Niu, Yi
    Song, Zhuochen
    Luo, Qingyu
    Chen, Guochao
    Ma, Mingming
    Li, Fu
    REMOTE SENSING, 2025, 17 (04)
  • [2] Depression Classification Using Token Merging-Based Speech Spectrotemporal Transformer
    Kumar, Lokesh
    Kaustubh, Kumar
    Prasanna, S. R. Mahadeva
    SPEECH AND COMPUTER, SPECOM 2024, PT I, 2025, 15299 : 324 - 335
  • [3] Vision Transformers for Brain Tumor Classification
    Simon, Eliott
    Briassouli, Alexia
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOIMAGING), VOL 2, 2021 : 123 - 130
  • [4] Vision Transformer with window sequence merging mechanism for image classification
    Jiao, Erjie
    Leng, Qiangkui
    Guo, Jiamei
    Meng, Xiangfu
    Wang, Changzhong
    APPLIED SOFT COMPUTING, 2025, 171
  • [5] Vision transformer with feature calibration and selective cross-attention for brain tumor classification
    Mohammad Ali Labbaf Khaniki
    Marzieh Mirzaeibonehkhater
    Mohammad Manthouri
    Elham Hasani
    Iran Journal of Computer Science, 2025, 8 (2) : 335 - 347
  • [6] Combining the Transformer and Convolution for Effective Brain Tumor Classification Using MRI Images
    Aloraini, Mohammed
    Khan, Asma
    Aladhadh, Suliman
    Habib, Shabana
    Alsharekh, Mohammed F.
    Islam, Muhammad
    APPLIED SCIENCES-BASEL, 2023, 13 (06)
  • [7] Improving vision transformer for medical image classification via token-wise perturbation
    Li, Yuexiang
    Huang, Yawen
    He, Nanjun
    Ma, Kai
    Zheng, Yefeng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [8] Semi-supervised vision transformer with adaptive token sampling for breast cancer classification
    Wang, Wei
    Jiang, Ran
    Cui, Ning
    Li, Qian
    Yuan, Feng
    Xiao, Zhifeng
    FRONTIERS IN PHARMACOLOGY, 2022, 13
  • [9] Deep CNN for Brain Tumor Classification
    Ayadi, Wadhah
    Elhamzi, Wajdi
    Charfi, Imen
    Atri, Mohamed
    NEURAL PROCESSING LETTERS, 2021, 53 (01) : 671 - 700
  • [10] The Application of Vision Transformer in Image Classification
    He, Zhixuan
    2022 THE 6TH INTERNATIONAL CONFERENCE ON VIRTUAL AND AUGMENTED REALITY SIMULATIONS, ICVARS 2022, 2022 : 56 - 63