Optimized Vision Transformers for Superior Plant Disease Detection

Authors
Ouamane, Abdelmalik [1, 2]
Chouchane, Ammar [2, 3]
Himeur, Yassine [4]
Miniaoui, Sami [4]
Atalla, Shadi [4]
Mansoor, Wathiq [4]
Al-Ahmad, Hussain [4]
Affiliations
[1] Univ Biskra, Lab LI3C, Biskra 07000, Algeria
[2] Agence Themat Rech Sci St ATRSS, Es Senia 31000, Algeria
[3] Univ Ctr Barika, Barika 05001, Algeria
[4] Univ Dubai, Coll Engn & Informat Technol, Dubai, U Arab Emirates
Source
IEEE Access | 2025 / Vol. 13
Keywords
Plant disease detection; vision transformer; convolutional neural network; optimized ViT model; VGG19 and AlexNet
DOI
10.1109/ACCESS.2025.3547416
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Detecting plant diseases is vital for maintaining agricultural productivity and ensuring food security. Advances in computer vision, particularly with Vision Transformers (ViTs), have shown significant potential in improving the accuracy and efficiency of plant disease identification. This study provides a comprehensive evaluation of various ViT parameters to determine the most effective configuration for this purpose. Using the extensive PlantVillage dataset, we systematically analyzed the effects of patch sizes, image resolutions, embedding dimensions, the number of transformer blocks (depth), the number of heads in the multi-head attention layer, and the dimension of the MLP (FeedForward) layer on model performance. We introduced saliency map visualizations to enhance interpretability and evaluate the critical regions contributing to classification decisions, ensuring the approach's transparency and robustness. Our experiments identified the optimal ViT configuration as follows: image size = 224 × 224, patch size = 16, embedding dimension = 512, depth = 6, number of heads = 8, and MLP dimension = 1024. This configuration achieved an accuracy of 99.77% on the PlantVillage dataset. In addition, we incorporated a novel cross-dataset transferability evaluation to validate the generalizability of the proposed model. Comparative analysis with traditional convolutional neural network architectures, such as VGG19 and AlexNet, revealed that our optimized ViT model not only surpasses these models in accuracy but also requires significantly fewer trainable parameters and storage space. The incorporation of a lightweight, domain-specific fine-tuning process ensures the model's adaptability to new datasets with minimal computational overhead. Our findings highlight the scalability and adaptability of ViTs, emphasizing their ability to effectively handle varying image sizes and resolutions.
Moreover, our approach outperforms recent state-of-the-art methods across multiple databases, underscoring the efficacy of the chosen ViT parameters.
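
To make the reported configuration concrete, the following is a minimal sketch that derives the patch count, per-head dimension, and a rough trainable-parameter estimate from the hyperparameters listed in the abstract. It is not the authors' implementation. The names `ViTConfig` and `estimate_params` are hypothetical, and the sketch assumes a standard ViT layout (linear patch embedding with bias, a class token, learned positional embeddings, pre-norm blocks, a final LayerNorm, and a linear classification head), RGB input, and the commonly used 38-class PlantVillage split. None of these assumptions are stated in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ViTConfig:
    """Hyperparameters from the abstract, plus two labeled assumptions."""
    image_size: int = 224   # from the abstract
    patch_size: int = 16    # from the abstract
    embed_dim: int = 512    # from the abstract
    depth: int = 6          # from the abstract
    num_heads: int = 8      # from the abstract
    mlp_dim: int = 1024     # from the abstract
    in_channels: int = 3    # assumption: RGB input
    num_classes: int = 38   # assumption: 38-class PlantVillage split

    @property
    def num_patches(self) -> int:
        # Non-overlapping square patches tile the image.
        if self.image_size % self.patch_size != 0:
            raise ValueError("image_size must be divisible by patch_size")
        return (self.image_size // self.patch_size) ** 2

    @property
    def head_dim(self) -> int:
        # The embedding is split evenly across attention heads.
        if self.embed_dim % self.num_heads != 0:
            raise ValueError("embed_dim must be divisible by num_heads")
        return self.embed_dim // self.num_heads


def estimate_params(cfg: ViTConfig) -> int:
    """Rough parameter count for a standard ViT layout (an assumption)."""
    d, m = cfg.embed_dim, cfg.mlp_dim
    patch_embed = cfg.patch_size ** 2 * cfg.in_channels * d + d  # linear projection + bias
    cls_token = d                                                # one learned token
    pos_embed = (cfg.num_patches + 1) * d                        # patches + class token
    attention = 4 * d * d + 4 * d                                # Q, K, V, output projections + biases
    mlp = d * m + m + m * d + d                                  # two linear layers + biases
    norms = 2 * 2 * d                                            # two LayerNorms (scale + shift)
    block = attention + mlp + norms
    final_norm = 2 * d
    head = d * cfg.num_classes + cfg.num_classes                 # linear classifier + bias
    return patch_embed + cls_token + pos_embed + cfg.depth * block + final_norm + head


cfg = ViTConfig()
print(cfg.num_patches)       # 196 patches of 16 x 16 pixels
print(cfg.head_dim)          # 64 dimensions per head
print(estimate_params(cfg))  # estimate under the stated assumptions only
```

Under these assumptions the estimate comes to roughly 13.1 million parameters. The count reported in the paper may differ, since the abstract does not specify the exact layer layout or the number of classes.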
Pages: 48552-48570
Number of pages: 19