Optimizing Strawberry Disease and Quality Detection with Vision Transformers and Attention-Based Convolutional Neural Networks

被引:2
作者
Aghamohammadesmaeilketabforoosh, Kimia [1 ]
Nikan, Soodeh [1 ]
Antonini, Giorgio [1 ]
Pearce, Joshua M. [1 ,2 ]
机构
[1] Western Univ, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
[2] Western Univ, Ivey Business Sch, London, ON N6A 3K7, Canada
关键词
computer vision; monitoring; strawberries; yield monitoring; image classification; machine learning; vision transformers; MobileNetV2; ResNet18;
D O I
10.3390/foods13121869
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Machine learning and computer vision have proven to be valuable tools for farmers to streamline their resource utilization to lead to more sustainable and efficient agricultural production. These techniques have been applied to strawberry cultivation in the past with limited success. To build on this past work, in this study, two separate sets of strawberry images, along with their associated diseases, were collected and subjected to resizing and augmentation. Subsequently, a combined dataset consisting of nine classes was utilized to fine-tune three distinct pretrained models: vision transformer (ViT), MobileNetV2, and ResNet18. To address the imbalanced class distribution in the dataset, each class was assigned weights to ensure nearly equal impact during the training process. To enhance the outcomes, new images were generated by removing backgrounds, reducing noise, and flipping them. The performances of ViT, MobileNetV2, and ResNet18 were compared after being selected. Customization specific to the task was applied to all three algorithms, and their performances were assessed. Throughout this experiment, none of the layers were frozen, ensuring all layers remained active during training. Attention heads were incorporated into the first five and last five layers of MobileNetV2 and ResNet18, while the architecture of ViT was modified. The results indicated accuracy factors of 98.4%, 98.1%, and 97.9% for ViT, MobileNetV2, and ResNet18, respectively. Despite the data being imbalanced, the precision, which indicates the proportion of correctly identified positive instances among all predicted positive instances, approached nearly 99% with the ViT. MobileNetV2 and ResNet18 demonstrated similar results. Overall, the analysis revealed that the vision transformer model exhibited superior performance in strawberry ripeness and disease classification. The inclusion of attention heads in the early layers of ResNet18 and MobileNet18, along with the inherent attention mechanism in ViT, improved the accuracy of image identification. These findings offer the potential for farmers to enhance strawberry cultivation through passive camera monitoring alone, promoting the health and well-being of the population.
引用
收藏
页数:16
相关论文
共 46 条
  • [1] An Instance Segmentation Model for Strawberry Diseases Based on Mask R-CNN
    Afzaal, Usman
    Bhattarai, Bhuwan
    Pandeya, Yagya Raj
    Lee, Joonwhoan
    [J]. SENSORS, 2021, 21 (19)
  • [2] Hyperparameter optimization in learning systems
    Andonie, Razvan
    [J]. JOURNAL OF MEMBRANE COMPUTING, 2019, 1 (04) : 279 - 291
  • [3] [Anonymous], 2016, Principal Component Regression for Crop Yield Estimation Electronic Resource
  • [4] A systematic study of the class imbalance problem in convolutional neural networks
    Buda, Mateusz
    Maki, Atsuto
    Mazurowski, Maciej A.
    [J]. NEURAL NETWORKS, 2018, 106 : 249 - 259
  • [5] Emerging Properties in Self-Supervised Vision Transformers
    Caron, Mathilde
    Touvron, Hugo
    Misra, Ishan
    Jegou, Herve
    Mairal, Julien
    Bojanowski, Piotr
    Joulin, Armand
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9630 - 9640
  • [6] Identifying crop diseases using attention embedded MobileNet-V2 model
    Chen, Junde
    Zhang, Defu
    Suzauddola, Md
    Zeb, Adnan
    [J]. APPLIED SOFT COMPUTING, 2021, 113 (113)
  • [7] Strawberry Yield Prediction Based on a Deep Neural Network Using High-Resolution Aerial Orthoimages
    Chen, Yang
    Lee, Won Suk
    Gan, Hao
    Peres, Natalia
    Fraisse, Clyde
    Zhang, Yanchao
    He, Yong
    [J]. REMOTE SENSING, 2019, 11 (13)
  • [8] Daubney H.A., 2015, The Canadian Encyclopedia
  • [9] Denkenberger D., 2014, FEEDING EVERYONE NO
  • [10] Dhiman S., 2020, Social Entrepreneurship and Corporate Social Responsibility, P193, DOI DOI 10.1007/978-3-030-39676-313