Deep Flower: A Deep Learning Approach for Accurate Flower Classification

被引：0

作者：

Soundarya, M. ^{[1
]}

Kannan, Muthu Vishal R. ^{[1
]}

Saravanan, T. D. ^{[1
]}

Praveen, V ^{[1
]}

Vinora, A. ^{[1
]}

机构：

[1] Velammal Coll Engn & Technol, Dept Informat Technol, Madurai, Tamil Nadu, India

来源：

2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024 | 2024年

关键词：

Deep Learning; Convolutional Neural Networks (CNNs); Flower Classification; Image Processing; Computer Vision; Botany; and Vision Transformer;

D O I：

10.1109/ACCAI61061.2024.10602375

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Flower classification is a challenging task in computer vision, requiring models to discern subtle visual differences among a vast array of floral species. In this project, we propose a novel approach to flower classification leveraging the Vision Transformer (ViT) algorithm, a cutting-edge deep learning architecture that has demonstrated remarkable success in image recognition tasks. The ViT model replaces traditional convolutional layers with self-attention mechanisms, allowing it to capture long-range dependencies and global context in images more effectively.Our methodology involves pre-processing a comprehensive dataset of floral images, extracting features using ViT, and training a robust classification model. The dataset encompasses a diverse range of flowers, capturing variations in color, shape, and size. The ViT model's attention mechanisms enable it to learn hierarchical representations, improving its ability to differentiate between subtle visual nuances characteristic of different flower species.We conduct extensive experiments to evaluate the proposed approach's performance, comparing it with traditional convolutional neural networks (CNNs) commonly used in flower classification tasks. Additionally, we explore techniques for model interpretability, shedding light on the decision-making process of the ViT-based classifier.The results demonstrate the effectiveness of the Vision Transformer in flower classification, surpassing the performance of conventional CNNs. The ViT model exhibits enhanced generalization capabilities and robustness to variations in illumination and background. Furthermore, our interpretability analysis provides insights into the discriminative features learned by the ViT model, contributing to a better understanding of its decision-making process.This study not only advances the realm of computer vision but also provides valuable insights applicable in agriculture, horticulture, and ecological monitoring. The constructed model highlights the capabilities of Deep Learning in tackling intricate classification challenges, laying groundwork for forthcoming endeavors in automating plant species identification and conservation initiatives.

引用

页数：9

共 25 条

[1] Chen Kevin, 2024, C COMP VIS IM UND
[2] Davis Robert, 2021, International Journal of Computer Vision (IJCV)
[3] Garcia Maria, 2020, ACM Transactions on Intelligent Systems and Technology
[4] Garcia William, 2021, Journal of Machine Learning Research
[5] Harris Michael, 2021, IEEE Transactions on Image Processing
[6] Johnson Andrew, 2023, NEURIPS C
[7] Johnson Emily, 2018, INT C IM PROC ICIP
[8] Lee Sarah, 2008, Improving Flower Classification Precision through Transfer Learning in Convolutional Neural Networks: A Study
[9] Liu David, 2024, C PATT REC LETT
[10] Lopez Isabella, 2022, International Journal of Robotics Research (IJRR)

← 1 2 3 →