Transformer Neural Network for Weed and Crop Classification of High Resolution UAV Images

Cited by: 125
Authors
Reedha, Reenul [1]
Dericquebourg, Eric [1]
Canals, Raphael [2]
Hafiane, Adel [1]
Affiliations
[1] Univ Orleans, INSA CVL, PRISME Lab EA 4229, F-18022 Bourges, France
[2] Univ Orleans, PRISME Lab EA 4229, INSA CVL, F-45067 Orleans, France
Keywords
computer vision; deep learning; self-attention; vision transformers; remote sensing; drone; image classification; agriculture
DOI
10.3390/rs14030592
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Monitoring crops and weeds is a major challenge in agriculture and food production today. Weeds compete directly with crops for moisture, nutrients, and sunlight, and therefore have a significant negative impact on crop yield if not sufficiently controlled. Weed detection and mapping is an essential step in weed control. Many existing studies recognize the importance of remote sensing systems and machine learning algorithms in weed management. Deep learning approaches have shown good performance in many agriculture-related remote sensing tasks, such as plant classification and disease detection. However, despite their success, these approaches still face many challenges, such as high computation cost, the need for large labeled datasets, and intra-class discrimination (in the growing phase, weeds and crops share many attributes, such as color, texture, and shape). This paper aims to show that attention-based deep networks are a promising approach to addressing the aforementioned problems in the context of weed and crop recognition with a drone system. The specific objective of this study was to investigate vision transformers (ViT) and apply them to plant classification in Unmanned Aerial Vehicle (UAV) images. Data were collected using a high-resolution camera mounted on a UAV, which was deployed in beet, parsley, and spinach fields. The acquired data were augmented to build a larger dataset, since ViT requires large sample sets for better performance; we also adopted a transfer learning strategy. Experiments were designed to assess the effect of training and validation dataset size, as well as the effect of increasing the test set while reducing the training set. The results show that with a small labeled training dataset, the ViT models outperform state-of-the-art models such as EfficientNet and ResNet. These results are promising and show the potential of ViT to be applied to a wide range of remote sensing image analysis tasks.
Pages: 20