Cucumber disease recognition with small samples using image-text-label-based multi-modal language model

被引:27
作者
Cao, Yiyi [1 ]
Chen, Lei [2 ]
Yuan, Yuan [2 ]
Sun, Guangling [1 ]
机构
[1] Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
[2] Chinese Acad Sci, HFIPS, Inst Intelligent Machines, Hefei 230031, Peoples R China
基金
中国国家自然科学基金;
关键词
Cucumber diseases; Multi -modal language model; Contrastive learning; Small samples;
D O I
10.1016/j.compag.2023.107993
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Few-shot learning methods only need a small size of samples to train a good model. Moreover, most of these methods consider a single modality, ignoring the correlation between multi-modal data. Therefore, using multi -modal methods to solve the small-sample-size problem has become the development trend of artificial intelli-gence. In recent years, a multi-model method called Vision-Language Pre-training (VLP) has emerged. The se-mantic relation between multiple modalities can be learned through pre-training, thus obtaining better performance on downstream tasks. Accordingly, this paper took cucumber disease recognition with small samples as an example and proposed a recognition method of a multi-modal language model based on image-text-label information. First, image-text multi-modal contrastive learning, image self-supervised contrastive learning, and label information were combined to measure the distance of samples in the common image-text-label space. Second, the classification methods and optimization of large-scale vision-language pre-training on small sample cucumber datasets were studied. The proposed model achieved a recognition accuracy rate of 94.84% on a small multi-modal cucumber disease dataset. Finally, some experiments on the public dataset demonstrated that our method has good generalization.
引用
收藏
页数:12
相关论文
共 39 条
[1]   Tomato plant disease detection using transfer learning with C-GAN synthetic images [J].
Abbas, Amreen ;
Jain, Sweta ;
Gour, Mahesh ;
Vankudothu, Swetha .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 187
[2]   Few-Shot Learning approach for plant disease classification using images taken in the field [J].
Argueso, David ;
Picon, Artzai ;
Irusta, Unai ;
Medela, Alfonso ;
San-Emeterio, Miguel G. ;
Bereciartua, Arantza ;
Alvarez-Gila, Aitor .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 175
[3]  
Caron M, 2020, ADV NEUR IN, V33
[4]   Agricultural Disease Image Dataset for Disease Identification Based on Machine Learning [J].
Chen, Lei ;
Yuan, Yuan .
BIG SCIENTIFIC DATA MANAGEMENT, 2019, 11473 :263-274
[5]  
Chen T., 2020, ADV NEUR IN
[6]  
Chen T, 2020, PR MACH LEARN RES, V119
[7]   An Empirical Study of Training Self-Supervised Vision Transformers [J].
Chen, Xinlei ;
Xie, Saining ;
He, Kaiming .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9620-9629
[8]  
Dosovitskiy Alexey, 2021, P ICLR
[9]   Hyperspectral Imaging Combined With Deep Transfer Learning for Rice Disease Detection [J].
Feng, Lei ;
Wu, Baohua ;
He, Yong ;
Zhang, Chu .
FRONTIERS IN PLANT SCIENCE, 2021, 12
[10]   Dual-branch, efficient, channel attention-based crop disease identification [J].
Gao, Ronghua ;
Wang, Rong ;
Feng, Lu ;
Li, Qifeng ;
Wu, Huarui .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 190