Fast detection and location of longan fruits using UAV images

被引:40
作者
Li, Denghui [1 ]
Sun, Xiaoxuan [2 ,3 ]
Elkhouchlaa, Hamza [1 ]
Jia, Yuhang [1 ]
Yao, Zhongwei [1 ]
Lin, Peiyi [1 ]
Li, Jun [1 ,4 ]
Lu, Huazhong [5 ]
机构
[1] South China Agr Univ, Coll Engn, Guangzhou 510642, Peoples R China
[2] Chinese Acad Sci, Key Lab South China Agr Plant Mol Anal & Genet Im, Guangdong Prov Key Lab Appl Bot, Bot Garden, Guangzhou 510650, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Guangdong Lab Lingnan Modern Agr, Guangzhou 510640, Peoples R China
[5] Guangdong Acad Agr Sci, Guangzhou 510640, Peoples R China
关键词
UAV; Image analysis; Convolutional neural network; RGB-D image; Detection and location of longan; MECHANICAL DAMAGE; RGB; AGRICULTURE; VEHICLE; VISION;
D O I
10.1016/j.compag.2021.106465
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
In agriculture, fruit picking robots on the ground have difficulty adapting to the terrain conditions of mountain orchards and cannot pick longan fruit from tall longan trees. In this paper, aiming to allow picking of longan fruit by unmanned aerial vehicles (UAVs), a deep learning-based scheme to quickly and accurately detect and locate suitable picking points on fruit branches is proposed. The scheme includes a UAV fuzzy image preprocessing method, longan detection based on a convolutional neural network (CNN), red, green, blue and depth (RGB-D) information fusion and an accurate target location strategy. First, the UAV is equipped with an Intel Realsense D455 camera, which collects longan images from the front for training and testing the model. Second, the lightweight MobileNet backbone network is used to improve the performance of the You Only Look Once version 4 (YOLOv4) model in feature extraction. The results for the test set show that compared with the classical feature pyramid network (FPN), YOLOv3 and YOLOv4 models, this model reduces the computation, parameters and detection time of the model. Compared with MobileNet single-shot multibox detector (MobileNet-SSD) and YOLOv4-tiny, this model exhibits improved detection accuracy. Third, according to the target detection result map, a strategy is formulated to accurately determine the suitable picking point on the main branch of the result. Finally, the performance of the improved model and picking platform in the harvest scene is evaluated by performing picking experiments in a longan orchard. In summary, we fully exploit the advantages of the combination of UAVs, RGB-D cameras and CNNs to improve the speed and accuracy of target detection and location for longan picking by UAVs based on vision.
引用
收藏
页数:15
相关论文
共 45 条
  • [1] Performance of deep learning models for classifying and detecting common weeds in corn and soybean production systems
    Ahmad, Aanis
    Saraswat, Dharmendra
    Aggarwal, Varun
    Etienne, Aaron
    Hancock, Benjamin
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 184 (184)
  • [2] Alpaydin E., 2021, Neural networks and deep learning, DOI [10.7551/mitpress/13811.003.0007, DOI 10.7551/MITPRESS/13811.003.0007]
  • [3] A deep learning approach for anthracnose infected trees classification in walnut orchards
    Anagnostis, A.
    Tagarakis, A. C.
    Asiminari, G.
    Papageorgiou, E.
    Kateris, D.
    Moshou, D.
    Bochtis, D.
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 182
  • [4] Bargoti Suchet, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3626, DOI 10.1109/ICRA.2017.7989417
  • [5] Beyaz A., AMA AGR MECH ASIA AF, V51, P1193
  • [6] Harvest glove and LabView based mechanical damage determination on apples
    Beyaz, Abdullah
    [J]. SCIENTIA HORTICULTURAE, 2018, 228 : 49 - 55
  • [7] Beyaz A, 2010, J FOOD AGRIC ENVIRON, V8, P476
  • [8] Bochkovskiy A., 2020, ARXIV, DOI DOI 10.48550/ARXIV.2004.10934
  • [9] Dyrmann M., 2017, Advances in Animal Biosciences, V8, P842, DOI [DOI 10.1017/S2040470017000206, 10.1017/s2040470017000206, 10.1017/S2040470017000206]
  • [10] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338