Fast implementation of real-time fruit detection in apple orchards using deep learning

被引:162
作者
Kang, Hanwen [1 ]
Chen, Chao [1 ]
机构
[1] Monash Univ, Dept Mech & Aerosp Engn, Melbourne, Vic, Australia
关键词
Fruit detection; Deep learning; Real-time; Data labelling; Robotic harvesting; CONVOLUTIONAL NEURAL-NETWORKS; VISION-BASED CONTROL; CROP;
D O I
10.1016/j.compag.2019.105108
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
To perform robust and efficient fruit detection in orchards is challenging since there are a number of variances in the working environments. Recently, deep-learning have shown a promising performance in many visual-guided agriculture applications. However, deep-learning based approaches requires labelling on training data, which is a labour-intensive and time-consuming task. In this study, a fast implementation framework of a deep-learning based fruit detector for apple harvesting is developed. The developed framework comprises an auto label generation module and a deep-learning-based fruit detector 'LedNet'. The Label Generation algorithm utilises the multi-scale pyramid and clustering classifier to assist fast labelling of training data. LedNet adopts feature pyramid network and atrous spatial pyramid pooling to improve the detection performance of the model. A light-weight backbone is also developed and utilised to improve computational efficiency. From the experimental results, LedNet achieves 0.821 and 0.853 on recall and accuracy on apple detection in orchards, and its weights size and inference time are 7.4 M and 28 ms, respectively. The experimental results show that LedNet can perform real-time apple detection in orchards robustly and efficiently.
引用
收藏
页数:10
相关论文
共 45 条
[1]  
[Anonymous], 2012, International Journal of Computational Vision and Robotics, DOI [DOI 10.1504/IJCVR.2012.046419, 10.1504/IJCVR.2012.046419]
[2]  
[Anonymous], 2017, P IEEE INT C COMPUTE
[3]  
[Anonymous], 2017, J FIELD ROBOT, DOI DOI 10.1002/rob.21699
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]  
Bargoti Suchet, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3626, DOI 10.1109/ICRA.2017.7989417
[6]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[7]   Mixture model for face-color modeling and segmentation [J].
Greenspan, H ;
Goldberger, J ;
Eshet, I .
PATTERN RECOGNITION LETTERS, 2001, 22 (14) :1525-1536
[8]  
Hashimoto K, 2003, ADV ROBOTICS, V17, P969, DOI 10.1163/156855303322554373
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269