Multi-modal deep learning for Fuji apple detection using RGB-D cameras and their radiometric capabilities

Cited by: 113
Authors
Gene-Mola, Jordi [1]
Vilaplana, Veronica [2]
Rosell-Polo, Joan R. [1]
Morros, Josep-Ramon [2]
Ruiz-Hidalgo, Javier [2]
Gregorio, Eduard [1]
Affiliations
[1] Univ Lleida UdL, Agrotecnio Ctr, Dept Agr & Forest Engn, Res Grp AgroICT & Precis Agr, Lleida, Catalonia, Spain
[2] Univ Politecn Cataluna, Dept Signal Theory & Commun, Barcelona, Catalonia, Spain
Keywords
RGB-D; Multi-modal faster R-CNN; Convolutional neural networks; Fruit detection; Agricultural robotics; Fruit reflectance; TERRESTRIAL LASER SCANNER; FRUIT DETECTION; PRECISION AGRICULTURE; STRUCTURED LIGHT; ORCHARD; IMAGES; COLOR; LIDAR; TREE; SENSORS;
DOI
10.1016/j.compag.2019.05.016
Chinese Library Classification number
S [Agricultural Sciences];
Discipline classification code
09 ;
Abstract
Fruit detection and localization will be essential for future agronomic management of fruit crops, with applications in yield prediction, yield mapping and automated harvesting. RGB-D cameras are promising sensors for fruit detection given that they provide geometrical information together with color data. Some of these sensors work on the principle of time-of-flight (ToF) and, besides color and depth, provide the backscatter signal intensity. However, this radiometric capability has not been exploited for fruit detection applications. This work presents the KFuji RGB-DS database, composed of 967 multi-modal images containing a total of 12,839 Fuji apples. Compilation of the database allowed a study of the usefulness of fusing RGB-D and radiometric information obtained with Kinect v2 for fruit detection. To do so, the signal intensity was range-corrected to overcome signal attenuation, obtaining an image that was proportional to the reflectance of the scene. A registration between RGB, depth and intensity images was then carried out. The Faster R-CNN model was adapted for use with five-channel input images: color (RGB), depth (D) and range-corrected intensity signal (S). Results show an improvement of 4.46% in F1-score when adding depth and range-corrected intensity channels, obtaining an F1-score of 0.898 and an AP of 94.8% when all channels are used. From our experimental results, it can be concluded that the radiometric capabilities of ToF sensors give valuable information for fruit detection.
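The range correction and five-channel stacking described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the inverse-square attenuation model (the `exponent=2.0` default), the function names `range_correct` and `stack_rgbds`, and the choice to zero out pixels with no valid depth are all assumptions made for illustration, and the images are assumed to be already registered to a common pixel grid.

```python
import numpy as np


def range_correct(intensity, depth_m, exponent=2.0):
    """Compensate backscatter intensity for distance-dependent attenuation.

    Assumption: the received signal falls off as 1 / depth**exponent, so
    multiplying by depth**exponent gives a value roughly proportional to
    scene reflectance. The exponent of 2 is an illustrative default.
    """
    intensity = np.asarray(intensity, dtype=np.float32)
    depth_m = np.asarray(depth_m, dtype=np.float32)
    corrected = intensity * np.power(depth_m, exponent)
    # Pixels without a valid depth reading (depth <= 0) are set to zero.
    corrected[depth_m <= 0] = 0.0
    return corrected


def stack_rgbds(rgb, depth_m, intensity):
    """Build an H x W x 5 array: R, G, B, depth (D), corrected signal (S)."""
    rgb = np.asarray(rgb, dtype=np.float32)
    depth_m = np.asarray(depth_m, dtype=np.float32)
    s = range_correct(intensity, depth_m)
    return np.concatenate([rgb, depth_m[..., None], s[..., None]], axis=-1)


# Usage with synthetic data standing in for registered sensor images.
rgb = np.zeros((4, 6, 3), dtype=np.float32)
depth = np.full((4, 6), 2.0, dtype=np.float32)
depth[0, 0] = 0.0  # simulate a pixel with missing depth
signal = np.full((4, 6), 3.0, dtype=np.float32)
rgbds = stack_rgbds(rgb, depth, signal)
```

An array of this shape could then be fed to a detector whose first convolutional layer accepts five input channels instead of three; how the pretrained RGB weights are extended to the two extra channels is a separate design decision not covered by this sketch.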
Pages: 689-698
Number of pages: 10