Deep learning-based apple detection using a suppression mask R-CNN

被引:97
作者
Chu, Pengyu [1 ]
Li, Zhaojian [1 ]
Lammers, Kyle [1 ]
Lu, Renfu [2 ]
Liu, Xiaoming [3 ]
机构
[1] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA
[2] ARS, USDA, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
关键词
Vision system; Fruit detection; Deep learning; Robotic harvesting; Image segmentation; COMPUTER VISION; FRUIT;
D O I
10.1016/j.patrec.2021.04.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robotic apple harvesting has received much research attention in the past few years due to growing shortage and rising cost in labor. One key enabling technology towards automated harvesting is accurate and robust apple detection, which poses great challenges as a result of the complex orchard environment that involves varying lighting conditions and foliage/branch occlusions. This letter reports on the development of a novel deep learning-based apple detection framework named Suppression Mask R-CNN. Specifically, we first collect a comprehensive apple orchard dataset for "Gala" and "Blondee" apples, using a color camera, under different lighting conditions (overcast and front lighting vs. back lighting). We then develop a novel suppression Mask R-CNN for apple detection, in which a suppression branch is added to the standard Mask R-CNN to suppress non-apple features generated by the original network. Comprehensive evaluations are performed, which show that the developed suppression Mask R-CNN network outperforms state-of-the-art models with a higher F1-score of 0.905 and a detection time of 0.25 second per frame on a standard desktop computer. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:206 / 211
页数:6
相关论文
共 30 条
[1]  
[Anonymous], 2018, COMPUT IND, DOI DOI 10.1016/j.compind.2018.03.010
[2]  
[Anonymous], 2010, AGR ENG INT CIGR J
[3]   Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards [J].
Bargoti, Suchet ;
Underwood, James P. .
JOURNAL OF FIELD ROBOTICS, 2017, 34 (06) :1039-1060
[4]  
Benady Meny., 1992, Paper-American Society of Agricultural Engineers (USA)
[5]   The VIA Annotation Software for Images, Audio and Video [J].
Dutta, Abhishek ;
Zisserman, Andrew .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :2276-2279
[6]   Musculoskeletal disorders in labor-intensive agriculture [J].
Fathallah, Fadi A. .
APPLIED ERGONOMICS, 2010, 41 (06) :738-743
[7]  
Gallardo K., 2012, Cost Estimates of Establishing and Producing Gala Apples in Washington: Washington State University Extension Fact Sheet. FS005E
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Region-Based Convolutional Networks for Accurate Object Detection and Segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) :142-158
[10]  
He K., 2017, IEEE I CONF COMP VIS, P2961, DOI DOI 10.1109/ICCV.2017.322