FoveaMask: A fast and accurate deep learning model for green fruit instance segmentation

被引:38
作者
Jia, Weikuan [1 ]
Zhang, Zhonghua [1 ]
Shao, Wenjing [1 ]
Hou, Sujuan [1 ]
Ji, Ze [2 ]
Liu, Guoliang [3 ]
Yin, Xiang [4 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, Wales
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] Shandong Univ Technol, Sch Agr Engn & Food Sci, Zibo 255000, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Deep learning; Green fruit; Instance segmentation; FoveaMask; APPLES;
D O I
10.1016/j.compag.2021.106488
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
In the process of agricultural automation production, efficient and accurate segmentation of target fruit is the basis and guarantee for numerous applications including crop growth monitoring, yield prognosis, and machine picking. Green fruit is apt to affect by complicated scenes such as occlusions and overlaps, as well as the homochromatic background, which brings great challenges to the recognition and segmentation. In this paper, we propose a novel methodology named FoveaMask for improving the robustness and generalization of green fruit segmentation framework. Features of input images are firstly extracted by ResNet and fused by FPN. The classification and bounding-box regression of each spatial position on the feature maps are then carried out directly by the way of full convolution. RoI Align layer is applied to fix feature region of proposals to the same size yet preserve exact spatial locations. Finally, instance-level fruit segmentation is realized by pixel-level classification on each proposal using embedded mask branches. The whole network architecture of the new model does not involve the related design and operation of anchor, which greatly improves the generalization ability for different shape fruits, alleviates the computing and storage resources, and balances the contradiction between accuracy and efficiency simultaneously. Additionally, a position attention module(PAM) is introduced into the embedding mask branch to aggregate the effective information pixels, which can improve the robustness of the segmentation model in the actual complex environment of the orchard. We test the model on green apple and immature persimmon data sets and the experimental results show that FoveaMask performs best in terms of both recognition accuracy and model complexity compared with other 11 different types of detection and segmentation models.
引用
收藏
页数:15
相关论文
共 38 条
[1]  
Ahmad MT, 2018, 2018 5 INT MULTITOPI, P1, DOI DOI 10.1109/IMTIC.2018.8467275.
[2]  
Bargoti Suchet, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3626, DOI 10.1109/ICRA.2017.7989417
[3]   Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards [J].
Bargoti, Suchet ;
Underwood, James P. .
JOURNAL OF FIELD ROBOTICS, 2017, 34 (06) :1039-1060
[4]   YOLACT Real-time Instance Segmentation [J].
Bolya, Daniel ;
Zhou, Chong ;
Xiao, Fanyi ;
Lee, Yong Jae .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9156-9165
[5]   Hybrid Task Cascade for Instance Segmentation [J].
Chen, Kai ;
Pang, Jiangmiao ;
Wang, Jiaqi ;
Xiong, Yu ;
Li, Xiaoxiao ;
Sun, Shuyang ;
Feng, Wansen ;
Liu, Ziwei ;
Shi, Jianping ;
Ouyang, Wanli ;
Loy, Chen Change ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978
[6]   MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features [J].
Chen, Liang-Chieh ;
Hermans, Alexander ;
Papandreou, George ;
Schroff, Florian ;
Wang, Peng ;
Adam, Hartwig .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4013-4022
[7]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[8]   Immature green citrus fruit detection using color and thermal images [J].
Gan, H. ;
Lee, W. S. ;
Alchanatis, V. ;
Ehsani, R. ;
Schueller, J. K. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 152 :117-125
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Mask Scoring R-CNN [J].
Huang, Zhaojin ;
Huang, Lichao ;
Gong, Yongchao ;
Huang, Chang ;
Wang, Xinggang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6402-6411