Study of convolutional neural network-based semantic segmentation methods on edge intelligence devices for field agricultural robot navigation line extraction

被引:27
作者
Yu, Jiya [1 ]
Zhang, Jiye [1 ]
Shu, Aijing [1 ]
Chen, Yujie [1 ]
Chen, Jianneng [2 ]
Yang, Yongjie [3 ]
Tang, Wei [3 ]
Zhang, Yanchao [1 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Engn, Hangzhou 310018, Zhejiang, Peoples R China
[2] Zhejiang Sci Tech Univ, Sch Mech Engn, Hangzhou 310018, Zhejiang, Peoples R China
[3] China Natl Rice Res Inst, State Key Lab Rice Biol, Hangzhou 311400, Zhejiang, Peoples R China
关键词
Semantic segmentation; Deep learning; Edge computing; Navigation;
D O I
10.1016/j.compag.2023.107811
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Smart agricultural machinery is emerging as the future trend for field robots, and the fully automatic robot has a great application prospect. However, it is a big challenge for robots to navigate in complex farmland environments. In this research, 5 deep learning-based computer vision methods under different field scenes for field navigation line extraction were studied and successfully deployed on an embedded system, which can be integrated into robots for automatic navigation in the future. The field road was segmented by the semantic segmentation algorithm at first, and then the navigation line is extracted from the segmented image by a polygon fitting method. Finally, all the models are transformed through the TensorRT library and deployed on the edge computing device Jetson Nano. In the experiment, five reprehensive semantic segmentation networks namely UNet, Deeplabv3+, BiseNetv1, BiseNetv2, and ENet networks were selected. Among the five networks, Deeplabv3+ is the most accurate. In five scenes, its average segmentation accuracy is 84.87 %, and the navigation line error is 9.59 pixels. Especially in the third scene with shadow and occlusion, it performs best, with only 8.34 pixel error, But the speed of Deeplabv3+ is only 9.7 FPS. ENet, BiseNetv1, and BiseNetv2 are lightweight networks. The speed of ENet is 16.8 FPS, BiseNetv2 is 17 FPS, and BiseNetv1 is 15.8 FPS. In segmentation accuracy and navigation line error, ENet performs better than BiseNet series networks, which are 84.94 % and 10.73 pixels, respectively. In the third scene with shadow and occlusion, it also performs slightly better than BiseNet series networks. In summary, deep learning-based semantic segmentation methods have strong robustness and stability in complex environment compared with previous research. Among all currently available neural networks, ENet has the best performance and good application potential in field navigation.
引用
收藏
页数:11
相关论文
共 27 条
  • [1] Aguiar A., 2020, COMPUT ELECTRON AGR, V175
  • [2] Unevenness Point Descriptor for Terrain Analysis in Mobile Robot Applications
    Bellone, Mauro
    Reina, Giulio
    Giannoccaro, Nicola I.
    Spedicato, Luigi
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10 : 1 - 10
  • [3] Bradski G, 2000, DR DOBBS J, V25, P120
  • [4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [5] Morphology-based guidance line extraction for an autonomous weeding robot in paddy fields
    Choi, Keun Ha
    Han, Sang Kwon
    Han, Sang Hoon
    Park, Kwang-Ho
    Kim, Kyung-Soo
    Kim, Soohyun
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2015, 113 : 266 - 274
  • [6] Chong Toby, 2021, 2021 IEEE REG 10 S T, P1
  • [7] ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
    Ding, Xiaohan
    Guo, Yuchen
    Ding, Guiguang
    Han, Jungong
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1911 - 1920
  • [8] Path Tracking Control of Field Information-Collecting Robot Based on Improved Convolutional Neural Network Algorithm
    Gu, Yili
    Li, Zhiqiang
    Zhang, Zhen
    Li, Jun
    Chen, Liqing
    [J]. SENSORS, 2020, 20 (03)
  • [9] Han Y. H., 2012, International Conference on Automatic Control and Artificial Intelligence (ACAI 2012), P1381
  • [10] Global context based automatic road segmentation via dilated convolutional neural network
    Lan, Meng
    Zhang, Yipeng
    Zhang, Lefei
    Du, Bo
    [J]. INFORMATION SCIENCES, 2020, 535 : 156 - 171