Study of convolutional neural network-based semantic segmentation methods on edge intelligence devices for field agricultural robot navigation line extraction

被引：27

作者：

Yu, Jiya ^{[1
]}

Zhang, Jiye ^{[1
]}

Shu, Aijing ^{[1
]}

Chen, Yujie ^{[1
]}

Chen, Jianneng ^{[2
]}

Yang, Yongjie ^{[3
]}

Tang, Wei ^{[3
]}

Zhang, Yanchao ^{[1
]}

机构：

[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Engn, Hangzhou 310018, Zhejiang, Peoples R China

[2] Zhejiang Sci Tech Univ, Sch Mech Engn, Hangzhou 310018, Zhejiang, Peoples R China

[3] China Natl Rice Res Inst, State Key Lab Rice Biol, Hangzhou 311400, Zhejiang, Peoples R China

来源：

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2023年 / 209卷

关键词：

Semantic segmentation; Deep learning; Edge computing; Navigation;

D O I：

10.1016/j.compag.2023.107811

中图分类号：

S [农业科学];

学科分类号：

09 ;

摘要：

Smart agricultural machinery is emerging as the future trend for field robots, and the fully automatic robot has a great application prospect. However, it is a big challenge for robots to navigate in complex farmland environments. In this research, 5 deep learning-based computer vision methods under different field scenes for field navigation line extraction were studied and successfully deployed on an embedded system, which can be integrated into robots for automatic navigation in the future. The field road was segmented by the semantic segmentation algorithm at first, and then the navigation line is extracted from the segmented image by a polygon fitting method. Finally, all the models are transformed through the TensorRT library and deployed on the edge computing device Jetson Nano. In the experiment, five reprehensive semantic segmentation networks namely UNet, Deeplabv3+, BiseNetv1, BiseNetv2, and ENet networks were selected. Among the five networks, Deeplabv3+ is the most accurate. In five scenes, its average segmentation accuracy is 84.87 %, and the navigation line error is 9.59 pixels. Especially in the third scene with shadow and occlusion, it performs best, with only 8.34 pixel error, But the speed of Deeplabv3+ is only 9.7 FPS. ENet, BiseNetv1, and BiseNetv2 are lightweight networks. The speed of ENet is 16.8 FPS, BiseNetv2 is 17 FPS, and BiseNetv1 is 15.8 FPS. In segmentation accuracy and navigation line error, ENet performs better than BiseNet series networks, which are 84.94 % and 10.73 pixels, respectively. In the third scene with shadow and occlusion, it also performs slightly better than BiseNet series networks. In summary, deep learning-based semantic segmentation methods have strong robustness and stability in complex environment compared with previous research. Among all currently available neural networks, ENet has the best performance and good application potential in field navigation.

引用

页数：11

共 27 条

[1] Aguiar A., 2020, COMPUT ELECTRON AGR, V175
[2] Unevenness Point Descriptor for Terrain Analysis in Mobile Robot Applications
Bellone, Mauro
Reina, Giulio
Giannoccaro, Nicola I.
Spedicato, Luigi
[J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10 : 1 - 10
[3] Bradski G, 2000, DR DOBBS J, V25, P120
[4] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[5] Morphology-based guidance line extraction for an autonomous weeding robot in paddy fields
Choi, Keun Ha
Han, Sang Kwon
Han, Sang Hoon
Park, Kwang-Ho
Kim, Kyung-Soo
Kim, Soohyun
[J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2015, 113 : 266 - 274
[6] Chong Toby, 2021, 2021 IEEE REG 10 S T, P1
[7] ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
Ding, Xiaohan
Guo, Yuchen
Ding, Guiguang
Han, Jungong
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1911 - 1920
[8] Path Tracking Control of Field Information-Collecting Robot Based on Improved Convolutional Neural Network Algorithm
Gu, Yili
Li, Zhiqiang
Zhang, Zhen
Li, Jun
Chen, Liqing
[J]. SENSORS, 2020, 20 (03)
[9] Han Y. H., 2012, International Conference on Automatic Control and Artificial Intelligence (ACAI 2012), P1381
[10] Global context based automatic road segmentation via dilated convolutional neural network
Lan, Meng
Zhang, Yipeng
Zhang, Lefei
Du, Bo
[J]. INFORMATION SCIENCES, 2020, 535 : 156 - 171

← 1 2 3 →