3D point cloud semantic segmentation toward large-scale unstructured agricultural scene classification

Times cited: 33
Authors
Chen, Yi [1, 3]
Xiong, Yingjun [1 ]
Zhang, Baohua [1 ]
Zhou, Jun [2 ]
Zhang, Qian [1 ]
Affiliations
[1] Nanjing Agr Univ, Coll Artificial Intelligence, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Agr Univ, Coll Engn, Nanjing, Jiangsu, Peoples R China
[3] Harbin Engn Univ, Coll Underwater Acoust Engn, Harbin, Heilongjiang, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Point clouds; Semantic segmentation; Scene classification; Unstructured agricultural scene; Deep learning;
DOI
10.1016/j.compag.2021.106445
Chinese Library Classification number
S [Agricultural Sciences];
Discipline classification code
09;
Abstract
In recent years, advances in computer vision, deep learning, and artificial intelligence, together with the growing availability of depth sensors and lidar, have driven rapid progress in three-dimensional (3D) point cloud semantic segmentation. Semantic segmentation of 3D point clouds in large-scale unstructured agricultural scenes is important for agricultural robots, which rely on it to perceive their surroundings, to navigate and position themselves autonomously, and to understand the scene. This study addresses that problem. A deeper semantic segmentation network for large-scale unstructured agricultural scenes was built by modifying the RandLA-Net architecture, including integrating and improving its local feature aggregation module, and good experimental results were obtained. To test how the point cloud sampling algorithm affects overall accuracy (OA) and mean intersection-over-union (mIoU), two models with the same network structure were built, one using random sampling and the other using farthest point sampling. The results show that the sampling algorithm has little effect on OA and mIoU, and that the final result depends mainly on the extraction of 3D point cloud features. In addition, two different Semantic3D datasets were used to test the effect of the dataset on the model's generalization ability, and the results show that the dataset has an important effect on the neural network model.
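The two sampling strategies and the two metrics named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names (`random_sampling`, `farthest_point_sampling`, `oa_and_miou`) are hypothetical, the sampling functions return point indices rather than features, and mIoU is averaged only over classes that actually occur, which is an assumption about the averaging convention.

```python
import random
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def random_sampling(points: Sequence[Point], k: int, seed: int = 0) -> List[int]:
    """Pick k distinct point indices uniformly at random (hypothetical helper)."""
    rng = random.Random(seed)
    return rng.sample(range(len(points)), k)


def farthest_point_sampling(points: Sequence[Point], k: int, start: int = 0) -> List[int]:
    """Greedy FPS: repeatedly pick the point farthest from those already chosen.

    Assumes k <= len(points).
    """
    def sq_dist(a: Point, b: Point) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    chosen = [start]
    # min_d[i] is the squared distance from point i to its nearest chosen point.
    min_d = [sq_dist(p, points[start]) for p in points]
    while len(chosen) < k:
        nxt = max(range(len(points)), key=min_d.__getitem__)
        chosen.append(nxt)
        for i, p in enumerate(points):
            d = sq_dist(p, points[nxt])
            if d < min_d[i]:
                min_d[i] = d
    return chosen


def oa_and_miou(y_true: Sequence[int], y_pred: Sequence[int],
                num_classes: int) -> Tuple[float, float]:
    """Overall accuracy and mean IoU over classes present in labels or predictions."""
    tp = [0] * num_classes
    fp = [0] * num_classes
    fn = [0] * num_classes
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but the point is not p
            fn[t] += 1  # true class t was missed
    oa = sum(tp) / len(y_true)
    ious = [tp[c] / (tp[c] + fp[c] + fn[c])
            for c in range(num_classes) if tp[c] + fp[c] + fn[c] > 0]
    return oa, sum(ious) / len(ious)


if __name__ == "__main__":
    cloud = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (10.0, 0.0, 0.0), (0.0, 5.0, 0.0)]
    print(random_sampling(cloud, 2))
    print(farthest_point_sampling(cloud, 3))
    print(oa_and_miou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

Random sampling does a constant amount of work per selected point, while this greedy farthest point sampling rescans the whole cloud for every selected point, which is why the choice matters for large-scale scenes even when, as the abstract reports, it has little effect on OA and mIoU.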
Pages: 10