Real-time localization and 3D semantic map reconstruction for unstructured citrus orchards

被引：10

作者：

Xiong, Juntao ^{[1
]}

Liang, Junhao ^{[1
]}

Zhuang, Yanyun ^{[1
]}

Hong, Dan ^{[1
]}

Zheng, Zhenhui ^{[1
]}

Liao, Shisheng ^{[1
]}

Hu, Wenxin ^{[1
]}

Yang, Zhengang ^{[1
]}

机构：

[1] South China Agr Univ, Coll Math & Informat, Guangzhou 510642, Peoples R China

来源：

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2023年 / 213卷

基金：

中国国家自然科学基金;

关键词：

Visual inertial SLAM; Semantic segmentation; Point cloud map; Semantic map; FRUIT; LIDAR;

D O I：

10.1016/j.compag.2023.108217

中图分类号：

S [农业科学];

学科分类号：

09 ;

摘要：

Semantic maps play a crucial role in smart agriculture, providing practical three-dimensional fruit tree data for orchard management and aiding the optimization of management strategies and improvement of economic benefits. However, previous map studies have mainly focused on geometric features and have lacked semantic information, limiting robots' ability to reason about useful information in complex tasks and achieve human --machine interaction. Furthermore, most existing map reconstructions have been offline or nonreal-time, making it difficult to satisfy the needs of real-time decision-making and planning in agricultural scenarios. Therefore, this paper proposes a real-time localization and semantic map reconstruction method for unstructured citrus or-chards, integrating the visual-inertial SLAM VINS-RGBD framework with the semantic segmentation algorithm BiSeNetV1. By conducting semantic segmentation of 2D RGB images and mapping them to point clouds, a 3D semantic point cloud map is reconstructed. The statistical outlier removal filter and OctoMap are introduced for postprocessing to remove outliers and estimate obstacles in 3D space, constructing a more accurate, efficient and flexible map. The experimental results show that the proposed method achieved a semantic segmentation ac-curacy mIoU of 79.31% on a self-built citrus dataset, a citrus recall relative error of 11.29% and a localization accuracy mean translational error of 1.917 m with the map constructed under an unstructured orchard scenario. Additionally, the average memory saving rate of the statistical outlier removal filter was 10.36%, and the average memory saving rate of OctoMap was 97.39%. The processing time for each frame of real-time front-end feature detection and tracking was 11.14 ms. Moreover, the deployed semantic segmentation network BiSeNetV1 achieved a processing time of 7.35 ms per frame. These results indicate that the proposed method can achieve both high accuracy and real-time performance in semantic map reconstruction. This exploratory work provides theoretical and technical references for future research on more precise localization and more complete semantic mapping and has extensive application potential, providing essential technical support for intelligent agriculture.

引用

页数：15

共 50 条

[21] LIV-GaussMap: LiDAR-Inertial-Visual Fusion for Real-Time 3D Radiance Field Map Rendering
Hong, Sheng
He, Junjie
Zheng, Xinhu
Zheng, Chunran
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9765 - 9772
[22] 3D Indoor Map Building with Monte Carlo Localization in 2D Map
Zhao, Lei
Fan, Zhun
Li, Wenji
Xie, Honghui
Xiao, Yang
2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 236 - 240
[23] FRPNet: An improved Faster-ResNet with PASPP for real-time semantic segmentation in the unstructured field scene
Yang, Biao
Yang, Sen
Wang, Peng
Wang, Hai
Jiang, Jiaming
Ni, Rongrong
Yang, Changchun
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 217
[24] Real-time hash aggregation for blockchain system with 3D sensor network
Hirai, Kensei
Akiyama, Kuon
Shinkuma, Ryoichi
Mine, Aramu
2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
[25] REAL-TIME 3D COLOR IMAGING WITH SINGLE-PHOTON LIDAR DATA
Tachella, J.
Altmann, Y.
McLaughlin, S.
Tourneret, J-Y
2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 206 - 210
[26] Semantic 3D Reconstruction for Volumetric Modeling of Defects in Construction Sites
Katsatos, Dimitrios
Charalampous, Paschalis
Schmidt, Patrick
Kostavelis, Ioannis
Giakoumis, Dimitrios
Nalpantidis, Lazaros
Tzovaras, Dimitrios
ROBOTICS, 2024, 13 (07)
[27] Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning
Hong, Ziyang
Yue, C. Patrick
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 10326 - 10338
[28] Real-time Localization and Mapping Method for Agricultural Robot in Orchards Based on LiDAR/IMU Tight-coupling
Shen Y.
Xiao X.
Liu H.
Zhang X.
Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (11): : 20 - 28and48
[29] A REAL-TIME GRID MAP GENERATION AND OBJECT CLASSIFICATION FOR GROUND-BASED 3D LIDAR DATA USING IMAGE ANALYSIS TECHNIQUES
Lee, Sang-Mook
Im, Jeong Joon
Lee, Bo-Hee
Leonessa, Alexander
Kurdila, Andrew
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2253 - 2256
[30] Real-Time RGB-D Simultaneous Localization and Mapping Guided by Terrestrial LiDAR Point Cloud for Indoor 3-D Reconstruction and Camera Pose Estimation
Kang, Xujie
Li, Jing
Fan, Xiangtao
Wan, Wenhui
APPLIED SCIENCES-BASEL, 2019, 9 (16):

← 1 2 3 4 5 →