Real-time localization and 3D semantic map reconstruction for unstructured citrus orchards

Cited by: 10
Authors
Xiong, Juntao [1]
Liang, Junhao [1]
Zhuang, Yanyun [1]
Hong, Dan [1]
Zheng, Zhenhui [1]
Liao, Shisheng [1]
Hu, Wenxin [1]
Yang, Zhengang [1]
Affiliations
[1] South China Agr Univ, Coll Math & Informat, Guangzhou 510642, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Visual inertial SLAM; Semantic segmentation; Point cloud map; Semantic map; FRUIT; LIDAR
DOI
10.1016/j.compag.2023.108217
Chinese Library Classification
S [Agricultural Sciences]
Discipline classification code
09
Abstract
Semantic maps play a crucial role in smart agriculture, providing practical three-dimensional fruit tree data for orchard management and aiding the optimization of management strategies and the improvement of economic benefits. However, previous map studies have mainly focused on geometric features and have lacked semantic information, limiting robots' ability to reason about useful information in complex tasks and to achieve human-machine interaction. Furthermore, most existing map reconstructions have been offline or non-real-time, making it difficult to satisfy the needs of real-time decision-making and planning in agricultural scenarios. Therefore, this paper proposes a real-time localization and semantic map reconstruction method for unstructured citrus orchards, integrating the visual-inertial SLAM framework VINS-RGBD with the semantic segmentation algorithm BiSeNetV1. By conducting semantic segmentation of 2D RGB images and mapping the results to point clouds, a 3D semantic point cloud map is reconstructed. The statistical outlier removal filter and OctoMap are introduced for postprocessing to remove outliers and estimate obstacles in 3D space, constructing a more accurate, efficient and flexible map. The experimental results show that the proposed method achieved a semantic segmentation accuracy (mIoU) of 79.31% on a self-built citrus dataset, a citrus recall relative error of 11.29%, and a localization accuracy (mean translational error) of 1.917 m with the map constructed under an unstructured orchard scenario. Additionally, the average memory saving rate of the statistical outlier removal filter was 10.36%, and the average memory saving rate of OctoMap was 97.39%. The processing time for each frame of real-time front-end feature detection and tracking was 11.14 ms. Moreover, the deployed semantic segmentation network BiSeNetV1 achieved a processing time of 7.35 ms per frame.
These results indicate that the proposed method can achieve both high accuracy and real-time performance in semantic map reconstruction. This exploratory work provides theoretical and technical references for future research on more precise localization and more complete semantic mapping and has extensive application potential, providing essential technical support for intelligent agriculture.
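The two map-building steps named in the abstract, mapping 2D segmentation results to a point cloud and filtering it with statistical outlier removal, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the pinhole intrinsics `fx`, `fy`, `cx`, `cy`, the function names, and the parameters `k` and `std_ratio` are assumptions chosen for illustration, and the brute-force neighbour search is only practical for small clouds.

```python
import numpy as np

def backproject_labels(depth, labels, fx, fy, cx, cy):
    """Lift every pixel with valid depth to a 3D camera-frame point
    and carry its semantic class label along (pinhole model assumed)."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel column/row grids
    valid = depth > 0                               # zero depth means no measurement
    z = depth[valid]
    x = (u[valid] - cx) * z / fx
    y = (v[valid] - cy) * z / fy
    points = np.stack([x, y, z], axis=1)            # shape (N, 3)
    return points, labels[valid]

def statistical_outlier_removal(points, k=8, std_ratio=1.0):
    """Return a boolean mask keeping points whose mean distance to their
    k nearest neighbours is at most (global mean + std_ratio * global std).
    Brute-force O(N^2) distances; hypothetical helper, small clouds only."""
    diff = points[:, None, :] - points[None, :, :]
    dist = np.linalg.norm(diff, axis=2)
    # After sorting each row, column 0 is the zero distance to the point itself.
    knn = np.sort(dist, axis=1)[:, 1:k + 1]
    mean_d = knn.mean(axis=1)
    threshold = mean_d.mean() + std_ratio * mean_d.std()
    return mean_d <= threshold
```

In a full pipeline the labeled points would additionally be transformed into the world frame using the pose estimated by the SLAM front end before being accumulated into the map; that step is omitted here.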
Pages: 15
Related papers
50 in total
  • [21] LIV-GaussMap: LiDAR-Inertial-Visual Fusion for Real-Time 3D Radiance Field Map Rendering
    Hong, Sheng
    He, Junjie
    Zheng, Xinhu
    Zheng, Chunran
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): 9765 - 9772
  • [22] 3D Indoor Map Building with Monte Carlo Localization in 2D Map
    Zhao, Lei
    Fan, Zhun
    Li, Wenji
    Xie, Honghui
    Xiao, Yang
    2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 236 - 240
  • [23] FRPNet: An improved Faster-ResNet with PASPP for real-time semantic segmentation in the unstructured field scene
    Yang, Biao
    Yang, Sen
    Wang, Peng
    Wang, Hai
    Jiang, Jiaming
    Ni, Rongrong
    Yang, Changchun
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 217
  • [24] Real-time hash aggregation for blockchain system with 3D sensor network
    Hirai, Kensei
    Akiyama, Kuon
    Shinkuma, Ryoichi
    Mine, Aramu
    2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
  • [25] REAL-TIME 3D COLOR IMAGING WITH SINGLE-PHOTON LIDAR DATA
    Tachella, J.
    Altmann, Y.
    McLaughlin, S.
    Tourneret, J-Y
    2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 206 - 210
  • [26] Semantic 3D Reconstruction for Volumetric Modeling of Defects in Construction Sites
    Katsatos, Dimitrios
    Charalampous, Paschalis
    Schmidt, Patrick
    Kostavelis, Ioannis
    Giakoumis, Dimitrios
    Nalpantidis, Lazaros
    Tzovaras, Dimitrios
    ROBOTICS, 2024, 13 (07)
  • [27] Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning
    Hong, Ziyang
    Yue, C. Patrick
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 10326 - 10338
  • [28] Real-time Localization and Mapping Method for Agricultural Robot in Orchards Based on LiDAR/IMU Tight-coupling
    Shen Y.
    Xiao X.
    Liu H.
    Zhang X.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (11): 20 - 28, 48
  • [29] A REAL-TIME GRID MAP GENERATION AND OBJECT CLASSIFICATION FOR GROUND-BASED 3D LIDAR DATA USING IMAGE ANALYSIS TECHNIQUES
    Lee, Sang-Mook
    Im, Jeong Joon
    Lee, Bo-Hee
    Leonessa, Alexander
    Kurdila, Andrew
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2253 - 2256
  • [30] Real-Time RGB-D Simultaneous Localization and Mapping Guided by Terrestrial LiDAR Point Cloud for Indoor 3-D Reconstruction and Camera Pose Estimation
    Kang, Xujie
    Li, Jing
    Fan, Xiangtao
    Wan, Wenhui
    APPLIED SCIENCES-BASEL, 2019, 9 (16):