SG-SLAM: A Real-Time RGB-D Visual SLAM Toward Dynamic Scenes With Semantic and Geometric Information

Cited by: 73
Authors
Cheng, Shuhong [1]
Sun, Changhe [1]
Zhang, Shijun [2]
Zhang, Dianfan [3]
Affiliations
[1] Yanshan Univ, Sch Elect Engn, Qinhuangdao 066000, Hebei, Peoples R China
[2] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066000, Hebei, Peoples R China
[3] Yanshan Univ, Key Lab Special Delivery Equipment, Qinhuangdao 066004, Hebei, Peoples R China
Keywords
Semantics; Heuristic algorithms; Measurement; Simultaneous localization and mapping; Visualization; Vehicle dynamics; Robots; Dynamic scenes; geometric constraint; semantic metric map; visual-based measurement; visual simultaneous localization and mapping (SLAM); SIMULTANEOUS LOCALIZATION
DOI
10.1109/TIM.2022.3228006
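Chinese Library Classification (CLC) number
TM [Electrical technology]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract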
Simultaneous localization and mapping (SLAM) is one of the fundamental capabilities for intelligent mobile robots to perform state estimation in unknown environments. However, most visual SLAM systems rely on the static scene assumption and consequently suffer severely reduced accuracy and robustness in dynamic scenes. Moreover, the metric maps constructed by many systems lack semantic information, so robots cannot understand their surroundings at a human cognitive level. In this article, we propose SG-SLAM, a real-time RGB-D semantic visual SLAM system based on the ORB-SLAM2 framework. First, SG-SLAM adds two new parallel threads: an object detection thread that obtains 2-D semantic information, and a semantic mapping thread. Then, a fast dynamic feature rejection algorithm combining semantic and geometric information is added to the tracking thread. Finally, the semantic mapping thread generates 3-D point clouds and 3-D semantic objects, which are published to the robot operating system (ROS) for visualization. We performed an experimental evaluation on the TUM dataset, the Bonn dataset, and the OpenLORIS-Scene dataset. The results show that SG-SLAM is not only one of the most accurate and robust real-time systems in dynamic scenes but also allows the creation of intuitive semantic metric maps.
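The abstract does not spell out how semantic and geometric information are combined, so the following is only a minimal sketch of one plausible rule, not the authors' implementation. It assumes that the geometric cue is the distance of a matched feature from its epipolar line, that the semantic cue is a set of 2-D detection boxes for potentially dynamic objects, and that features inside such a box are held to a stricter threshold. The function names, the box format, and the pixel thresholds `loose_px` and `strict_px` are all hypothetical.

```python
import numpy as np


def epipolar_distances(F, pts_prev, pts_cur):
    """Pixel distance from each current point to the epipolar line of its match.

    F is a 3x3 fundamental matrix mapping previous-frame points to lines in
    the current frame; pts_prev and pts_cur are (N, 2) arrays of matches.
    """
    ones = np.ones((len(pts_prev), 1))
    x1 = np.hstack([pts_prev, ones])          # homogeneous, previous frame
    x2 = np.hstack([pts_cur, ones])           # homogeneous, current frame
    lines = x1 @ F.T                          # each row is the line F @ x1
    num = np.abs(np.sum(lines * x2, axis=1))  # |x2^T F x1|
    den = np.hypot(lines[:, 0], lines[:, 1])  # normalise by line direction
    return num / np.maximum(den, 1e-12)


def in_any_box(pts, boxes):
    """True where a point lies inside at least one (x_min, y_min, x_max, y_max) box."""
    inside = np.zeros(len(pts), dtype=bool)
    for x0, y0, x1, y1 in boxes:
        inside |= ((pts[:, 0] >= x0) & (pts[:, 0] <= x1) &
                   (pts[:, 1] >= y0) & (pts[:, 1] <= y1))
    return inside


def static_mask(F, pts_prev, pts_cur, dynamic_boxes, loose_px=3.0, strict_px=1.0):
    """Return True for features kept as static (hypothetical combination rule).

    Assumption: a feature inside a detected dynamic-object box must satisfy
    the stricter epipolar threshold; all other features use the looser one.
    """
    dist = epipolar_distances(F, pts_prev, pts_cur)
    thresh = np.where(in_any_box(pts_cur, dynamic_boxes), strict_px, loose_px)
    return dist <= thresh
```

In a real pipeline the fundamental matrix would be estimated from the matches themselves (for example with RANSAC), and the boxes would come from the object detection thread; here both are simply passed in so that the sketch stays self-contained.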
Pages: 12