Can Semantic-based Filtering of Dynamic Objects improve Visual SLAM and Visual Odometry?

被引:1
作者
Costa, Leonardo Rezende [1 ]
Colombini, Esther Luna [1 ]
机构
[1] Unicamp Campinas, Inst Comp, Campinas, Brazil
来源
2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE | 2023年
关键词
V-SLAM; Visual Odometry; Semantic;
D O I
10.1109/LARS/SBR/WRE59448.2023.10332956
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This work introduces a novel approach to improve robot perception in dynamic environments using Semantic Filtering. The goal is to enhance Visual Simultaneous Localization and Mapping (V-SLAM) and Visual Odometry (VO) tasks by excluding feature points associated with moving objects. Four different approaches for semantic extraction, namely YOLOv3, DeepLabv3 with two different backbones, and Mask R-CNN, were evaluated. The framework was tested on various datasets, including KITTI, TUM and a simulated sequence generated on AirSim. The results demonstrated that the proposed semantic filtering significantly reduced estimation errors in VO tasks, with average error reduction ranging from 2.81% to 15.98%, while the results for V-SLAM were similar to the base work, especially for sequences with detected loops. Although fewer keypoints are used, the estimations benefit from the points excluded in VO. More experiments are needed to address the effects in VSLAM due to the presence of loop closure and the nature of the datasets.
引用
收藏
页码:567 / 572
页数:6
相关论文
共 12 条
  • [1] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [2] Chollet F., 2015, About us
  • [3] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [4] Vision meets robotics: The KITTI dataset
    Geiger, A.
    Lenz, P.
    Stiller, C.
    Urtasun, R.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) : 1231 - 1237
  • [5] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
  • [6] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
  • [7] ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras
    Mur-Artal, Raul
    Tardos, Juan D.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (05) : 1255 - 1262
  • [8] Redmon J., Darknet: Pre-trained models
  • [9] Redmon J., 2018, PREPRINT, DOI [DOI 10.48550/ARXIV.1804.02767, 10.48550/arXiv.1804.02767]
  • [10] Shah S, 2018, AIRSIM HIGH FIDELITY, DOI 10.1007/978-3-319-67361-5_40