Can Semantic-based Filtering of Dynamic Objects improve Visual SLAM and Visual Odometry?

被引:1
作者
Costa, Leonardo Rezende [1 ]
Colombini, Esther Luna [1 ]
机构
[1] Unicamp Campinas, Inst Comp, Campinas, Brazil
来源
2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE | 2023年
关键词
V-SLAM; Visual Odometry; Semantic;
D O I
10.1109/LARS/SBR/WRE59448.2023.10332956
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This work introduces a novel approach to improve robot perception in dynamic environments using Semantic Filtering. The goal is to enhance Visual Simultaneous Localization and Mapping (V-SLAM) and Visual Odometry (VO) tasks by excluding feature points associated with moving objects. Four different approaches for semantic extraction, namely YOLOv3, DeepLabv3 with two different backbones, and Mask R-CNN, were evaluated. The framework was tested on various datasets, including KITTI, TUM and a simulated sequence generated on AirSim. The results demonstrated that the proposed semantic filtering significantly reduced estimation errors in VO tasks, with average error reduction ranging from 2.81% to 15.98%, while the results for V-SLAM were similar to the base work, especially for sequences with detected loops. Although fewer keypoints are used, the estimations benefit from the points excluded in VO. More experiments are needed to address the effects in VSLAM due to the presence of loop closure and the nature of the datasets.
引用
收藏
页码:567 / 572
页数:6
相关论文
共 12 条
[1]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[2]  
Chollet F, 2015, Keras
[3]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[4]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[5]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
[6]  
Kapoor A., 2017, P FIELD SERV ROB RES, P621, DOI DOI 10.1007/978-3-319-67361-540
[7]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[8]   ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras [J].
Mur-Artal, Raul ;
Tardos, Juan D. .
IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (05) :1255-1262
[9]  
Redmon J., Darknet: Pre-trained models
[10]  
Redmon J., 2018, Yolov3: An incremental improvement