Visual SLAM Based on Object Detection Network: A Review br

被引:4
作者
Peng, Jiansheng [1 ,2 ]
Chen, Dunhua [1 ]
Yang, Qing [1 ]
Yang, Chengjun [2 ]
Xu, Yong [2 ]
Qin, Yong [2 ]
机构
[1] Guangxi Univ Sci & Technol, Coll Automat, Liuzhou 545000, Peoples R China
[2] Hechi Univ, Dept Artificial Intelligence & Mfg, Hechi 547000, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 77卷 / 03期
基金
中国国家自然科学基金;
关键词
semantic map; Object detection; visual SLAM; visual odometry; loop closure detection; LANGUAGE MODEL; NEURAL-NETWORK;
D O I
10.32604/cmc.2023.041898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual simultaneous localization and mapping (SLAM) is crucial in robotics and autonomous driving. However, traditional visual SLAM faces challenges in dynamic environments. To address this issue, researchers have proposed semantic SLAM, which combines object detection, semantic segmentation, instance segmentation, and visual SLAM. Despite the growing body of literature on semantic SLAM, there is currently a lack of comprehensive research on the integration of object detection and visual SLAM. Therefore, this study aims to gather information from multiple databases and review relevant literature using specific keywords. It focuses on visual SLAM based on object detection, covering different aspects. Firstly, it discusses the current research status and challenges in this field, highlighting methods for incorporating semantic information from object detection networks into mileage measurement, closed-loop detection, and map construction. It also compares the characteristics and performance of various visual SLAM object detection algorithms. Lastly, it provides an outlook on future research directions and emerging trends in visual SLAM. Research has shown that visual SLAM based on object detection has significant improvements compared to traditional SLAM in dynamic point removal, data association, point cloud segmentation, and other technologies. It can improve the robustness and accuracy of the entire SLAM system and can run in real time. With the continuous optimization of algorithms and the improvement of hardware level, object visual SLAM has great potential for development.
引用
收藏
页码:3209 / 3236
页数:28
相关论文
共 65 条
[1]  
Al-Rfou R, 2019, AAAI CONF ARTIF INTE, P3159
[2]  
[Anonymous], 2013, Pmlr, DOI DOI 10.48550/ARXIV.1211.5063
[3]  
Arisoy E, 2015, INT CONF ACOUST SPEE, P5421, DOI 10.1109/ICASSP.2015.7179007
[4]  
Baevski A, 2020, ADV NEUR IN, V33
[5]   A neural probabilistic language model [J].
Bengio, Y ;
Ducharme, R ;
Vincent, P ;
Jauvin, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155
[6]  
Chan W, 2016, INT CONF ACOUST SPEE, P4960, DOI 10.1109/ICASSP.2016.7472621
[7]   WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing [J].
Chen, Sanyuan ;
Wang, Chengyi ;
Chen, Zhengyang ;
Wu, Yu ;
Liu, Shujie ;
Chen, Zhuo ;
Li, Jinyu ;
Kanda, Naoyuki ;
Yoshioka, Takuya ;
Xiao, Xiong ;
Wu, Jian ;
Zhou, Long ;
Ren, Shuo ;
Qian, Yanmin ;
Qian, Yao ;
Zeng, Michael ;
Yu, Xiangzhan ;
Wei, Furu .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) :1505-1518
[8]   Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition [J].
Chen, X. ;
Ragni, A. ;
Liu, X. ;
Gales, M. J. F. .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :269-273
[9]   Filter Contribution Recycle: Boosting Model Pruning with Small Norm Filters [J].
Chen, Zehong ;
Xie, Zhonghua ;
Wang, Zhen ;
Xu, Tao ;
Zhang, Zhengrui .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (11) :3507-3522
[10]   Joint structured pruning and dense knowledge distillation for efficient transformer model compression [J].
Cui, Baiyun ;
Li, Yingming ;
Zhang, Zhongfei .
NEUROCOMPUTING, 2021, 458 :56-69