A Wearable Navigation Device for Visually Impaired People Based on the Real-Time Semantic Visual SLAM System

被引:29
作者
Chen, Zhuo [1 ,2 ]
Liu, Xiaoming [1 ,2 ]
Kojima, Masaru [3 ]
Huang, Qiang [1 ,2 ]
Arai, Tatsuo [1 ,2 ,4 ]
机构
[1] Beijing Inst Technol, State Key Lab Intelligent Control & Decis Complex, Beijing Adv Innovat Ctr Intelligent Robots & Syst, Key Lab Biomimet Robots & Syst,Minist Educ, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
[3] Osaka Univ, Dept Mat Engn Sci, Osaka 5608531, Japan
[4] Univ Electrocommun, Global Alliance Lab, Tokyo 1828585, Japan
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
wearable device; semantic segmentation; SLAM; assistance for visually impaired people; localization; semantic map;
D O I
10.3390/s21041536
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Wearable auxiliary devices for visually impaired people are highly attractive research topics. Although many proposed wearable navigation devices can assist visually impaired people in obstacle avoidance and navigation, these devices cannot feedback detailed information about the obstacles or help the visually impaired understand the environment. In this paper, we proposed a wearable navigation device for the visually impaired by integrating the semantic visual SLAM (Simultaneous Localization And Mapping) and the newly launched powerful mobile computing platform. This system uses an Image-Depth (RGB-D) camera based on structured light as the sensor, as the control center. We also focused on the technology that combines SLAM technology with the extraction of semantic information from the environment. It ensures that the computing platform understands the surrounding environment in real-time and can feed it back to the visually impaired in the form of voice broadcast. Finally, we tested the performance of the proposed semantic visual SLAM system on this device. The results indicate that the system can run in real-time on a wearable navigation device with sufficient accuracy.
引用
收藏
页码:1 / 14
页数:13
相关论文
共 34 条
[1]  
Agrawal M, 2008, LECT NOTES COMPUT SC, V5305, P102, DOI 10.1007/978-3-540-88693-8_8
[2]  
[Anonymous], 2008, COMPUT VIS IMAGE UND
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[5]  
Bowman Sean L., 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1722, DOI 10.1109/ICRA.2017.7989203
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]  
Dhivya K, 2019, PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), P130, DOI [10.1109/iccs45141.2019.9065816, 10.1109/ICCS45141.2019.9065816]
[8]   LSD-SLAM: Large-Scale Direct Monocular SLAM [J].
Engel, Jakob ;
Schoeps, Thomas ;
Cremers, Daniel .
COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 :834-849
[9]  
Forster C, 2014, IEEE INT CONF ROBOT, P15, DOI 10.1109/ICRA.2014.6906584
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778