A 6-DOFs event-based camera relocalization system by CNN-LSTM and image denoising

Cited by: 21
Authors
Jin, Yifan [1]
Yu, Lei [1]
Li, Guangqiang [1]
Fei, Shumin [2]
Affiliations
[1] Soochow Univ, Sch Mech & Elect Engn, Suzhou 215000, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210009, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Event image; Camera relocalization; Image denoising; 6-DOFs pose; Convolutional neural network; NETWORKS;
DOI
10.1016/j.eswa.2020.114535
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In current research on simultaneous localization and mapping (SLAM) systems, many traditional relocalization methods have been replaced by camera relocalization techniques based on convolutional neural networks (CNN) and long short-term memory (LSTM). However, when an event dataset is used to train the neural network, complex scenes are cluttered and the event images contain excessive noise. Both issues prevent the model from regressing the six-degrees-of-freedom (6-DOFs) pose accurately. This paper proposes a 6-DOFs camera relocalization method based on a CNN image denoising model and a CNN-LSTM network. First, the CNN image denoising model is used to remove the excessive noise points in complex scenes. Then, a network framework combining CNN and LSTM is used to train the event camera relocalization model to obtain better 6-DOFs pose accuracy. Finally, experimental simulations are performed on complex-scene datasets with and without image denoising. The experimental results show that the proposed method makes the model more robust during training and reduces abrupt changes (mutations); the trained model also predicts the pose with smaller error and at higher speed, which improves both the accuracy and the real-time performance of the camera relocalization model.
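The abstract reports a "smaller error" in the predicted 6-DOFs pose but does not say how that error is measured. The following is a minimal sketch of one common convention in pose-regression work, not the authors' evaluation code. It assumes the pose is stored as a 3-D position plus a quaternion in (w, x, y, z) order; the function name `pose_error` and this representation are illustrative assumptions.

```python
import math


def pose_error(pred_pos, pred_quat, true_pos, true_quat):
    """Return (position error, orientation error in degrees).

    Assumption: a pose is a 3-D position plus a quaternion (w, x, y, z).
    This representation is not stated in the abstract.
    """
    # Position error: Euclidean distance between the two positions.
    pos_err = math.dist(pred_pos, true_pos)

    def unit(q):
        # Normalize so the dot product below is a valid cosine.
        norm = math.sqrt(sum(c * c for c in q))
        return [c / norm for c in q]

    p, t = unit(pred_quat), unit(true_quat)

    # q and -q encode the same rotation, so take the absolute value.
    # Clamp to 1.0 to guard against floating-point overshoot in acos.
    dot = min(1.0, abs(sum(a * b for a, b in zip(p, t))))

    # The angle between two rotations is twice the angle between
    # their unit quaternions.
    ang_err = 2.0 * math.degrees(math.acos(dot))
    return pos_err, ang_err
```

For example, `pose_error((0, 0, 0), (1, 0, 0, 0), (3, 4, 0), (1, 0, 0, 0))` gives a position error of 5.0 and an orientation error of 0.0, since the two quaternions are identical.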
Pages: 12