A novel fusing semantic- and appearance-based descriptors for visual loop closure detection

被引:7
作者
Wu, Peng [1 ]
Wang, Junxiao [1 ]
Wang, Chen [1 ]
Zhang, Lei [2 ]
Wang, Yuanzhi [3 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Mech Engn & Automat, Hangzhou 310018, Peoples R China
[2] Northeast Forestry Univ, Coll Mech & Elect Engn, Harbin 150040, Peoples R China
[3] Anqing Normal Univ, Sch Comp & Infomat, Anqing 246011, Peoples R China
来源
OPTIK | 2021年 / 243卷
关键词
Simultaneous localisation and mapping (SLAM); Pose estimation; Vector of locally aggregated descriptors (VLAD); Semantic information; Two nearest neighbour local sensor tensor (TNNLoST); SLAM;
D O I
10.1016/j.ijleo.2021.167230
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Loop-closure detection plays an important role in visual simultaneous localisation and map-ping;it is an independent part of the visual odometer and can effectively reduce its accumulated error, in addition to helping with loop-closure detection for relocalisation. With the development of deep learning methods in recent years, the training models of convolutional neural networks for major data sets have been improved for loop-closure detection. Presently, some high-level engineering problems still rely on auxiliary equipment, such as panoramic cameras and radar lasers, which greatly increase the expensive extra cost; however, owing to the extreme appearance and viewpoint changes involved in such problems, loop-closure detection that relies on two-dimensional images is not applicable. Based on the two nearest neighbour vector of locally aggregated descriptors (TNNVLAD) method, a novel feature descriptor called two nearest neighbour local sensor tensor(TNNLoST) is proposed herein by combining the semantic features of high-level neural networks with dense descriptors. This approach introduces a semantic concept similar to human cognition for the surrounding environment, thus enabling better understanding of the environment. The proposed method was applied to publicly available benchmark datasets to show its performance.
引用
收藏
页数:8
相关论文
共 18 条
  • [1] Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
  • [2] Choi S, 2015, PROC CVPR IEEE, P5556, DOI 10.1109/CVPR.2015.7299195
  • [3] Appearance-only SLAM at large scale with FAB-MAP 2.0
    Cummins, Mark
    Newman, Paul
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (09) : 1100 - 1123
  • [4] Unsupervised learning to detect loops using deep neural networks for visual SLAM system
    Gao, Xiang
    Zhang, Tao
    [J]. AUTONOMOUS ROBOTS, 2017, 41 (01) : 1 - 18
  • [5] Garg S, 2018, ROBOTICS: SCIENCE AND SYSTEMS XIV
  • [6] Locality-Sensitive Hashing for Chi2 Distance
    Gorisse, David
    Cord, Matthieu
    Precioso, Frederic
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 402 - 409
  • [7] Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039
  • [8] A deep-learning real-time visual SLAM system based on multi-task feature extraction network and self-supervised feature points
    Li, Guangqiang
    Yu, Lei
    Fei, Shumin
    [J]. MEASUREMENT, 2021, 168
  • [9] Liu Y, 2012, IEEE INT C INT ROBOT, P1051, DOI 10.1109/IROS.2012.6386145
  • [10] Image classification based on improved VLAD
    Long, Xianzhong
    Lu, Hongtao
    Peng, Yong
    Wang, Xianzhong
    Feng, Shaokun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (10) : 5533 - 5555