A novel fusing semantic- and appearance-based descriptors for visual loop closure detection

被引：7

作者：

Wu, Peng ^{[1
]}

Wang, Junxiao ^{[1
]}

Wang, Chen ^{[1
]}

Zhang, Lei ^{[2
]}

Wang, Yuanzhi ^{[3
]}

机构：

[1] Zhejiang Sci Tech Univ, Sch Mech Engn & Automat, Hangzhou 310018, Peoples R China

[2] Northeast Forestry Univ, Coll Mech & Elect Engn, Harbin 150040, Peoples R China

[3] Anqing Normal Univ, Sch Comp & Infomat, Anqing 246011, Peoples R China

来源：

OPTIK | 2021年 / 243卷

关键词：

Simultaneous localisation and mapping (SLAM); Pose estimation; Vector of locally aggregated descriptors (VLAD); Semantic information; Two nearest neighbour local sensor tensor (TNNLoST); SLAM;

D O I：

10.1016/j.ijleo.2021.167230

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

Loop-closure detection plays an important role in visual simultaneous localisation and map-ping;it is an independent part of the visual odometer and can effectively reduce its accumulated error, in addition to helping with loop-closure detection for relocalisation. With the development of deep learning methods in recent years, the training models of convolutional neural networks for major data sets have been improved for loop-closure detection. Presently, some high-level engineering problems still rely on auxiliary equipment, such as panoramic cameras and radar lasers, which greatly increase the expensive extra cost; however, owing to the extreme appearance and viewpoint changes involved in such problems, loop-closure detection that relies on two-dimensional images is not applicable. Based on the two nearest neighbour vector of locally aggregated descriptors (TNNVLAD) method, a novel feature descriptor called two nearest neighbour local sensor tensor(TNNLoST) is proposed herein by combining the semantic features of high-level neural networks with dense descriptors. This approach introduces a semantic concept similar to human cognition for the surrounding environment, thus enabling better understanding of the environment. The proposed method was applied to publicly available benchmark datasets to show its performance.

引用

页数：8

共 18 条

[1]

Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]

[2]

Choi S, 2015, PROC CVPR IEEE, P5556, DOI 10.1109/CVPR.2015.7299195

[3] Appearance-only SLAM at large scale with FAB-MAP 2.0 [J].

Cummins, Mark ;

Newman, Paul .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (09) :1100-1123

[4] Unsupervised learning to detect loops using deep neural networks for visual SLAM system [J].

Gao, Xiang ;

Zhang, Tao .

AUTONOMOUS ROBOTS, 2017, 41 (01) :1-18

[5]

Garg S, 2018, ROBOTICS: SCIENCE AND SYSTEMS XIV

[6] Locality-Sensitive Hashing for Chi2 Distance [J].

Gorisse, David ;

Cord, Matthieu ;

Precioso, Frederic .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) :402-409

[7]

Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039

[8] A deep-learning real-time visual SLAM system based on multi-task feature extraction network and self-supervised feature points [J].

Li, Guangqiang ;

Yu, Lei ;

Fei, Shumin .

MEASUREMENT, 2021, 168

[9]

Liu Y, 2012, IEEE INT C INT ROBOT, P1051, DOI 10.1109/IROS.2012.6386145

[10] Image classification based on improved VLAD [J].

Long, Xianzhong ;

Lu, Hongtao ;

Peng, Yong ;

Wang, Xianzhong ;

Feng, Shaokun .

MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (10) :5533-5555

← 1 2 →