Visual localization plays an essential role in a wide range of fields. Indirect learning-based methods achieve excellent performance, but they require training in the target scene before localization can be performed. To achieve deep scene-independent localization, we first propose a representation, the residual coordinate map, defined between a pair of images. Based on this representation, we put forward a network, SILocNet, that outputs the proposed residual coordinate map. The network consists of feature extraction, multi-level feature fusion, and a transformer-based coordinate decoder. Moreover, to handle dynamic scenes, we introduce an additional segmentation branch that distinguishes static from dynamic regions to improve the network's perception. With SILocNet in place, we present a cascaded localizer design that reduces accumulated error, together with a simple mathematical analysis of the cascaded localizers. To evaluate our algorithm, we conduct experiments on the static 7 Scenes and ScanNet datasets and the dynamic TUM RGB-D dataset. In particular, we train the network on ScanNet and test it on 7 Scenes and TUM RGB-D to demonstrate its generalization ability. In all experiments, our method outperforms existing methods. Additionally, we discuss the effects of the cascaded localizer design, feature fusion, the transformer-based coordinate decoder, and the segmentation loss.
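To make the overall pipeline concrete, the following is a minimal PyTorch sketch of the two ideas named above: a pair-input network that regresses a residual coordinate map (with a segmentation head for dynamic regions), wrapped in a cascaded refinement loop. All names here (SILocNetSketch, cascaded_localize, warp_fn) are hypothetical illustrations under assumed shapes and layer sizes, not the authors' actual architecture or released code.

```python
# Hypothetical sketch only; layer sizes, heads, and the warp step are assumptions.
import torch
import torch.nn as nn


class SILocNetSketch(nn.Module):
    """Stand-in for SILocNet: shared CNN features over an image pair, a
    transformer layer as a lightweight coordinate decoder, and two heads."""

    def __init__(self, dim=64):
        super().__init__()
        # Joint feature extraction over the concatenated pair (6 = 2 x RGB),
        # downsampling by a factor of 4.
        self.backbone = nn.Sequential(
            nn.Conv2d(6, dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(dim, dim, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.decoder = nn.TransformerEncoderLayer(
            d_model=dim, nhead=4, batch_first=True)
        self.coord_head = nn.Conv2d(dim, 3, 1)  # 3-channel residual coordinate map
        self.seg_head = nn.Conv2d(dim, 1, 1)    # static/dynamic mask logits

    def forward(self, img_a, img_b):
        f = self.backbone(torch.cat([img_a, img_b], dim=1))  # (B, C, H', W')
        b, c, h, w = f.shape
        tokens = self.decoder(f.flatten(2).transpose(1, 2))  # (B, H'*W', C)
        f = tokens.transpose(1, 2).reshape(b, c, h, w)
        return self.coord_head(f), torch.sigmoid(self.seg_head(f))


def cascaded_localize(model, query, reference, warp_fn, stages=3):
    """Cascaded localizer idea: run the model several times, warping the
    query by the current estimate so each later stage only has to correct
    a smaller residual, which limits the accumulated error."""
    total = None
    for _ in range(stages):
        residual, _mask = model(query, reference)
        total = residual if total is None else total + residual
        query = warp_fn(query, residual)  # hypothetical warping step
    return total


# Usage with an identity placeholder for the warp; a real system would
# reproject the query using the estimated coordinates and camera model.
model = SILocNetSketch()
a, b = torch.randn(1, 3, 64, 64), torch.randn(1, 3, 64, 64)
coords = cascaded_localize(model, a, b, warp_fn=lambda q, r: q)
print(coords.shape)  # torch.Size([1, 3, 16, 16])
```

The cascade mirrors the intuition behind the paper's analysis: if each stage removes a fixed fraction of the remaining error, the residual shrinks geometrically with the number of stages.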