Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization

被引:0
作者
Istighfarin, Nanda Febri [1 ]
Jo, Hyunggi [1 ]
机构
[1] Jeonbuk Natl Univ, Div Elect Engn, Jeonju 54896, South Korea
基金
新加坡国家研究基金会;
关键词
Attention network; computer vision; edge detector; scene coordinate regression; visual localization;
D O I
10.1007/s12555-024-0487-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual localization determines an agent's precise position and orientation within an environment using visual data. It has become a critical task in the field of robotics, particularly in applications such as autonomous navigation. This is due to the ability to determine an agent's pose using cost-effective sensors such as RGB cameras. Recent methods in visual localization employ scene coordinate regression to determine the agent's pose. However, these methods face challenges as they attempt to regress 2D-3D correspondences across the entire image region, despite not all regions providing useful information. To address this issue, we introduce an attention network that selectively targets informative regions of the image. Using this network, we identify the highest-scoring features to improve the feature selection process and combine the result with edge detection. This integration ensures that the features chosen for the training buffer are located within robust regions, thereby improving 2D-3D correspondence and overall localization performance. Our approach was tested on the outdoor benchmark dataset, demonstrating superior results compared to previous methods.
引用
收藏
页码:418 / 428
页数:11
相关论文
共 40 条
[12]   ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].
Dai, Angela ;
Chang, Angel X. ;
Savva, Manolis ;
Halber, Maciej ;
Funkhouser, Thomas ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443
[13]   SuperPoint: Self-Supervised Interest Point Detection and Description [J].
DeTone, Daniel ;
Malisiewicz, Tomasz ;
Rabinovich, Andrew .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :337-349
[14]   Visual Localization via Few-Shot Scene Region Classification [J].
Dong, Siyan ;
Wang, Shuzhe ;
Zhuang, Yixin ;
Kannala, Juho ;
Pollefeys, Marc ;
Chen, Baoquan .
2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, :393-402
[15]  
En S., 2018, Proc. of the European Conference on Computer Vision (ECCV) Workshops
[16]   Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization [J].
Ge, Yixiao ;
Wang, Haibo ;
Zhu, Feng ;
Zhao, Rui ;
Li, Hongsheng .
COMPUTER VISION - ECCV 2020, PT IV, 2020, 12349 :369-386
[17]  
Istighfarin N F., 2024, Map-free Sampling Module for Scene Coordinate Regression Network in Visual Localization
[18]   SOLUTION FOR BEST ROTATION TO RELATE 2 SETS OF VECTORS [J].
KABSCH, W .
ACTA CRYSTALLOGRAPHICA SECTION A, 1976, 32 (SEP1) :922-923
[19]   Geometric loss functions for camera pose regression with deep learning [J].
Kendall, Alex ;
Cipolla, Roberto .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6555-6564
[20]  
Kendall A, 2016, IEEE INT CONF ROBOT, P4762, DOI 10.1109/ICRA.2016.7487679