SAM-Net: Self-Attention based Feature Matching with Spatial Transformers and Knowledge Distillation

被引：4

作者：

Kelenyi, Benjamin ^{[1
]}

Domsa, Victor ^{[1
]}

Tamas, Levente ^{[1
]}

机构：

[1] Tech Univ Cluj Napoca, Memorandumului 28, Cluj Napoca 400114, Romania

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 242卷

关键词：

Geometric features extraction; Self-attention; Knowledge-distillation; Spatial transformers; Pose estimation;

D O I：

10.1016/j.eswa.2023.122804

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this research paper, we introduce a novel approach to enhance the performance of 2D feature matching and pose estimation through the integration of a hierarchical attention mechanism and knowledge distillation. Our proposed hierarchical attention mechanism operates at multiple scales, enabling both global context awareness and precise matching of 2D features, which is crucial for various computer vision tasks. To further improve our model's performance, we incorporate insights from an existing model PixLoc (Sarlin et al., 2021) through knowledge distillation, effectively acquiring its behavior and capabilities by ignoring dynamic objects. SAM-Net outperforms state-of-the-art methods, validated on both indoor and outdoor public datasets. For the indoor dataset, our approach achieves remarkable AUC (5 degrees /10 degrees /20 degrees) scores of 55.31/71.70/83.37. Similarly, for the outdoor dataset, we demonstrate outstanding AUC values of 26.01/46.44/63.61. Furthermore, SAM-Net achieves top ranking among published methods in two public visual localization benchmarks, highlighting the real benefits of the proposed method. The code and test suite can be accessed at link.1

引用

页数：12

共 61 条

[51] DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo
Tola, Engin
Lepetit, Vincent
Fua, Pascal
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (05) : 815 - 830
[52] Learning Accurate Dense Correspondences and When to Trust Them
Truong, Prune
Danelljan, Martin
Van Gool, Luc
Timofte, Radu
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5710 - 5720
[53] Priority and age specific vaccination algorithm for the pandemic diseases: a comprehensive parametric prediction model
Tutsoy, Onder
Tanrikulu, Mahmud Yusuf
[J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
[54] Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks
Wang, Lin
Yoon, Kuk-Jin
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 3048 - 3068
[55] MatchFormer: Interleaving Attention in Transformers for Feature Matching
Wang, Qing
Zhang, Jiaming
Yang, Kailun
Peng, Kunyu
Stiefelhagen, Rainer
[J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 256 - 273
[56] Xie T., 2023, arXiv
[57] LIFT: Learned Invariant Feature Transform
Yi, Kwang Moo
Trulls, Eduard
Lepetit, Vincent
Fua, Pascal
[J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 467 - 483
[58] Learning Two-View Correspondences and Geometry Using Order-Aware Network
Zhang, Jiahui
Sun, Dawei
Luo, Zixin
Yao, Anbang
Zhou, Lei
Shen, Tianwei
Chen, Yurong
Quan, Long
Liao, Hongen
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5844 - 5853
[59] Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
Zhang, Zichao
Sattler, Torsten
Scaramuzza, Davide
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (04) : 821 - 844
[60] KFNet: Learning Temporal Camera Relocalization using Kalman Filtering
Zhou, Lei
Luo, Zixin
Shen, Tianwei
Zhang, Jiahui
Zhen, Mingmin
Yao, Yao
Fang, Tian
Quan, Long
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4918 - 4927

← 1 2 3 4 5 6 7 →