SAM-Net: Self-Attention based Feature Matching with Spatial Transformers and Knowledge Distillation

被引:4
作者
Kelenyi, Benjamin [1 ]
Domsa, Victor [1 ]
Tamas, Levente [1 ]
机构
[1] Tech Univ Cluj Napoca, Memorandumului 28, Cluj Napoca 400114, Romania
关键词
Geometric features extraction; Self-attention; Knowledge-distillation; Spatial transformers; Pose estimation;
D O I
10.1016/j.eswa.2023.122804
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this research paper, we introduce a novel approach to enhance the performance of 2D feature matching and pose estimation through the integration of a hierarchical attention mechanism and knowledge distillation. Our proposed hierarchical attention mechanism operates at multiple scales, enabling both global context awareness and precise matching of 2D features, which is crucial for various computer vision tasks. To further improve our model's performance, we incorporate insights from an existing model PixLoc (Sarlin et al., 2021) through knowledge distillation, effectively acquiring its behavior and capabilities by ignoring dynamic objects. SAM-Net outperforms state-of-the-art methods, validated on both indoor and outdoor public datasets. For the indoor dataset, our approach achieves remarkable AUC (5 degrees /10 degrees /20 degrees) scores of 55.31/71.70/83.37. Similarly, for the outdoor dataset, we demonstrate outstanding AUC values of 26.01/46.44/63.61. Furthermore, SAM-Net achieves top ranking among published methods in two public visual localization benchmarks, highlighting the real benefits of the proposed method. The code and test suite can be accessed at link.1
引用
收藏
页数:12
相关论文
共 61 条
  • [51] DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo
    Tola, Engin
    Lepetit, Vincent
    Fua, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (05) : 815 - 830
  • [52] Learning Accurate Dense Correspondences and When to Trust Them
    Truong, Prune
    Danelljan, Martin
    Van Gool, Luc
    Timofte, Radu
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5710 - 5720
  • [53] Priority and age specific vaccination algorithm for the pandemic diseases: a comprehensive parametric prediction model
    Tutsoy, Onder
    Tanrikulu, Mahmud Yusuf
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [54] Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks
    Wang, Lin
    Yoon, Kuk-Jin
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 3048 - 3068
  • [55] MatchFormer: Interleaving Attention in Transformers for Feature Matching
    Wang, Qing
    Zhang, Jiaming
    Yang, Kailun
    Peng, Kunyu
    Stiefelhagen, Rainer
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 256 - 273
  • [56] Xie T., 2023, arXiv
  • [57] LIFT: Learned Invariant Feature Transform
    Yi, Kwang Moo
    Trulls, Eduard
    Lepetit, Vincent
    Fua, Pascal
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 467 - 483
  • [58] Learning Two-View Correspondences and Geometry Using Order-Aware Network
    Zhang, Jiahui
    Sun, Dawei
    Luo, Zixin
    Yao, Anbang
    Zhou, Lei
    Shen, Tianwei
    Chen, Yurong
    Quan, Long
    Liao, Hongen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5844 - 5853
  • [59] Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis
    Zhang, Zichao
    Sattler, Torsten
    Scaramuzza, Davide
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (04) : 821 - 844
  • [60] KFNet: Learning Temporal Camera Relocalization using Kalman Filtering
    Zhou, Lei
    Luo, Zixin
    Shen, Tianwei
    Zhang, Jiahui
    Zhen, Mingmin
    Yao, Yao
    Fang, Tian
    Quan, Long
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4918 - 4927