MatchFormer: Interleaving Attention in Transformers for Feature Matching

Cited by: 24
Authors
Wang, Qing [1]
Zhang, Jiaming [1]
Yang, Kailun [1]
Peng, Kunyu [1]
Stiefelhagen, Rainer [1]
Affiliations
[1] Karlsruhe Inst Technol, Karlsruhe, Germany
Source
COMPUTER VISION - ACCV 2022, PT III | 2023 / Vol. 13843
Keywords
Feature matching; Vision transformers
DOI
10.1007/978-3-031-26313-2_16
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Local feature matching is a computationally intensive task at the subpixel level. While detector-based methods coupled with feature descriptors struggle in low-texture scenes, CNN-based methods with a sequential extract-to-match pipeline fail to make use of the matching capacity of the encoder and tend to overburden the decoder for matching. In contrast, we propose a novel hierarchical extract-and-match transformer, termed MatchFormer. Inside each stage of the hierarchical encoder, we interleave self-attention for feature extraction and cross-attention for feature matching, yielding a human-intuitive extract-and-match scheme. Such a match-aware encoder relieves the overloaded decoder and makes the model highly efficient. Further, combining self- and cross-attention on multi-scale features in a hierarchical architecture improves matching robustness, particularly in low-texture indoor scenes or with less outdoor training data. Thanks to this strategy, MatchFormer is a multi-win solution in efficiency, robustness, and precision. Compared to the previous best method in indoor pose estimation, our lite MatchFormer requires only 45% of the GFLOPs, yet achieves a +1.3% precision gain and a 41% running speed boost. The large MatchFormer reaches state-of-the-art on four different benchmarks, including indoor pose estimation (ScanNet), outdoor pose estimation (MegaDepth), homography estimation and image matching (HPatches), and visual localization (InLoc).
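The interleaving idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`attention`, `interleaved_stage`), the `layout` parameter, the single-head attention without learned projections, and the toy feature sizes are all assumptions made for illustration. It only shows the ordering the abstract describes, where each image's features first attend to themselves (extraction) and then to the other image's features (matching) within one stage.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(query_feats, source_feats):
    # Single-head scaled dot-product attention with a residual connection.
    # Assumption: no learned query/key/value projections, for brevity.
    d = query_feats.shape[-1]
    weights = softmax(query_feats @ source_feats.T / np.sqrt(d))
    return query_feats + weights @ source_feats

def interleaved_stage(feats_a, feats_b, layout=("self", "cross")):
    # Hypothetical stage: apply the attention types listed in `layout` in order.
    for kind in layout:
        if kind == "self":
            # Each image attends to its own tokens (feature extraction).
            feats_a, feats_b = attention(feats_a, feats_a), attention(feats_b, feats_b)
        else:
            # Each image attends to the other image's tokens (feature matching).
            feats_a, feats_b = attention(feats_a, feats_b), attention(feats_b, feats_a)
    return feats_a, feats_b

# Toy inputs: 6 tokens for image A and 5 tokens for image B, 8 channels each.
rng = np.random.default_rng(0)
feats_a = rng.normal(size=(6, 8))
feats_b = rng.normal(size=(5, 8))
out_a, out_b = interleaved_stage(feats_a, feats_b)
```

With the default layout, the output features of image A depend on image B already inside the encoder stage; with `layout=("self",)` they do not, which corresponds to the extract-only encoder that the abstract contrasts against.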
Pages: 256-273
Page count: 18