MatchFormer: Interleaving Attention in Transformers for Feature Matching

被引:24
作者
Wang, Qing [1 ]
Zhang, Jiaming [1 ]
Yang, Kailun [1 ]
Peng, Kunyu [1 ]
Stiefelhagen, Rainer [1 ]
机构
[1] Karlsruhe Inst Technol, Karlsruhe, Germany
来源
COMPUTER VISION - ACCV 2022, PT III | 2023年 / 13843卷
关键词
Feature matching; Vision transformers;
D O I
10.1007/978-3-031-26313-2_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Local feature matching is a computationally intensive task at the subpixel level. While detector-based methods coupled with feature descriptors struggle in low-texture scenes, CNN-based methods with a sequential extract-to-match pipeline, fail to make use of the matching capacity of the encoder and tend to overburden the decoder for matching. In contrast, we propose a novel hierarchical extract-and-match transformer, termed as MatchFormer. Inside each stage of the hierarchical encoder, we interleave self-attention for feature extraction and cross-attention for feature matching, yielding a human-intuitive extract-and-match scheme. Such a match-aware encoder releases the overloaded decoder and makes the model highly efficient. Further, combining self- and cross-attention on multi-scale features in a hierarchical architecture improves matching robustness, particularly in low-texture indoor scenes or with less outdoor training data. Thanks to such a strategy, MatchFormer is a multi-win solution in efficiency, robustness, and precision. Compared to the previous best method in indoor pose estimation, our lite MatchFormer has only 45% GFLOPs, yet achieves a +1.3% precision gain and a 41% running speed boost. The large MatchFormer reaches state-of-the-art on four different benchmarks, including indoor pose estimation (ScanNet), outdoor pose estimation (MegaDepth), homography estimation and image matching (HPatch), and visual localization (InLoc).
引用
收藏
页码:256 / 273
页数:18
相关论文
共 50 条
[41]   A Feature Matching Method For Simultaneous Localization And Mapping [J].
Chen Xiangkui ;
Jiang Min ;
Zuo Liangyu ;
Jiang Jian .
PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, :1091-1094
[42]   A new feature matching algorithm of CT slices [J].
Yang Shi-da ;
Yi Ya-lin ;
Shan Zhi-yong ;
Li Qing-hua .
INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING 2011, 2011, 24 :267-271
[43]   Adaptive feature matching network for object occlusion [J].
Mao L. ;
Su H. ;
Yang D. .
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (22) :3345-3356
[44]   Geometric Expansion for Local Feature Analysis and Matching [J].
Farhan, Erez ;
Hagege, Rami .
SIAM JOURNAL ON IMAGING SCIENCES, 2015, 8 (04) :2771-2813
[45]   Location recognition based on local feature matching [J].
Gao, Zhuoyue ;
Chai, Lin ;
Jin, Lizuo .
MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
[46]   Feature Matching Based on Triangle Guidance and Constraints [J].
Liu, Hongmin ;
Zhang, Hongya ;
Wang, Zhiheng ;
Zheng, Yiming .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (08)
[47]   Application of improved SURF algorithm to feature matching [J].
Zhao, Li-Rong ;
Zhu, Wei ;
Cao, Yong-Gang ;
Liu, Yu-Han ;
Sun, Jun-Xi .
Zhu, W. (zw288515@sohu.com), 1600, Chinese Academy of Sciences (21) :3263-3271
[48]   Object Tracking Based on Local Feature Matching [J].
Xiao, Qinkun ;
Liu, Xiangjun ;
Liu, Mina .
2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 1, 2012, :399-402
[49]   FAST and FLANN for feature matching based on SURF [J].
Huang, Shiguo ;
Sun, Guobing ;
Li, Minglun .
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, :1584-1589
[50]   Robust Feature Matching with Spatial Smoothness Constraints [J].
Huang, Xu ;
Wan, Xue ;
Peng, Daifeng .
REMOTE SENSING, 2020, 12 (19) :1-20