Multi-scale parallel gated local feature transformer

被引:0
|
作者
Qu, Hangzhou [1 ]
Hu, Zhuhua [1 ]
Wu, Jiaqi [1 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国国家自然科学基金;
关键词
Visual SLAM; Multi-scale; Feature matching; Linear transformation; Gated convolution;
D O I
10.1038/s41598-025-91857-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Visual Simultaneous Localization and Mapping (VSLAM) is a crucial technology for autonomous mobile vision robots. However, existing methods often suffer from low localization accuracy and poor robustness in scenarios with significant scale variations and low-texture environments, primarily due to insufficient feature extraction and reduced matching precision. To address these challenges, this paper proposes an improved multi-scale local feature matching algorithm based on LoFTR, named MSpGLoFTR. First, we introduce a Multi-Scale Local Attention Module (MSLAM), which achieves feature fusion and resolution alignment through multi-scale window partitioning and a shared multi-layer perceptron (MLP). Second, a Multi-Scale Parallel Attention Module is designed to capture features across various scales, enhancing the model's adaptability to large-scale features and highly similar pixel regions. Finally, a Gated Convolutional Network (GCN) mechanism is incorporated to dynamically adjust weights, emphasizing key features while suppressing background noise, thereby further improving matching precision and robustness. Experimental results demonstrate that MSpGLoFTR outperforms LoFTR in terms of matching precision, relative pose estimation performance, and adaptability to complex scenarios. Notably, it excels in environments with significant illumination changes, scale variations, and viewpoint shifts. This makes MSpGLoFTR an efficient and robust feature matching solution for complex vision tasks.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Gated Multi-Scale Transformer for Temporal Action Localization
    Yang, Jin
    Wei, Ping
    Ren, Ziyang
    Zheng, Nanning
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5705 - 5717
  • [2] Transformer based on multi-scale local feature for colon cancer histopathological image classification
    Fu, Zhibing
    Chen, Qingkui
    Wang, Mingming
    Huang, Chen
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [3] Multi-Scale Feature Attention-DEtection TRansformer: Multi-Scale Feature Attention for security check object detection
    Sima, Haifeng
    Chen, Bailiang
    Tang, Chaosheng
    Zhang, Yudong
    Sun, Junding
    IET COMPUTER VISION, 2024, 18 (05) : 613 - 625
  • [4] NLFFTNet: A non-local feature fusion transformer network for multi-scale object detection
    Zeng, Kai
    Ma, Qian
    Wu, Jiawen
    Xiang, Sijia
    Shen, Tao
    Zhang, Lei
    NEUROCOMPUTING, 2022, 493 : 15 - 27
  • [5] FPDT: a multi-scale feature pyramidal object detection transformer
    Huang, Kailai
    Wen, Mi
    Wang, Chen
    Ling, Lina
    JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (02)
  • [6] Gated CNN: Integrating multi-scale feature layers for object detection
    Yuan, Jin
    Xiong, Heng-Chang
    Xiao, Yi
    Guan, Weili
    Wang, Meng
    Hong, Richang
    Li, Zhi-Yong
    PATTERN RECOGNITION, 2020, 105
  • [7] Context Contrasted Feature and Gated Multi-scale Aggregation for Scene Segmentation
    Ding, Henghui
    Jiang, Xudong
    Shuai, Bing
    Liu, Ai Qun
    Wang, Gang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2393 - 2402
  • [8] Multi-scale frequency feature fusion transformer for pediatric echocardiography analysis
    Zhao, Cheng
    Liu, Yuanlin
    Chen, Weiling
    Xiang, Zhuo
    Liu, Yiyao
    Xia, Bei
    Qin, Jing
    Wang, Tianfu
    Lei, Baiying
    APPLIED SOFT COMPUTING, 2025, 173
  • [9] MULTI-SCALE TRANSFORMER-BASED FEATURE COMBINATION FOR IMAGE RETRIEVAL
    Roig Mari, Carlos
    Varas Gonzalez, David
    Bou-Balust, Elisenda
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3166 - 3170
  • [10] Multi-scale feature fusion for pavement crack detection based on Transformer
    Yang, Yalong
    Niu, Zhen
    Su, Liangliang
    Xu, Wenjing
    Wang, Yuanhang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 14920 - 14937