MULTI-SCALE TRANSFORMER-BASED FEATURE COMBINATION FOR IMAGE RETRIEVAL

被引:3
|
作者
Roig Mari, Carlos [1 ]
Varas Gonzalez, David [1 ]
Bou-Balust, Elisenda [1 ]
机构
[1] Apple, Barcelona, Spain
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年
关键词
Image retrieval; Attention; Multi-scale; Transformer; Feature combination;
D O I
10.1109/ICIP46576.2022.9897512
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image retrieval consists in the selection of a set of images from a database, based on their visual similarity to a given query image. To compute this visual similarity, a global or multiple local features are required per image. In this work, we present a system that uses multiple high-level semantic feature maps from a single image for global feature generation. Our method uses feature maps extracted from a backbone architecture at different spatial resolutions, enhancing the semantic information relevant at each scale using a transformer-based approach. Then, these feature maps are combined using a self-attention mechanism generating a global feature. The main contribution of this work is a novel global feature generation method, which outperforms current state-of-the-art techniques that use either global-only or a combination of global and local features. This result is assessed using the Google Landmarks v2 resulting in an improvement of 3.8% in mAP@100. Also, we assess our system using Revisiting Oxford and Paris datasets, obtaining an improvement up to 3.7% in mAP compared with current methods.
引用
收藏
页码:3166 / 3170
页数:5
相关论文
共 50 条
  • [1] Transformer-Based Multi-Scale Feature Remote Sensing Image Classification Model
    Sun, Ting
    Li, Jun
    Zhou, Xiangrui
    Chen, Zan
    IEEE ACCESS, 2025, 13 : 34095 - 34104
  • [2] Transformer-based Multi-scale Underwater Image Enhancement Network
    Yang, Ai-Ping
    Fang, Si-Jie
    Shao, Ming-Fu
    Zhang, Teng-Fei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (12): : 1696 - 1705
  • [3] TFNet: Transformer-Based Multi-Scale Feature Fusion Forest Fire Image Detection Network
    Liu, Hongying
    Zhang, Fuquan
    Xu, Yiqing
    Wang, Junling
    Lu, Hong
    Wei, Wei
    Zhu, Jun
    FIRE-SWITZERLAND, 2025, 8 (02):
  • [4] Transformer-Based Multi-Scale Feature Integration Network for Video Saliency Prediction
    Zhou, Xiaofei
    Wu, Songhe
    Shi, Ran
    Zheng, Bolun
    Wang, Shuai
    Yin, Haibing
    Zhang, Jiyong
    Yan, Chenggang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7696 - 7707
  • [5] Transformer-based multi-scale feature fusion network for remote sensing change detection
    Liang, Shike
    Hua, Zhen
    Li, Jinjiang
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [6] Transformer-Based Multi-scale Optimization Network for Low-Light Image Enhancement
    Niu Y.
    Lin X.
    Xu H.
    Li Y.
    Chen Y.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (06): : 511 - 529
  • [7] A Transformer-based method to simulate multi-scale soil moisture
    Liu, Yangxiaoyue
    Xin, Ying
    Yin, Cong
    JOURNAL OF HYDROLOGY, 2025, 655
  • [8] MUSTER: A Multi-Scale Transformer-Based Decoder for Semantic Segmentation
    Xu, Jing
    Shi, Wentao
    Gao, Pan
    Li, Qizhu
    Wang, Zhengwei
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 202 - 212
  • [9] Multi-scale Adaptive Feature Fusion Hashing for Image Retrieval
    Jiang, Xiangkui
    Hu, Fei
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [10] Transformer based on multi-scale local feature for colon cancer histopathological image classification
    Fu, Zhibing
    Chen, Qingkui
    Wang, Mingming
    Huang, Chen
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100