A Learning to Rank framework applied to text-image retrieval

Cited by: 0
Authors
David Buffoni
Sabrina Tollari
Patrick Gallinari
Affiliation
[1] Université Pierre et Marie CURIE - Paris 6 / LIP6,
Source
Multimedia Tools and Applications | 2012 / Vol. 60
Keywords
Learning to Rank; Text-image retrieval; OWPC; Visuo-textual fusion; Pooling for Learning to Rank;
DOI
Not available
Abstract
We present a framework based on a Learning to Rank setting for a text-image retrieval task. In Information Retrieval, the goal is to compute the similarity between a document and a user query. In text-image retrieval, where several similarities exist, human intervention is often needed to decide how to combine them. With the Learning to Rank approach, by contrast, the similarities are combined automatically. Learning to Rank is a paradigm in which a learned scoring function produces a ranked list of images for a given user query. Such scoring functions are generally a combination of similarities between a document and a query. In the past, Learning to Rank algorithms were successfully applied to text retrieval, where they outperformed baselines such as BM25 or TF-IDF. This inspired us to apply our state-of-the-art algorithm, OWPC (Usunier et al. 2009), to the text-image retrieval task. At this time, no benchmarks are available, so we present a framework for building one. The algorithm is validated empirically on the constructed dataset through a comparison with typical text-image retrieval similarities. In both settings, visual only and combined text and visual, our algorithm performs better than a simple baseline.
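To make the idea of a learned combination of similarities concrete, here is a minimal sketch in Python. It is not OWPC and not the authors' method: it swaps in a generic pairwise hinge-style update on a linear scoring function. The function names, the two features (a text similarity and a visual similarity), and the toy training data are all assumptions invented for illustration.

```python
import random

def score(weights, sims):
    """Linear combination of per-modality similarity scores."""
    return sum(w * s for w, s in zip(weights, sims))

def train_pairwise(queries, n_features, epochs=50, lr=0.1, margin=1.0, seed=0):
    """Learn combination weights with a pairwise hinge-style update.

    queries: list of queries; each query is a list of (sims, relevant)
    pairs, where sims is a list of similarity scores for one image.
    NOTE: generic pairwise update for illustration, not OWPC.
    """
    rng = random.Random(seed)
    weights = [0.0] * n_features
    for _ in range(epochs):
        order = list(range(len(queries)))
        rng.shuffle(order)
        for qi in order:
            docs = queries[qi]
            pos = [s for s, rel in docs if rel]
            neg = [s for s, rel in docs if not rel]
            for p in pos:
                for n in neg:
                    # Update only when a relevant image does not beat an
                    # irrelevant one by at least the margin.
                    if score(weights, p) - score(weights, n) < margin:
                        weights = [w + lr * (a - b)
                                   for w, a, b in zip(weights, p, n)]
    return weights

def rank(weights, docs):
    """Return image indices sorted by decreasing combined score."""
    return sorted(range(len(docs)),
                  key=lambda i: score(weights, docs[i]),
                  reverse=True)

# Hypothetical toy data: [text_similarity, visual_similarity] per image.
# In each query the first image is the relevant one.
TRAIN = [
    [([0.9, 0.7], True), ([0.8, 0.1], False), ([0.2, 0.6], False)],
    [([0.6, 0.9], True), ([0.7, 0.2], False), ([0.1, 0.8], False)],
]

weights = train_pairwise(TRAIN, n_features=2)
rankings = [rank(weights, [sims for sims, _ in q]) for q in TRAIN]
```

In this toy setup neither similarity alone ranks the relevant image first in both queries, so the learned weights have to combine them. That is the role the abstract assigns to Learning to Rank in place of a hand-tuned fusion rule.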
Pages: 161-180
Number of pages: 19
Related papers
50 in total
  • [1] A Learning to Rank framework applied to text-image retrieval
    Buffoni, David
    Tollari, Sabrina
    Gallinari, Patrick
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (01) : 161 - 180
  • [2] Text-Image Retrieval With Salient Features
    Feng, Xia
    Hu, Zhiyi
    Liu, Caihua
    Ip, W. H.
    Chen, Huiying
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 1 - 13
  • [3] Knowledge-Aware Text-Image Retrieval for Remote Sensing Images
    Mi, Li
    Dai, Xianjie
    Castillo-Navarro, Javiera
    Tuia, Devis
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [4] Federated training of GNNs with similarity graph reasoning for text-image retrieval
    Yan, Xueming
    Wang, Chuyue
    Jin, Yaochu
    NEUROCOMPUTING, 2025, 623
  • [5] CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval
    Luelf, Christian
    Lima Martins, Denis Mayr
    Vaz Salles, Marcos Antonio
    Zhou, Yongluan
    Gieseke, Fabian
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2719 - 2723
  • [6] Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval
    Liu, Haoyu
    Song, Yaoxian
    Wang, Xuwu
    Zhu, Xiangru
    Li, Zhixu
    Song, Wei
    Lie, Tiefeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 3, 2025, 14852 : 419 - 434
  • [7] Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated Images
    Xu, Shicheng
    Hou, Danyang
    Pang, Liang
    Deng, Jingcheng
    Xu, Jun
    Shen, Huawei
    Cheng, Xueqi
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 208 - 217
  • [8] Image Retrieval Based on Learning to Rank and Multiple Loss
    Fan, Lili
    Zhao, Hongwei
    Zhao, Haoyu
    Liu, Pingping
    Hu, Huangshui
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (09)
  • [9] CLCP: Realtime Text-Image Retrieval for Retailing via Pre-trained Clustering and Priority Queue
    Zhang, Shuyang
    Wei, Liangwu
    Wang, Qingyu
    Wei, Yuntao
    Song, Yanzhi
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1089 - 1093
  • [10] A comprehensive study on learning to rank for content-based image retrieval
    Li, Yangxi
    Zhou, Chao
    Geng, Bo
    Xu, Chao
    Liu, Hong
    SIGNAL PROCESSING, 2013, 93 (06) : 1426 - 1434