A Learning to Rank framework applied to text-image retrieval

被引：0

作者：

David Buffoni

Sabrina Tollari

Patrick Gallinari

机构：

[1] Université Pierre et Marie CURIE - Paris 6 / LIP6,

来源：

Multimedia Tools and Applications | 2012年 / 60卷

关键词：

Learning to Rank; Text-image retrieval; OWPC; Visuo-textual fusion; Pooling for Learning to Rank;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We present a framework based on a Learning to Rank setting for a text-image retrieval task. In Information Retrieval, the goal is to compute the similarity between a document and an user query. In the context of text-image retrieval where several similarities exist, human intervention is often needed to decide on the way to combine them. On the other hand, with the Learning to Rank approach the combination of the similarities is done automatically. Learning to Rank is a paradigm where the learnt objective function is able to produce a ranked list of images when a user query is given. These score functions are generally a combination of similarities between a document and a query. In the past, Learning to Rank algorithms were successfully applied to text retrieval where they outperformed baselines such as BM25 or TFIDF. This inspired us to apply our state-of-the-art algorithm, called OWPC (Usunier et al. 2009), to the text-image retrieval task. At this time, no benchmarks are available, therefore we present a framework for building one. The empirical validation of this algorithm is done on the dataset constructed through comparison of typical text-image retrieval similarities. In both cases, visual only and text and visual, our algorithm performs better than a simple baseline.

引用

页码：161 / 180

页数：19

共 50 条

[1] A Learning to Rank framework applied to text-image retrieval
Buffoni, David
Tollari, Sabrina
Gallinari, Patrick
MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (01) : 161 - 180
[2] Text-Image Retrieval With Salient Features
Feng, Xia
Hu, Zhiyi
Liu, Caihua
Ip, W. H.
Chen, Huiying
JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 1 - 13
[3] Knowledge-Aware Text-Image Retrieval for Remote Sensing Images
Mi, Li
Dai, Xianjie
Castillo-Navarro, Javiera
Tuia, Devis
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[4] Federated training of GNNs with similarity graph reasoning for text-image retrieval
Yan, Xueming
Wang, Chuyue
Jin, Yaochu
NEUROCOMPUTING, 2025, 623
[5] CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval
Luelf, Christian
Lima Martins, Denis Mayr
Vaz Salles, Marcos Antonio
Zhou, Yongluan
Gieseke, Fabian
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2719 - 2723
[6] Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval
Liu, Haoyu
Song, Yaoxian
Wang, Xuwu
Zhu, Xiangru
Li, Zhixu
Song, Wei
Lie, Tiefeng
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 3, 2025, 14852 : 419 - 434
[7] Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated Images
Xu, Shicheng
Hou, Danyang
Pang, Liang
Deng, Jingcheng
Xu, Jun
Shen, Huawei
Cheng, Xueqi
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 208 - 217
[8] Image Retrieval Based on Learning to Rank and Multiple Loss
Fan, Lili
Zhao, Hongwei
Zhao, Haoyu
Liu, Pingping
Hu, Huangshui
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (09)
[9] CLCP: Realtime Text-Image Retrieval for Retailing via Pre-trained Clustering and Priority Queue
Zhang, Shuyang
Wei, Liangwu
Wang, Qingyu
Wei, Yuntao
Song, Yanzhi
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1089 - 1093
[10] A comprehensive study on learning to rank for content-based image retrieval
Li, Yangxi
Zhou, Chao
Geng, Bo
Xu, Chao
Liu, Hong
SIGNAL PROCESSING, 2013, 93 (06) : 1426 - 1434

← 1 2 3 4 5 →