LETOR: A benchmark collection for research on learning to rank for information retrieval

Cited by: 0
Authors
Tao Qin
Tie-Yan Liu
Jun Xu
Hang Li
Affiliations
[1] Microsoft Research Asia
Source
Information Retrieval | 2010 / Vol. 13
Keywords
Learning to rank; Information retrieval; Benchmark datasets; Feature extraction
DOI
Not available
Abstract
LETOR is a benchmark collection for research on learning to rank for information retrieval, released by Microsoft Research Asia. In this paper, we describe the LETOR collection in detail and show how it can be used in different kinds of research. Specifically, we describe how the document corpora and query sets in LETOR are selected, how the documents are sampled, how the learning features and meta information are extracted, and how the datasets are partitioned for comprehensive evaluation. We then compare several state-of-the-art learning-to-rank algorithms on LETOR, report their ranking performance, and discuss the results. After that, we discuss new research topics that LETOR can support beyond algorithm comparison. We hope that this paper helps readers gain a deeper understanding of LETOR and enables more interesting research projects on learning to rank and related topics.
Pages: 346-374
Page count: 28