Learning to Rank for Information Retrieval and Natural Language Processing, Second Edition

被引:0
作者
Li, Hang [1 ]
机构
[1] Huawei Technologies, China
来源
Synthesis Lectures on Human Language Technologies | 2015年 / 7卷 / 03期
关键词
collaborative filtering; information retrieval; learning to rank; machine translation; natural language processing; ranking; ranking aggregation; ranking creation; supervised learning; web search;
D O I
10.2200/S00607ED2V01Y201410HLT026
中图分类号
学科分类号
摘要
Download Free Sample Learning to rank refers to machine learning techniques for training a model in a ranking task. Learning to rank is useful for many applications in information retrieval, natural language processing, and data mining. Intensive studies have been conducted on its problems recently, and significant progress has been made. This lecture gives an introduction to the area including the fundamental problems, major approaches, theories, applications, and future work. The author begins by showing that various ranking problems in information retrieval and natural language processing can be formalized as two basic ranking tasks, namely ranking creation (or simply ranking) and ranking aggregation. In ranking creation, given a request, one wants to generate a ranking list of offerings based on the features derived from the request and the offerings. In ranking aggregation, given a request, as well as a number of ranking lists of offerings, one wants to generate a new ranking list of the offerings. Ranking creation (or ranking) is the major problem in learning to rank. It is usually formalized as a supervised learning task. The author gives detailed explanations on learning for ranking creation and ranking aggregation, including training and testing, evaluation, feature creation, and major approaches. Many methods have been proposed for ranking creation. The methods can be categorized as the pointwise, pairwise, and listwise approaches according to the loss functions they employ. They can also be categorized according to the techniques they employ, such as the SVM based, Boosting based, and Neural Network based approaches. The author also introduces some popular learning to rank methods in details. These include: PRank, OC SVM, McRank, Ranking SVM, IR SVM, GBRank, RankNet, ListNet & ListMLE, AdaRank, SVM MAP, SoftRank, LambdaRank, LambdaMART, Borda Count, Markov Chain, and CRanking. The author explains several example applications of learning to rank including web search, collaborative filtering, definition search, keyphrase extraction, query dependent summarization, and re-ranking in machine translation. A formulation of learning for ranking creation is given in the statistical learning framework. Ongoing and future research directions for learning to rank are also discussed. © 2011 by Morgan & Claypool.
引用
收藏
页码:1 / 123
页数:122
相关论文
共 126 条
[1]  
Agarwal A., Chakrabarti S., Aggarwal S., Learning to rank networked entities, KDD, pp. 14-23, (2006)
[2]  
Agarwal S., Niyogi P., Stability and generalization of bipartite ranking algorithms, COLT, 86, pp. 32-47, (2005)
[3]  
Agichtein E., Brill E., Dumais S., Improving web search ranking by incorporating user behavior information, Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '06, 17, pp. 19-26, (2006)
[4]  
Agrawal R., Halverson A., Kenthapadi K., Mishra N., Tsaparas P., Generating labels from clicks, WSDM, 88, pp. 172-181, (2009)
[5]  
Ailon N., Mohri M., An efficient reduction of ranking to classification, COLT, 86, pp. 87-98, (2008)
[6]  
Amini M.R., Truong T.V., Goutte C., A boosting algorithm for learning bipartite ranking functions with partially labeled data, SIGIR 08: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 88, pp. 99-106, (2008)
[7]  
Aslam J.A., Kanoulas E., Pavlu V., Savev S., Yilmaz E., Document selection methodologies for efficient and effective learning-to-rank, SIGIR 09: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 88, pp. 468-475, (2009)
[8]  
Aslam J.A., Montague M., Models for metasearch, Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '01, 35, pp. 276-284, (2001)
[9]  
Bai J., Zhou K., Xue G., Zha H., Sun G., Tseng B., Zheng Z., Chang Y., Multi-task learning for learning to rank in web search, CIKM 09: Proceeding of the 18th ACM Conference on Information and Knowledge Management, pp. 1549-1552, (2009)
[10]  
Bendersky M., Croft W.B., Diao Y., Quality-biased ranking of web documents, WSDM, pp. 95-104, (2011)