Revisiting Approximate Metric Optimization in the Age of Deep Neural Networks

被引:39
作者
Bruch, Sebastian [1 ]
Zoghi, Masrour [1 ]
Bendersky, Michael [1 ]
Najork, Marc [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
来源
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19) | 2019年
关键词
Direct Ranking Metric Optimization; Deep Neural Networks for IR; Learning to Rank;
D O I
10.1145/3331184.3331347
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning-to-Rank is a branch of supervised machine learning that seeks to produce an ordering of a list of items such that the utility of the ranked list is maximized. Unlike most machine learning techniques, however, the objective cannot be directly optimized using gradient descent methods as it is either discontinuous or flat everywhere. As such, learning-to-rank methods often optimize a loss function that either is loosely related to or upper-bounds a ranking utility instead. A notable exception is the approximation framework originally proposed by Qin et al. [14] that facilitates a more direct approach to ranking metric optimization. We revisit that framework almost a decade later in light of recent advances in neural networks and demonstrate its superiority empirically. Through this study, we hope to show that the ideas from that work are more relevant than ever and can lay the foundation of learning-to-rank research in the age of deep neural networks.
引用
收藏
页码:1241 / 1244
页数:4
相关论文
共 19 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2005, DIRECT MAXIMIZATION
  • [3] Burges C., 2005, P 22 INT C MACH LEAR, P89, DOI DOI 10.1145/1102351.1102363
  • [4] Burges C. J. C., 2010, Learning, V11, P81
  • [5] Cao H, 2007, PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, P129
  • [6] Chapelle D, 2011, COMPUT FLUID SOLID M, P1, DOI 10.1007/978-3-642-16408-8
  • [7] Greedy function approximation: A gradient boosting machine
    Friedman, JH
    [J]. ANNALS OF STATISTICS, 2001, 29 (05) : 1189 - 1232
  • [8] Ioffe S, 2015, 32 INT C MACH LEARN
  • [9] Cumulated gain-based evaluation of IR techniques
    Järvelin, K
    Kekäläinen, J
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (04) : 422 - 446
  • [10] Joachims T., 2006, KDD, P217