Statistical Analysis of Bayes Optimal Subset Ranking

被引:94
作者
Cossock, David [1 ]
Zhang, Tong [2 ]
机构
[1] Yahoo Inc, Sunnyvale, CA 94089 USA
[2] Rutgers State Univ, Piscataway, NJ 08854 USA
基金
美国国家科学基金会;
关键词
Bayes optimal; consistency; convex surrogate; ranking;
D O I
10.1109/TIT.2008.929939
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ranking problem has become increasingly important in modern applications of statistical methods in automated decision making systems. In particular, we consider a formulation of the statistical ranking problem which we call subset ranking, and focus on the discounted cumulated gain (DCG) criterion that measures the quality of items near the top of the rank-list. Similar to error minimization for binary classification, direct optimization of natural ranking criteria such as DCG leads to a nonconvex optimization problems that can be NP-hard. Therefore, a computationally more tractable approach is needed. We present bounds that relate the approximate optimization of DCG to the approximate minimization of certain regression errors. These bounds justify the use of convex learning formulations for solving the subset ranking problem. The resulting estimation methods are not conventional, in that we focus on the estimation quality in the top-portion of the rank-list. We further investigate the asymptotic statistical behavior of these formulations. Under appropriate conditions, the consistency of the estimation schemes with respect to the DCG metric can be derived.
引用
收藏
页码:5140 / 5154
页数:15
相关论文
共 32 条
[1]   Learnability of bipartite ranking functions [J].
Agarwal, S ;
Roth, D .
LEARNING THEORY, PROCEEDINGS, 2005, 3559 :16-31
[2]  
Agarwal S, 2005, J MACH LEARN RES, V6, P393
[3]  
[Anonymous], [No title captured]
[4]  
[Anonymous], 2003, Journal of machine learning research
[5]  
[Anonymous], 2003, J MACH LEARN RES
[6]   Convexity, classification, and risk bounds [J].
Bartlett, PL ;
Jordan, MI ;
McAuliffe, JD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :138-156
[7]   Boosting for high-dimensional linear models [J].
Buhlmann, Peter .
ANNALS OF STATISTICS, 2006, 34 (02) :559-583
[8]  
Burges C., 2005, Proceedings of the 22Nd International Conference on Machine Learning, ICML'05, P89, DOI DOI 10.1145/1102351.1102363
[9]  
CAPONNETTO A, 2005, NOTE ROLE SQUARED LO
[10]   Ranking and scoring using empirical risk minimization [J].
Clémençon, S ;
Lugosi, G ;
Vayatis, N .
LEARNING THEORY, PROCEEDINGS, 2005, 3559 :1-15