Preference-based learning to rank

被引:18
作者
Ailon, Nir [1 ]
Mohri, Mehryar [1 ,2 ]
机构
[1] Technion Israel Inst Technol, Fac Comp Sci, IL-32000 Haifa, Israel
[2] NYU, Courant Inst Math Sci, New York, NY 10012 USA
关键词
Learning to rank; Machine learning reductions; ROC; ROC CURVE; AREA;
D O I
10.1007/s10994-010-5176-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an efficient preference-based ranking algorithm running in two stages. In the first stage, the algorithm learns a preference function defined over pairs, as in a standard binary classification problem. In the second stage, it makes use of that preference function to produce an accurate ranking, thereby reducing the learning problem of ranking to binary classification. This reduction is based on the familiar QuickSort and guarantees an expected pairwise misranking loss of at most twice that of the binary classifier derived in the first stage. Furthermore, in the important special case of bipartite ranking, the factor of two in loss is reduced to one. This improved bound also applies to the regret achieved by our ranking and that of the binary classifier obtained. Our algorithm is randomized, but we prove a lower bound for any deterministic reduction of ranking to binary classification showing that randomization is necessary to achieve our guarantees. This, and a recent result by Balcan et al., who show a regret bound of two for a deterministic algorithm in the bipartite case, suggest a trade-off between achieving low regret and determinism in this context. Our reduction also admits an improved running time guarantee with respect to that deterministic algorithm. In particular, the number of calls to the preference function in the reduction is improved from Omega(n (2)) to O(nlog n). In addition, when the top k ranked elements only are required (ka parts per thousand(a)n), as in many applications in information extraction or search engine design, the time complexity of our algorithm can be further reduced to O(klog k+n). Our algorithm is thus practical for realistic applications where the number of points to rank exceeds several thousand.
引用
收藏
页码:189 / 211
页数:23
相关论文
共 50 条
  • [21] Graph-based comparative analysis of learning to rank datasets
    Amir Hosein Keyhanipour
    International Journal of Data Science and Analytics, 2024, 17 : 165 - 187
  • [22] Deep neural network based learning to rank for address standardization
    Hai-Nam Cao
    Viet-Trung, Tran
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 25 - 30
  • [23] Rule-based specification mining leveraging learning to rank
    Cao, Zherui
    Tian, Yuan
    Le, Tien-Duy B.
    Lo, David
    AUTOMATED SOFTWARE ENGINEERING, 2018, 25 (03) : 501 - 530
  • [24] Graph-based Feature Selection Method for Learning to Rank
    Yeh, Jen-Yuan
    Tsai, Cheng-Jung
    2020 6TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING, ICCIP 2020, 2020, : 70 - 73
  • [25] Rank-Based Decomposable Losses in Machine Learning: A Survey
    Hu, Shu
    Wang, Xin
    Lyu, Siwei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13599 - 13620
  • [26] Scalar is Not Enough: Vectorization-based Unbiased Learning to Rank
    Chen, Mouxiang
    Liu, Chenghao
    Liu, Zemin
    Sun, Jianling
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 136 - 145
  • [27] Graph-based comparative analysis of learning to rank datasets
    Keyhanipour, Amir Hosein
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 17 (02) : 165 - 187
  • [28] Opinion-Based Entity Ranking using learning to rank
    Bashir, Shariq
    Afzal, Wasif
    Baig, Abdul Rauf
    APPLIED SOFT COMPUTING, 2016, 38 : 151 - 163
  • [29] Learning to rank in PRISM
    Kojima, Ryosuke
    Sato, Taisuke
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 93 : 561 - 577
  • [30] Learning to Efficiently Rank
    Wang, Lidan
    Lin, Jimmy
    Metzler, Donald
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 138 - 145