Scalar is Not Enough: Vectorization-based Unbiased Learning to Rank

被引:4
作者
Chen, Mouxiang [1 ]
Liu, Chenghao [2 ]
Liu, Zemin [3 ]
Sun, Jianling [1 ]
机构
[1] Zhejiang Univ & Alibaba Zhejiang Univ Joint Inst, Hangzhou, Peoples R China
[2] Salesforce Res Asia, Singapore, Singapore
[3] Singapore Management Univ, Singapore, Singapore
来源
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年
关键词
learning to rank; unbiased learning to rank; examination hypothesis;
D O I
10.1145/3534678.3539468
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unbiased learning to rank (ULTR) aims to train an unbiased ranking model from biased user click logs. Most of the current ULTR methods are based on the examination hypothesis (EH), which assumes that the click probability can be factorized into two scalar functions, one related to ranking features and the other related to bias factors. Unfortunately, the interactions among features, bias factors and clicks are complicated in practice, and usually cannot be factorized in this independent way. Fitting click data with EH could lead to model misspecification and bring the approximation error. In this paper, we propose a vector-based EH and formulate the click probability as a dot product of two vector functions. This solution is complete due to its universality in fitting arbitrary click functions. Based on it, we propose a novel model named Vectorization to adaptively learn the relevance embeddings and sort documents by projecting embeddings onto a base vector. Extensive experiments show that our method significantly outperforms the state-of-the-art ULTR methods on complex real clicks as well as simple simulated clicks. (1)
引用
收藏
页码:136 / 145
页数:10
相关论文
共 45 条
[11]   Debiasing Learning to Rank Models with Generative Adversarial Networks [J].
Cai, Hui ;
Wang, Chengyu ;
He, Xiaofeng .
WEB AND BIG DATA, PT II, APWEB-WAIM 2020, 2020, 12318 :45-60
[12]  
Chapelle O., 2011, P LEARNING RANK CHAL, P1
[13]  
Chapelle Olivier, 2009, P 18 ACM C INFORM KN, P621, DOI DOI 10.1145/1645953.1646033
[14]   TianGong-ST: A New Dataset with Large-scale Refined Real-world Web Search Sessions [J].
Chen, Jia ;
Mao, Jiaxin ;
Liu, Yiqun ;
Zhang, Min ;
Ma, Shaoping .
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, :2485-2488
[15]   Adapting Interactional Observation Embedding for Counterfactual Learning to Rank [J].
Chen, Mouxiang ;
Liu, Chenghao ;
Sun, Jianling ;
Hoi, Steven C. H. .
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, :285-294
[16]   Intervention Harvesting for Context-Dependent Examination-Bias Estimation [J].
Fang, Zhichong ;
Agarwal, Aman ;
Joachims, Thorsten .
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :825-834
[17]  
Guo F., 2009, Proceedings of the 2nd ACM International Conference on Web Search and Data Mining (WSDM), P124
[18]  
Guo F., 2009, P 18 INT C WORLD WID, P11
[19]   Fast Matrix Factorization for Online Recommendation with Implicit Feedback [J].
He, Xiangnan ;
Zhang, Hanwang ;
Kan, Min-Yen ;
Chua, Tat-Seng .
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, :549-558
[20]   Unbiased LambdaMART: An Unbiased Pairwise Learning-to-Rank Algorithm [J].
Hu, Ziniu ;
Wang, Yang ;
Peng, Qu ;
Li, Hang .
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :2830-2836