Learning to Rank with Selection Bias in Personal Search

Cited by: 155
Authors
Wang, Xuanhui [1]
Bendersky, Michael [1]
Metzler, Donald [1]
Najork, Marc [1]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
Source
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2016
Keywords
Personal Search; Selection Bias; Learning-to-Rank;
DOI
10.1145/2911451.2911537
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Click-through data has proven to be a critical resource for improving search ranking quality. Though a large amount of click data can be easily collected by search engines, various biases make it difficult to fully leverage this type of data. In the past, many click models have been proposed and successfully used to estimate the relevance for individual query-document pairs in the context of web search. These click models typically require a large quantity of clicks for each individual pair and this makes them difficult to apply in systems where click data is highly sparse due to personalized corpora and information needs, e.g., personal search. In this paper, we study the problem of how to leverage sparse click data in personal search and introduce a novel selection bias problem and address it in the learning-to-rank framework. This paper proposes a few bias estimation methods, including a novel query-dependent one that captures queries with similar results and can successfully deal with sparse data. We empirically demonstrate that learning-to-rank that accounts for query-dependent selection bias yields significant improvements in search effectiveness through online experiments with one of the world's largest personal search engines.
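The abstract does not spell out how the bias estimates enter training. One common way to account for selection bias in click data is inverse propensity weighting, and the following is a minimal sketch of that general idea, not the paper's method. It assumes the bias depends only on rank position and that a log of randomized traffic is available; the function names and the log format are hypothetical.

```python
from collections import defaultdict


def estimate_position_bias(click_log):
    """Estimate relative click propensity per rank position.

    click_log: iterable of (position, clicked) pairs. ASSUMPTION: the pairs
    come from traffic where result order was randomized, so click-rate
    differences between positions reflect bias rather than relevance.
    """
    shown = defaultdict(int)
    clicks = defaultdict(int)
    for position, clicked in click_log:
        shown[position] += 1
        clicks[position] += int(clicked)

    ctr = {p: clicks[p] / shown[p] for p in shown}
    top = ctr[min(ctr)]  # click rate at the lowest-numbered position
    if top == 0:
        raise ValueError("no clicks at the top position; cannot normalize")
    # Keep only positions with at least one click so weights stay finite.
    return {p: rate / top for p, rate in ctr.items() if rate > 0}


def inverse_propensity_weights(clicked_positions, bias):
    """Weight each clicked example by 1 / (estimated bias at its position)."""
    return [1.0 / bias[p] for p in clicked_positions]


# Hypothetical log: position 1 is clicked half the time, position 2 a quarter.
log = [(1, True), (1, True), (1, False), (1, False),
       (2, True), (2, False), (2, False), (2, False)]
bias = estimate_position_bias(log)
weights = inverse_propensity_weights([1, 2], bias)
```

In this sketch a click at position 2 receives twice the weight of a click at position 1, which compensates for position 2 being examined less often. These weights could then scale the per-example loss of any learning-to-rank model. A query-dependent variant would estimate a separate bias table per group of similar queries, though how the paper forms such groups is not stated in the abstract.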
Pages: 115-124
Page count: 10
Related papers
50 in total
  • [41] Deep Generative Positive-Unlabeled Learning under Selection Bias
    Na, Byeonghu
    Kim, Hyemi
    Song, Kyungwoo
    Joo, Weonyoung
    Kim, Yoon-Yeong
    Moon, Il-Chul
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1155 - 1164
  • [42] Stock portfolio selection using learning-to-rank algorithms with news sentiment
    Song, Qiang
    Liu, Anqi
    Yang, Steve Y.
    NEUROCOMPUTING, 2017, 264 : 20 - 28
  • [43] Personal exposure assessment studies may suffer from exposure-relevant selection bias
    Oglesby, L
    Rotko, T
    Krütli, P
    Boudet, C
    Kruize, H
    Jantunen, MJ
    Künzli, N
    JOURNAL OF EXPOSURE ANALYSIS AND ENVIRONMENTAL EPIDEMIOLOGY, 2000, 10 (03): : 251 - 266
  • [44] Personal exposure assessment studies may suffer from exposure-relevant selection bias
    LUCY OGLESBY
    TUULIA ROTKO
    PIUS KRÜTLI
    CÉLINE BOUDET
    HANNEKE KRUIZE
    MATTI J JANTUNEN
    NINO KÜNZLI
    Journal of Exposure Science & Environmental Epidemiology, 2000, 10 : 251 - 266
  • [45] LTRRS: A Learning to Rank Based Algorithm for Resource Selection in Distributed Information Retrieval
    Wu, Tianfeng
    Liu, Xiaofeng
    Dong, Shoubin
    INFORMATION RETRIEVAL (CCIR 2019), 2019, 11772 : 52 - 63
  • [46] A feature selection method based on minimum redundancy maximum relevance for learning to rank
    Shirzad, Mehrnoush Barani
    Keyvanpour, Mohammad Reza
    2015 AI & ROBOTICS (IRANOPEN), 2015,
  • [47] Fast Pairwise Query Selection for Large-Scale Active Learning to Rank
    Qian, Buyue
    Wang, Xiang
    Wang, Jun
    Li, Hongfei
    Cao, Nan
    Zhi, Weifeng
    Davidson, Ian
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 607 - 616
  • [48] Identification of efficient algorithms for web search through implementation of learning-to-rank algorithms
    Dhake, Nikhil
    Raut, Shital
    Rahangdale, Ashwini
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2019, 44 (04): : 1 - 12
  • [49] Improving Consumer Health Search with Field-Level Learning-to-Rank Techniques
    Yang, Hua
    Goncalves, Teresa
    INFORMATION, 2024, 15 (11)
  • [50] Identification of efficient algorithms for web search through implementation of learning-to-rank algorithms
    Nikhil Dhake
    Shital Raut
    Ashwini Rahangdale
    Sādhanā, 2019, 44