Learning to Rank with Selection Bias in Personal Search

被引:155
|
作者
Wang, Xuanhui [1 ]
Bendersky, Michael [1 ]
Metzler, Donald [1 ]
Najork, Marc [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
来源
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2016年
关键词
Personal Search; Selection Bias; Learning-to-Rank;
D O I
10.1145/2911451.2911537
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Click-through data has proven to be a critical resource for improving search ranking quality. Though a large amount of click data can be easily collected by search engines, various biases make it difficult to fully leverage this type of data. In the past, many click models have been proposed and successfully used to estimate the relevance for individual query-document pairs in the context of web search. These click models typically require a large quantity of clicks for each individual pair and this makes them difficult to apply in systems where click data is highly sparse due to personalized corpora and information needs, e.g., personal search. In this paper, we study the problem of how to leverage sparse click data in personal search and introduce a novel selection bias problem and address it in the learning-to-rank framework. This paper proposes a few bias estimation methods, including a novel query-dependent one that captures queries with similar results and can successfully deal with sparse data. We empirically demonstrate that learning-to-rank that accounts for query-dependent selection bias yields significant improvements in search effectiveness through online experiments with one of the world's largest personal search engines.
引用
收藏
页码:115 / 124
页数:10
相关论文
共 50 条
  • [21] Drug Selection via Joint Push and Learning to Rank
    He, Yicheng
    Liu, Junfeng
    Ning, Xia
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (01) : 110 - 123
  • [22] Learning to Rank for Search Results Re-ranking in Learning Experience Platforms
    Kataria, Ayush
    Venkateshprasanna, H. M.
    Kummetha, Ashok Kumar Reddy
    PROCEEDINGS OF THE 16TH ANNUAL ACM INDIA COMPUTE CONFERENCE, COMPUTE 2023, 2023, : 25 - 30
  • [23] Graph-Based Pairwise Learning to Rank for Video Search
    Liu, Yuan
    Mei, Tao
    Tang, Jinhui
    Wu, Xiuqing
    Hua, Xian-Sheng
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2009, 5371 : 175 - +
  • [24] Deep Bayesian Active Learning for Learning to Rank: A Case Study in Answer Selection
    Wang, Qunbo
    Wu, Wenjun
    Qi, Yuxing
    Zhao, Yongchi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (11) : 5251 - 5262
  • [25] A Systematic Study of Feature Selection Methods for Learning to Rank Algorithms
    Shirzad, Mehrnoush Barani
    Keyvanpour, Mohammad Reza
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2018, 8 (03) : 46 - 67
  • [26] Graph-based Feature Selection Method for Learning to Rank
    Yeh, Jen-Yuan
    Tsai, Cheng-Jung
    2020 6TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING, ICCIP 2020, 2020, : 70 - 73
  • [27] Feature Selection for Learning-to-Rank using Simulated Annealing
    Allvi, Mustafa Wasif
    Hasan, Mahamudul
    Rayon, Lazim
    Shahabuddin, Mohammad
    Khan, Md Mosaddek
    Ibrahim, Muhammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 699 - 705
  • [28] Incorporating Risk-Sensitiveness into Feature Selection for Learning to Rank
    de Sousa, Daniel Xavier
    Canuto, Sergio Daniel
    Rosa, Thierson Couto
    Santos, Wellington
    Goncalves, Marcos Andre
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 257 - 266
  • [29] Document Selection Methodologies for Efficient and Effective Learning-to-Rank
    Aslam, Javed A.
    Kanoulas, Evangelos
    Pavlu, Virgil
    Savev, Stefan
    Yilmaz, Emine
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 468 - 475
  • [30] Learning to Rank Instant Search Results with Multiple Indices: A Case Study in Search Aggregation for Entertainment
    Rome, Scott
    Hamidian, Sardar
    Walsh, Richard
    Foley, Kevin
    Ture, Ferhan
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3412 - 3416