Learning Query and Document Relevance from a Web-scale Click Graph

被引:32
作者
Jiang, Shan [1 ]
Hu, Yuening [2 ]
Kang, Changsung [2 ]
Daly, Tim, Jr. [2 ]
Yin, Dawei [2 ]
Chang, Yi [2 ]
Zhai, Chengxiang [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[2] Yahoo Res, Sunnyvale, CA USA
来源
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2016年
关键词
Click-through bipartite graph; vector propagation; vector generation; Web search; query-document relevance;
D O I
10.1145/2911451.2911531
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Click-through logs over query-document pairs provide rich and valuable information for multiple tasks in information retrieval. This paper proposes a vector propagation algorithm on the click graph to learn vector representations for both queries and documents in the same semantic space. The proposed approach incorporates both click and content information, and the produced vector representations can directly improve ranking performance for queries and documents that have been observed in the click log. For new queries and documents that are not in the click log, we propose a two-step framework to generate the vector representation, which significantly improves the coverage of our vectors while maintaining the high quality. Experiments on Web-scale search logs from a major commercial search engine demonstrate the effectiveness and scalability of the proposed method. Evaluation results show that NDCG scores are significantly improved against multiple baselines by using the proposed method both as a ranking model and as a feature in a learning-to-rank framework.
引用
收藏
页码:185 / 194
页数:10
相关论文
共 9 条
  • [1] MIRA:Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks
    Zhang, Yusi
    Liu, Chuanjie
    Luo, Angen
    Xue, Hui
    Shan, Xuan
    Luo, Yuxiang
    Xia, Yiqian
    Yan, Yuanchi
    Wang, Haidong
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 227 - 238
  • [2] MPGraf: a Modular and Pre-trained Graphformer for Learning to Rank at Web-scale
    Li, Yuchen
    Xiong, Haoyi
    Kong, Linghe
    Sun, Zeyi
    Chen, Hongyang
    Wang, Shuaiqiang
    Yin, Dawei
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 339 - 348
  • [3] CWRCzech: 100M Query-Document Czech Click Dataset and Its Application toWeb Relevance Ranking
    Vonasek, Josef
    Straka, Milan
    Krc, Rostislav
    Lasonova, Lenka
    Egorova, Ekaterina
    Strakova, Jana
    Naplava, Jakub
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1221 - 1231
  • [4] Learning a unified embedding space of web search from large-scale query log
    Bing, Lidong
    Niu, Zheng-Yu
    Li, Piji
    Lam, Wai
    Wang, Haifeng
    KNOWLEDGE-BASED SYSTEMS, 2018, 150 : 38 - 48
  • [5] Beyond Bag-of-Words: Machine Learning for Query-Document Matching in Web Search
    Li, Hang
    Xu, Jun
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1177 - 1177
  • [6] GS2P: a generative pre-trained learning to rank model with over-parameterization for web-scale search
    Li, Yuchen
    Xiong, Haoyi
    Kong, Linghe
    Bian, Jiang
    Wang, Shuaiqiang
    Chen, Guihai
    Yin, Dawei
    MACHINE LEARNING, 2024, 113 (08) : 5331 - 5349
  • [7] Large-Scale Estimation and Analysis of Web Users' Mood from Web Search Query and Mobile Sensor Data
    Sasaki, Wataru
    Hamanaka, Satoki
    Miyahara, Satoko
    Tsubouchi, Kota
    Nakazawa, Jin
    Okoshi, Tadashi
    BIG DATA, 2024, 12 (03) : 191 - 209
  • [8] Bayesian Browsing Model: Exact Inference of Document Relevance from Petabyte-Scale Data
    Liu, Chao
    Guo, Fan
    Faloutsos, Christos
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (04)
  • [9] LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search
    Li, Yuchen
    Xiong, Haoyi
    Kong, Linghe
    Wang, Shuaiqiang
    Sun, Zeyi
    Chen, Hongyang
    Chen, Guihai
    Yin, Dawei
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 635 - 651