A new approach to query segmentation for relevance ranking in web search

被引:4
|
作者
Wu, Haocheng [1 ]
Hu, Yunhua [2 ]
Li, Hang [3 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Alibaba Com, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei Technol, Hong Kong, Hong Kong, Peoples R China
来源
INFORMATION RETRIEVAL JOURNAL | 2015年 / 18卷 / 01期
关键词
Web search; Query segmentation; Relevance ranking; Query processing; Re-ranking; BM25; Term dependency model; Key n-gram extraction;
D O I
10.1007/s10791-014-9246-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we try to determine how best to improve state-of-the-art methods for relevance ranking in web searching by query segmentation. Query segmentation is meant to separate the input query into segments, typically natural language phrases. We propose employing the re-ranking approach in query segmentation, which first employs a generative model to create the top k candidates and then employs a discriminative model to re-rank the candidates to obtain the final segmentation result. The method has been widely utilized for structure prediction in natural language processing, but has not been applied to query segmentation, as far as we know. Furthermore, we propose a new method for using the results of query segmentation in relevance ranking, which takes both the original query words and the segmented query phrases as units of query representation. We investigate whether our method can improve three relevance models, namely n-gram BM25, key n-gram model and term dependency model, within the framework of learning to rank. Our experimental results on large scale web search datasets show that our method can indeed significantly improve relevance ranking in all three cases.
引用
收藏
页码:26 / 50
页数:25
相关论文
共 50 条
  • [1] A new approach to query segmentation for relevance ranking in web search
    Haocheng Wu
    Yunhua Hu
    Hang Li
    Enhong Chen
    Information Retrieval Journal, 2015, 18 : 26 - 50
  • [2] Relevance Ranking for Web Search
    Lages, Joao
    Carvalho, Joao Paulo
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [3] An IR-based Evaluation Framework for Web Search Query Segmentation
    Roy, Rishiraj Saha
    Ganguly, Niloy
    Choudhury, Monojit
    Laxman, Srivatsan
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 881 - 890
  • [4] PSkip: Estimating Relevance Ranking Quality from Web Search Clickthrough Data
    Wang, Kuansan
    Walker, Toby
    Zheng, Zijian
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 1355 - 1363
  • [5] Numeric Query Ranking Approach
    Wu, Jie
    Liu, Yi
    Wen, Ji-Rong
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 229 - 230
  • [6] Semantic relevance ranking for XML keyword search
    Lou, Ying
    Li, Zhanhuai
    Chen, Qun
    INFORMATION SCIENCES, 2012, 190 : 127 - 143
  • [7] Multitask Learning for Query Segmentation in Job Search
    Salehi, Bahar
    Liu, Fei
    Baldwin, Timothy
    Wong, Wilson
    PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 179 - 182
  • [8] Customized query response for an improved web search
    Loia, Vincenzo
    Senatore, Sabrina
    THEORETICAL ADVANCES AND APPLICATIONS OF FUZZY LOGIC AND SOFT COMPUTING, 2007, 42 : 653 - +
  • [9] New Re-Ranking Approach in Merging Search Results
    Vo Trung Hung
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2019, 43 (02): : 235 - 241
  • [10] Smoothing Clickthrough Data for Web Search Ranking
    Gao, Jianfeng
    Yuan, Wei
    Li, Xiao
    Deng, Kefeng
    Nie, Jian-Yun
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 355 - 362