A new approach to query segmentation for relevance ranking in web search

被引:4
作者
Wu, Haocheng [1 ]
Hu, Yunhua [2 ]
Li, Hang [3 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Alibaba Com, Beijing, Peoples R China
[3] Noahs Ark Lab Huawei Technol, Hong Kong, Hong Kong, Peoples R China
来源
INFORMATION RETRIEVAL JOURNAL | 2015年 / 18卷 / 01期
关键词
Web search; Query segmentation; Relevance ranking; Query processing; Re-ranking; BM25; Term dependency model; Key n-gram extraction;
D O I
10.1007/s10791-014-9246-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we try to determine how best to improve state-of-the-art methods for relevance ranking in web searching by query segmentation. Query segmentation is meant to separate the input query into segments, typically natural language phrases. We propose employing the re-ranking approach in query segmentation, which first employs a generative model to create the top k candidates and then employs a discriminative model to re-rank the candidates to obtain the final segmentation result. The method has been widely utilized for structure prediction in natural language processing, but has not been applied to query segmentation, as far as we know. Furthermore, we propose a new method for using the results of query segmentation in relevance ranking, which takes both the original query words and the segmented query phrases as units of query representation. We investigate whether our method can improve three relevance models, namely n-gram BM25, key n-gram model and term dependency model, within the framework of learning to rank. Our experimental results on large scale web search datasets show that our method can indeed significantly improve relevance ranking in all three cases.
引用
收藏
页码:26 / 50
页数:25
相关论文
共 50 条
  • [21] Incorporating Query Reformulating Behavior into Web Search Evaluation
    Chen, Jia
    Liu, Yiqun
    Mao, Jiaxin
    Zhang, Fan
    Sakai, Tetsuya
    Ma, Weizhi
    Zhang, Min
    Ma, Shaoping
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 171 - 180
  • [22] A Children-Oriented Re-ranking Method for Web Search Engines
    Iwata, Mayu
    Arase, Yuki
    Hara, Takahiro
    Nishio, Shojiro
    WEB INFORMATION SYSTEM ENGINEERING-WISE 2010, 2010, 6488 : 225 - +
  • [23] Ranking Search Results in Library Information Systems - Considering Ranking Approaches Adapted From Web Search Engines
    Behnert, Christiane
    Lewandowski, Dirk
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2015, 41 (06) : 725 - 735
  • [24] Ranking of keyword-combined searches in relational databases based on relevance to the user query
    Loh, W. -K.
    Kwon, H. -Y.
    ELECTRONICS LETTERS, 2020, 56 (10) : 495 - 497
  • [25] Optimizing Query Evaluations Using Reinforcement Learning for Web Search
    Rosset, Corby
    Jose, Damien
    Ghosh, Gargi
    Mitra, Bhaskar
    Tiwary, Saurabh
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1193 - 1196
  • [26] CWRCzech: 100M Query-Document Czech Click Dataset and Its Application toWeb Relevance Ranking
    Vonasek, Josef
    Straka, Milan
    Krc, Rostislav
    Lasonova, Lenka
    Egorova, Ekaterina
    Strakova, Jana
    Naplava, Jakub
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1221 - 1231
  • [27] Efficient Ranking based on Web Page Importance and Personalized Search
    Selvan, Mercy Paul
    Shekar, A. Chandra
    Babu, Deepak R.
    Teja, A. Krishna
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 1093 - 1097
  • [28] Relevance Feedback versus Web Search Document Clustering
    Alam, Mansaf
    Sadaf, Kishwar
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1665 - 1669
  • [29] Enhancing web search by using query-based clusters and multi-document summaries
    Rani Qumsiyeh
    Yiu-Kai Ng
    Knowledge and Information Systems, 2016, 47 : 355 - 380
  • [30] Enhancing web search by using query-based clusters and multi-document summaries
    Qumsiyeh, Rani
    Ng, Yiu-Kai
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 47 (02) : 355 - 380