Implementing and evaluating phrasal query suggestions for proximity search

被引:4
|
作者
Feuer, Alan [1 ]
Savev, Stefan [1 ]
Aslam, Javed A. [1 ]
机构
[1] Northeastern Univ, Coll Comp & Informat Sci, Boston, MA 02115 USA
基金
美国国家科学基金会;
关键词
Proximity search; Proximal subphrases; Unordered super phrases; Query log analysis; User study; Web search; ALGORITHM;
D O I
10.1016/j.is.2009.03.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes and evaluates a unified approach to phrasal query suggestions in the context of a high-precision search engine. The search engine performs ranked extended-Boolean searches with the proximity operator NFAR being the default operation. Suggestions are offered to the searcher when the length of the result list falls outside predefined bounds. If the list is too long, the engine specializes the query through the use of super phrases; if the list is too short, the engine generalizes the query through the use of proximal subphrases. We describe methods for generating both types of suggestions and present algorithms for ranking the suggestions. Specifically, we present the problem of counting proximal subphrases for specialization and the problem of counting unordered super phrases for generalization. The uptake of our approach was evaluated by analyzing search log data from before and after the suggestion feature was added to a commercial version of the search engine. We looked at approximately 1.5 million queries and found that, after they were added, suggestions represented nearly 30% of the total queries. Efficacy was evaluated through a controlled study of 24 participants performing nine searches using three different search engines. We found that the engine with phrasal query suggestions had better high-precision recall than both the same search engine without suggestions and a search engine with a similar interface but using an Okapi BM25 ranking algorithm. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:711 / 723
页数:13
相关论文
共 50 条
  • [21] Customized query response for an improved web search
    Loia, Vincenzo
    Senatore, Sabrina
    THEORETICAL ADVANCES AND APPLICATIONS OF FUZZY LOGIC AND SOFT COMPUTING, 2007, 42 : 653 - +
  • [22] Semantics of Query Rewriting Patterns in Search Logs
    Fujita, Sumio
    Dupret, Georges
    Baeza-Yates, Ricardo
    PROCEEDINGS OF THE FIFTH WORKSHOP ON EXPLOITING SEMANTIC ANNOTATIONS IN INFORMATION RETRIEVAL, 2012, : 7 - +
  • [23] Mining Web search engines for query suggestion
    Xu, Zheng
    Luo, Xiangfeng
    Yu, Jie
    Xu, Weimin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (10) : 1101 - 1113
  • [24] Improving Query Reformulation in Voice Search Systemorg
    Sa, Ning
    PROCEEDINGS OF THE 2016 ACM CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL (CHIIR'16), 2016, : 365 - 367
  • [25] An incremental model on search engine query recommendation
    Wang, JianGuo
    Huang, Joshua Zhexue
    Wu, Dingming
    Guo, Jiafeng
    Lan, Yanyan
    NEUROCOMPUTING, 2016, 218 : 423 - 431
  • [26] Investigating query bursts in a web search engine
    Subašić, Ilija
    Castillo, Carlos
    Web Intelligence and Agent Systems, 2013, 11 (02): : 107 - 124
  • [27] Evaluating the Privacy Guarantees of Location Proximity Services
    Argyros, George
    Petsios, Theofilos
    Sivakorn, Suphannee
    Keromytis, Angelos D.
    Polakis, Jason
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2017, 19 (04)
  • [28] Intent Boundary Detection in Search Query Logs
    Wang, Chieh-Jen
    Lin, Kevin Hsin-Yih
    Chen, Hsin-Hsi
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 749 - 750
  • [29] DOWN THE RABBIT HOLE: ROBUST PROXIMITY SEARCH AND DENSITY ESTIMATION IN SUBLINEAR SPACE
    Har-Peled, Sariel
    Kumar, Nirman
    SIAM JOURNAL ON COMPUTING, 2014, 43 (04) : 1486 - 1511
  • [30] Approximating Minimization Diagrams and Generalized Proximity Search
    Har-Peled, Sariel
    Kumar, Nirman
    2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2013, : 717 - 726