Cost-Aware Strategies for Query Result Caching in Web Search Engines

被引:41
|
作者
Ozcan, Rifat [1 ]
Altingovde, Ismail Sengor [1 ]
Ulusoy, Ozgor [1 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
关键词
Algorithms; Performance; Experimentation; Query result caching; Web search engines;
D O I
10.1145/1961659.1961663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Search engines and large-scale IR systems need to cache query results for efficiency and scalability purposes. Static and dynamic caching techniques (as well as their combinations) are employed to effectively cache query results. In this study, we propose cost-aware strategies for static and dynamic caching setups. Our research is motivated by two key observations: (i) query processing costs may significantly vary among different queries, and (ii) the processing cost of a query is not proportional to its popularity (i.e., frequency in the previous logs). The first observation implies that cache misses have different, that is, nonuniform, costs in this context. The latter observation implies that typical caching policies, solely based on query popularity, can not always minimize the total cost. Therefore, we propose to explicitly incorporate the query costs into the caching policies. Simulation results using two large Web crawl datasets and a real query log reveal that the proposed approach improves overall system performance in terms of the average query execution time.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] A Cost-Aware Strategy for Query Result Caching in Web Search Engines
    Altingovde, Ismail Sengor
    Ozcan, Rifat
    Ulusoy, Oezguer
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 628 - 636
  • [2] Cost-Aware Result Caching for Meta-Search Engines
    Bakkal, Emre
    Altingovde, Ismail Sengor
    Toroslu, Ismail Hakki
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 739 - 742
  • [3] User-Aware Caching and Prefetching Query Results in Web Search Engines
    Ma, Hongyuan
    Wang, Bin
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1163 - 1164
  • [4] Topical result caching in web search engines
    Mele, Ida
    Tonellotto, Nicola
    Frieder, Ophir
    Perego, Raffaele
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [5] Cost-aware query planning for similarity search
    Lange, Dustin
    Naumann, Felix
    INFORMATION SYSTEMS, 2013, 38 (04) : 455 - 469
  • [6] Analysis of Cost-Aware Policies for Intersection Caching in Search Nodes
    Feuerstein, Esteban
    Tolosa, Gabriel
    PROCEEDINGS OF 2013 32ND INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016, : 4 - 7
  • [7] A machine learning, approach for result caching in web search engines
    Kucukyilmaz, Tayfun
    Cambazoglu, B. Barla
    Aykanat, Cevdet
    Baeza-Yates, Ricardo
    INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (04) : 834 - 850
  • [8] Exploiting Query Term Correlation for List Caching in Web Search Engines
    Tong, Jiancong
    Wang, Gang
    Stones, Douglas S.
    Sun, Shizhao
    Liu, Xiaoguang
    Zhang, Fan
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1817 - 1820
  • [9] Exploiting Navigational Queries for Result Presentation and Caching in Web Search Engines
    Ozcan, Rifat
    Altingovde, Ismail Sengor
    Ulusoy, Ozgur
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (04): : 714 - 726
  • [10] Caching-Aware Techniques for Query Workload Partitioning in Parallel Search Engines
    Xu, Chuanfei
    Wang, Yanqiu
    Lv, Pin
    Xu, Jia
    2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 44 - 49