Query Forwarding in Geographically Distributed Search Engines

被引:0
作者
Barla Cambazoglu, B. [1 ]
Varol, Emre
Kayaaslan, Enver
Aykanat, Cevdet
Baeza-Yates, Ricardo [1 ]
机构
[1] Yahoo Res, Barcelona, Spain
来源
SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL | 2010年
关键词
Search engines; distributed IR; query forwarding; optimization; linear programming; index replication; result caching;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query forwarding is an important technique for preserving the result quality in distributed search engines where the index is geographically partitioned over multiple search sites. The key component in query forwarding is the thresholding algorithm by which the forwarding decisions are given. In this paper, we propose a linear-programming-based thresholding algorithm that significantly outperforms the current state-of-the-art in terms of achieved search efficiency values. Moreover, we evaluate a greedy heuristic for partial index replication and investigate the impact of result cache freshness on query forwarding performance. Finally, we present some optimizations that improve the performance further, under certain conditions. We evaluate the proposed techniques by simulations over a real-life setting, using a large query log and a document collection obtained from Yahoo!.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 17 条
  • [1] [Anonymous], 2009, CIKM 09
  • [2] The impact of caching on search engines
    Baeza-Yates, Ricardo
    Gionis, Aristides
    Junqueira, Flavio
    Murdock, Vanessa
    Plachouras, Vassilis
    Silvestri, Fabrizio
    [J]. Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07, 2007, : 183 - 190
  • [3] Baeza-Yates R., 2007, DATA ENG, P6
  • [4] Baeza-Yates R, 2009, 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, P252
  • [5] Efficiency Trade-Offs in Two-Tier Web Search Systems
    Baeza-Yates, Ricardo
    Murdock, Vanessa
    Hauff, Claudia
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 163 - 170
  • [6] Quantifying Performance and Quality Gains in Distributed Web Search Engines
    Barla Cambazoglu, B.
    Plachouras, Vassilis
    Baeza-Yates, Ricardo
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 411 - 418
  • [7] Callan J. P., 1995, SIGIR Forum, P21
  • [8] Cambazoglu B. B., 2010, 19 INT C WO IN PRESS
  • [9] CAMBAZOGLU BB, 2008, P 3 INT C SCAL INF S
  • [10] Das G., 2006, Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB'06), P451