The FedLemur project: Federated search in the real world

被引:33
作者
Avrahami, TT [1 ]
Yau, L [1 ]
Si, L [1 ]
Callan, J [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Language Technol Inst, Pittsburgh, PA 15213 USA
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2006年 / 57卷 / 03期
关键词
D O I
10.1002/asi.20283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated search and distributed information retrieval systems provide a single user interface for searching multiple full-text search engines. They have been an active area of research for more than a decade, but in spite of their success as a research topic, they are still rare in operational environments. This article discusses a prototype federated search system developed for the U.S. government's FedStats Web portal, and the issues addressed in adapting research solutions to this operational environment. A series of experiments explore how well prior research results, parameter settings, and heuristics apply in the FedStats environment. The article concludes with a set of lessons learned from this technology transfer effort, including observations about search engine quality in the "real world."
引用
收藏
页码:347 / 358
页数:12
相关论文
共 26 条
  • [1] [Anonymous], P 18 INT ACM SIGIR C
  • [2] Query-based sampling of text databases
    Callan, J
    Connell, M
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2001, 19 (02) : 97 - 130
  • [3] Callan J, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P479, DOI 10.1145/304181.304224
  • [4] CALLAN J, 2000, ADV INFORM RETRIEVAL, P127
  • [5] Conrad J. G., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P71
  • [6] CRASWELL N, 2000, P 5 ACM C DIG LIB SA, P37
  • [7] FRENCH JC, 1998, P 3D ACM INT C DIG L, P283
  • [8] FRENCH JC, 1999, P 22 ANN INT ACM SIG
  • [9] A decision-theoretic approach to database selection in networked IR
    Fuhr, N
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1999, 17 (03) : 229 - 249
  • [10] GlOSS:: Text-source discovery over the Internet
    Gravano, L
    García-Molina, H
    Tomasic, A
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 1999, 24 (02): : 229 - 264