When experts agree: Using non-affiliated experts to rank popular topics

被引:15
作者
Bharat, K [1 ]
Mihaila, GA [1 ]
机构
[1] Compaq Syst Res Ctr, Palo Alto, CA USA
关键词
design; experimentation; WWW search; ranking; link analysis; host affiliation; connectivity; authorities; topic experts;
D O I
10.1145/503104.503107
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In response to a query, a search engine returns a ranked list of documents. If the query is about a popular topic (i.e., it matches many documents), then the returned list is usually too long to view fully. Studies show that users usually look at only the top 10 to 20 results. However, we can exploit the fact that the best targets for popular topics are usually linked to by enthusiasts in the same domain. In this paper, we propose a novel ranking scheme for popular topics that places the most authoritative pages on the query topic at the top of the ranking. Our algorithm operates on a special index of "expert documents." These are a subset of the pages on the WWW identified as directories of links to non-affiliated sources on specific topics. Results are ranked based on the match between the query and relevant descriptive text for hyperlinks on expert pages pointing to a given result page. We present a prototype search engine that implements our ranking scheme and discuss its performance. With a relatively small (2.5 million page) expert index, our algorithm was able to perform comparably on popular queries with the best of the mainstream search engines.
引用
收藏
页码:47 / 58
页数:12
相关论文
共 7 条
[1]  
Bharat K., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P104, DOI 10.1145/290941.290972
[2]  
Brin S., 1998, 7 INT WORLD WIDE WEB
[3]   Automatic resource compilation by analyzing hyperlink structure and associated text [J].
Chakrabarti, S ;
Dom, B ;
Raghava, P ;
Rajagopalan, S ;
Gibson, D ;
Kleinberg, J .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7) :65-74
[4]  
CHAKRABARTI S, 1999, P 8 WORLD WID WEB C
[5]   Authoritative sources in a hyperlinked environment [J].
Kleinberg, JM .
JOURNAL OF THE ACM, 1999, 46 (05) :604-632
[6]   The stochastic approach for link-structure analysis (SALSA) and the TKC effect [J].
Lempel, R ;
Moran, S .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :387-401
[7]  
McBryan O.A., 1994, P 1 INT WORLD WIDE W, P79