A METHOD FOR SEARCH ENGINE RETRIEVAL SYSTEM BASED ON HYBRID TRIE-INVERTED FILE

Cited by: 0
Authors
Yang, Hongwei [1]
Zhou, Hua [1]
Affiliation
[1] Yunnan Univ, Sch Software, Kunming 650021, Peoples R China
Source
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2 | 2009
Keywords
Search engine; Inverted file; Hybrid Trie; WEB; COMMUNICATION;
DOI
Not available
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Due to the rapid growth in the size of the web, web search engines face enormous performance challenges. This paper presents an inverted file method that improves the effectiveness of document retrieval. It combines document identifier data and document weight data to make the retrieval process more efficient. Moreover, because the evaluation of containment queries relies on merge-joining the inverted lists, we propose the Hybrid Trie-Inverted file (HTI) index and conduct several experiments to evaluate the approach. The experimental results show that it achieves higher performance than a traditional inverted file index.
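The abstract does not describe the internal layout of the HTI index, so the following is only a minimal sketch of the general idea it names: a trie used as the term dictionary, inverted lists of (document identifier, weight) pairs stored at the term nodes, and a merge-join over two sorted lists to answer a conjunctive query. The names `TrieNode`, `TrieInvertedIndex`, and `merge_join`, and the choice to sum the weights of matching documents, are assumptions made for illustration and are not taken from the paper.

```python
import bisect
from typing import Dict, List, Tuple

Posting = Tuple[int, float]  # (document id, term weight)


class TrieNode:
    """One character step in the term dictionary (hypothetical layout)."""

    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        # Inverted list for the term that ends at this node, sorted by doc id.
        self.postings: List[Posting] = []


class TrieInvertedIndex:
    """Trie-based dictionary whose term nodes carry inverted lists."""

    def __init__(self) -> None:
        self.root = TrieNode()

    def add(self, term: str, doc_id: int, weight: float) -> None:
        node = self.root
        for ch in term:
            node = node.children.setdefault(ch, TrieNode())
        # insort keeps the list ordered by doc id, which merge_join relies on.
        bisect.insort(node.postings, (doc_id, weight))

    def lookup(self, term: str) -> List[Posting]:
        node = self.root
        for ch in term:
            if ch not in node.children:
                return []  # term is not in the dictionary
            node = node.children[ch]
        return node.postings


def merge_join(a: List[Posting], b: List[Posting]) -> List[Posting]:
    """Intersect two doc-id-sorted inverted lists in one linear pass.

    Weights of matching documents are summed (an illustrative assumption).
    """
    i = j = 0
    result: List[Posting] = []
    while i < len(a) and j < len(b):
        if a[i][0] == b[j][0]:
            result.append((a[i][0], a[i][1] + b[j][1]))
            i += 1
            j += 1
        elif a[i][0] < b[j][0]:
            i += 1
        else:
            j += 1
    return result
```

A two-term query would call `lookup` once per term and pass both lists to `merge_join`; because each list is kept sorted by document identifier, the join touches every posting at most once.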
Pages: 1663-1668
Page count: 6