A Cooperative Schema between Web Sever and Search Engine for Improving Freshness of Web Repository

被引：1

作者：

WEN Kun-mei

机构：

来源：

WuhanUniversityJournalofNaturalSciences | 2006年 / 01期

关键词：

search engine; freshness; cooperative schema;

D O I：

暂无

中图分类号：

TP393.092 [];

学科分类号：

080402 ;

摘要：

Because the web is huge and web pages are updated frequently, the index maintained by a search engine has to refresh web pages periodically. This is extremely resource consuming because the search engine needs to crawl the web and download web pages to refresh its index. Based on present technologies of web refreshing, we present a cooperative schema between web server and search engine for maintaining freshness of web repository. The web server provides meta-data defined through XML standard to describe web sites. Before updating the web page the crawler visits the meta-data files. If the meta-data indicates that the page is not modified, then the crawler will not update it. So this schema can save bandwidth resource. A primitive model based on the schema is implemented. The cost and efficiency of the schema are analyzed.

引用

页码：11 / 14

页数：4

共 50 条

[1] A cooperative schema between web sever and search engine for improving freshness of web repository
College of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
Wuhan Univ J Nat Sci, 2006, 1 (11-14):
[2] The freshness of web search engine databases
Lewandowski, D
Wahlig, H
Meyer-Bautor, G
JOURNAL OF INFORMATION SCIENCE, 2006, 32 (02) : 131 - 148
[3] EMACrawler: web search engine database freshness optimization
Alanoglu, Zuelfue
Akcayol, M. Ali
JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2024, 27 (06):
[4] Internet search engine freshness by web server help
Gupta, V
Campbell, R
2001 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2001, : 113 - 119
[5] Analysis of Web freshness strategies and its improvement in search engine
Wen, Kunmei
Lu, Zhengding
Ye, Weiguo
Jin, Li
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2002, 30 (12):
[6] A three-year study on the freshness of web search engine databases
Lewandowski, Dirk
JOURNAL OF INFORMATION SCIENCE, 2008, 34 (06) : 817 - 831
[7] Cooperation schemes between a Web server and a Web search engine
Castillo, C
FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, : 212 - 213
[8] The Infocious Web search engine: Improving Web searching through linguistic analysis
Ntoulas, Alexandros
Chao, Gerald
Cho, Junghoo
Journal of Digital Information Management, 2007, 5 (05): : 277 - 291
[9] Using web archive for improving search engine results
Jatowt, A
Kawail, Y
Tanaka, K
FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 893 - 898
[10] Capturing Page Freshness for Web Search
Dai, Na
Davison, Brian D.
SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 871 - 872

← 1 2 3 4 5 →