An Ontology-Based Topical Crawling Algorithm for Accessing Deep Web Content

Cited by: 3
Authors
Arya, K. V. [1 ]
Vadlamudi, Baby Ramya [1 ]
Affiliation
[1] IIITM, ABV, Gwalior 474010, India
Source
2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT) | 2012
Keywords
Focused crawler; Domain ontology; Deep web; Form processing;
DOI
10.1109/ICCCT.2012.10
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Given the volume of the Web and how frequently its content changes, the coverage and quality of pages retrieved by modern search engines are relatively low, since these engines crawl only hypertext links and ignore search forms, which are the entry points to deep web content, where two-thirds of information resides. This paper presents an algorithm that enables topical crawlers to access hidden web content by using a domain ontology to determine a form's relevance to the domain. The domain considered in this work is scientific research publications. Experimental results show that the proposed approach outperforms keyword-based crawlers in terms of both relevancy and completeness.
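The abstract does not give the details of the algorithm, so the following is only a minimal sketch of the general idea of scoring a search form against a domain ontology. The toy ontology, the names `FormTermExtractor` and `form_relevance`, the scoring rule (fraction of ontology concepts matched), and the threshold are all assumptions made for illustration, not the authors' method.

```python
from html.parser import HTMLParser
import re

# Hypothetical toy ontology for the scientific-publications domain:
# concept -> related terms. Illustrative only, not taken from the paper.
ONTOLOGY = {
    "publication": {"paper", "article", "publication", "proceedings"},
    "author": {"author", "writer", "creator"},
    "title": {"title"},
    "venue": {"conference", "venue", "journal"},
    "year": {"year", "date"},
    "keyword": {"keyword", "keywords", "topic", "subject"},
}

# Assumed cut-off above which a form is treated as domain-relevant.
RELEVANCE_THRESHOLD = 0.3


def tokenize(text):
    """Split text into lowercase alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


class FormTermExtractor(HTMLParser):
    """Collects words from field attributes and visible text inside <form>."""

    def __init__(self):
        super().__init__()
        self.in_form = False
        self.terms = []

    def handle_starttag(self, tag, attrs):
        if tag == "form":
            self.in_form = True
        elif self.in_form and tag in ("input", "select", "textarea"):
            for key, value in attrs:
                if key in ("name", "id", "placeholder") and value:
                    self.terms += tokenize(value)

    def handle_endtag(self, tag):
        if tag == "form":
            self.in_form = False

    def handle_data(self, data):
        if self.in_form:
            self.terms += tokenize(data)


def form_relevance(html, ontology=ONTOLOGY):
    """Return the fraction of ontology concepts mentioned by the form."""
    parser = FormTermExtractor()
    parser.feed(html)
    terms = set(parser.terms)
    matched = {concept for concept, words in ontology.items() if words & terms}
    return len(matched) / len(ontology)


search_form = (
    '<form><label>Author</label><input name="author">'
    '<label>Title</label><input name="title">'
    '<input name="pub_year"></form>'
)
login_form = '<form><input name="username"><input name="password"></form>'

is_relevant = form_relevance(search_form) >= RELEVANCE_THRESHOLD
```

In this sketch a crawler would submit queries only through forms whose score reaches the threshold; a keyword-based crawler, by contrast, would match raw strings without grouping synonyms under shared concepts.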
Pages: 1 / 6
Page count: 6