Ontology-based web crawler

Cited by: 19
Authors
Ganesh, S [1]
Jayaraj, M [1]
Kalyan, V [1]
Murthy, S [1]
Aghila, G [1]
Affiliation
[1] Pondicherry Engn Coll, Dept CSE & IT, Pondicherry, India
Source
ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS | 2004
Keywords
web crawler; ordering-metric; importance-metrics; association-metric; ontology;
DOI
10.1109/ITCC.2004.1286658
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Building a web crawler that downloads the most relevant pages remains a major challenge in the field of information retrieval systems. Link analysis algorithms such as PageRank, together with other importance-metrics, have opened a new approach to prioritizing the URL queue so that more relevant pages are downloaded first. This paper proposes combining these metrics with a new metric called the association-metric. The association-metric estimates the semantic content of a URL based on a domain-dependent ontology, which in turn strengthens the metric used for prioritizing the URL queue. In addition, after a page is downloaded, the association-metric plays an important role in estimating the relevancy of the links in that page. The proposed new metric will solve, to an optimal level, the major problem of determining the relevancy of pages before they are crawled.
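The abstract does not give formulas, so the following is only a minimal sketch of the general idea: blending an importance score with an ontology-based association score to order a URL queue. The term weights, the linear blend with `alpha`, and the names `association_score`, `combined_priority`, and `UrlFrontier` are all illustrative assumptions, not the authors' definitions.

```python
import heapq
import re

# Hypothetical domain ontology: term -> weight (illustrative values only).
ONTOLOGY_WEIGHTS = {"crawler": 1.0, "ontology": 0.9, "retrieval": 0.7, "semantic": 0.6}


def association_score(text, weights=ONTOLOGY_WEIGHTS):
    """Sum the weights of ontology terms found in a URL or anchor text, capped at 1.0."""
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    return min(1.0, sum(w for term, w in weights.items() if term in tokens))


def combined_priority(url, importance, alpha=0.5):
    """Blend an importance score in [0, 1] with the association score (assumed linear mix)."""
    return alpha * importance + (1 - alpha) * association_score(url)


class UrlFrontier:
    """Priority queue of URLs; the highest combined score is popped first."""

    def __init__(self):
        self._heap = []
        self._seen = set()

    def push(self, url, importance):
        if url in self._seen:
            return  # skip URLs that were already queued
        self._seen.add(url)
        # heapq is a min-heap, so store the negated score.
        heapq.heappush(self._heap, (-combined_priority(url, importance), url))

    def pop(self):
        neg_score, url = heapq.heappop(self._heap)
        return url, -neg_score


frontier = UrlFrontier()
frontier.push("http://example.org/sports/news", importance=0.8)
frontier.push("http://example.org/ontology-crawler", importance=0.4)
best_url, best_score = frontier.pop()
```

In this sketch the second URL is popped first even though its importance score is lower, because its tokens match ontology terms. A real system would derive the importance score from link analysis and the association score from a proper ontology rather than a flat term table.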
Pages: 337-341
Page count: 5