Keyword Focused Web Crawler

被引:0
作者
Agre, Gunjan H. [1 ]
Mahajan, Nikita V. [1 ]
机构
[1] GH Raisoni Coll Engn, Dept Comp Sci & Engn, Nagpur, Maharashtra, India
来源
2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS) | 2015年
关键词
Web crawler; keyword; knowledge path; topic specific web crawler; ontology;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Users and uses of internet is growing tremendously these days which causing an extreme trouble and efforts at user side to get web pages searched which are as per concern and relevant to user's requirementGenerally users approach to search web pages from a large available hierarchy of concepts or use a query to browse web pages from available search engine and receive results based on search pattern where few of the results are relevant to search and most of them are not. Web crawler plays an important role in search engine and act as a key element when performance is considered. This paper includes domain engineering concept and keyword driven crawling with relevancy decision mechanism and uses Ontology concepts which ensures the best path for improving crawler's performance. This paper introduces extraction of URLs based on keyword or search criteria. It extracts URLs for web pages which contains searched keyword in their content and considers such pages only as important and doesn't download web pages irrelevant to search. It offers high optimality comparing with traditional web crawler and can enhance search efficiency with more accuracy.
引用
收藏
页码:1089 / 1092
页数:4
相关论文
共 13 条
  • [1] Agarwal A., 2010, P 3 ACM C WEB SEARCH
  • [2] Ch N., 2006, P ACM IEEE WIC INT C
  • [3] Focused crawling: a new approach to topic-specific Web resource discovery
    Chakrabarti, S
    van den Berg, M
    Dom, B
    [J]. COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 1999, 31 (11-16): : 1623 - 1640
  • [4] Debajyoti Sukanta, 2007, P 10 INT C INF TECHN
  • [5] Diligenti Michelangelo, VLDB, P527
  • [6] Dumais S., 2000, SIGIR Forum, V34, P256
  • [7] Ontology-based web crawler
    Ganesh, S
    Jayaraj, M
    Kalyan, V
    Murthy, S
    Aghila, G
    [J]. ITCC 2004: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 2, PROCEEDINGS, 2004, : 337 - 341
  • [8] Gauch Susan, UMUAI
  • [9] Kumar Amritesh, 2010, P IEEE
  • [10] Menczer F, 2001, P 24 ANN INT ACM SIG, P241, DOI DOI 10.1145/383952.383995