Web Page Classification Using RNN

Cited by: 15
Authors
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
Affiliation
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
Source
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019] | 2019, Vol. 154
Keywords
web page classification; classification; categorization; deep learning; RNN; transfer learning
DOI
10.1016/j.procs.2019.06.011
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Web page classification is an information retrieval application whose output can serve as a basis for many different application domains. In this study, a deep learning-based system was developed for classifying web pages. A page is classified using the meta tag information it contains: the title, description and keywords tags. An RNN-based deep learning architecture was used in the tests. Transfer learning is the approach of building a machine learning model from pre-trained parameters to solve a problem, and its effect on the system was also examined. According to the results, the success rate of the web page classification system is approximately 85%. Transfer learning was not observed to contribute significantly to the success rate; however, it reduced the system resources consumed. (C) 2019 The Authors. Published by Elsevier Ltd.
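The abstract names the title, description and keywords meta tags as the classifier's input but does not say how they are read from a page. A minimal sketch of that extraction step, assuming Python's standard `html.parser` module; the names `MetaTagExtractor` and `meta_text` are hypothetical and not taken from the paper:

```python
from html.parser import HTMLParser


class MetaTagExtractor(HTMLParser):
    """Collects the title, description and keywords of an HTML page.

    Hypothetical helper for illustration; the paper does not describe
    its own extraction code.
    """

    def __init__(self):
        super().__init__()
        self.fields = {"title": "", "description": "", "keywords": ""}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # The parser lowercases tag and attribute names for us.
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr = dict(attrs)
            name = (attr.get("name") or "").lower()
            if name in ("description", "keywords"):
                self.fields[name] = (attr.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Only text inside <title>...</title> is accumulated.
        if self._in_title:
            self.fields["title"] += data


def meta_text(html):
    """Return title, description and keywords joined into one string."""
    parser = MetaTagExtractor()
    parser.feed(html)
    parser.close()
    parts = [parser.fields[k].strip() for k in ("title", "description", "keywords")]
    return " ".join(p for p in parts if p)
```

The joined string would then be tokenized and passed to a sequence model. Concatenating the three fields is an assumption made here for simplicity; the abstract does not state how the fields are combined.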
Pages: 62-72
Number of pages: 11