Web Page Classification Using RNN

被引:18
作者
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
机构
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
来源
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019] | 2019年 / 154卷
关键词
web page classification; classification; categorization; deep learning; RNN; transfer learning;
D O I
10.1016/j.procs.2019.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification is an information retrieval application that provides useful information that can be a basis for many different application domains. In this study, a deep learning-based system has been developed for the classification of web pages. The meta tag information contained in the web page is used to classify a web page. The meta tags used are title, description and keywords. RNN based deep learning architecture was used during the tests. Transfer learning is the name given to the approach to building a machine learning model with the use of pre-trained parameters to solve a problem. The effect of using transfer learning on the system has also been examined. According to the results obtained, success rate of web page classification system is approximately 85%. It is not observed that transfer learning has significant contribution to the success rates. However, the use of transfer learning has reduced the consumed system resources. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:62 / 72
页数:11
相关论文
共 20 条
[11]  
MENCZER F., 2004, ACM Trans. Internet Technol, V4, P378, DOI DOI 10.1145/1031114.1031117
[12]  
Mikolov Tomas, 2013, P 1 INT C LEARN REPR
[13]  
Ozel Selma Ayse, 2011, 2011 International Symposium on Innovations in Intelligent Systems and Applications (INISTA 2011), P282, DOI 10.1109/INISTA.2011.5946076
[14]   A Web page classification system based on a genetic algorithm using tagged-terms as features [J].
Ozel, Selma Ayse .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) :3407-3415
[15]  
Pennington J., 2014, P 2014 C EMP METH NA, P1532
[16]   Web Page Classification: Features and Algorithms [J].
Qi, Xiaoguang ;
Davison, Brian D. .
ACM COMPUTING SURVEYS, 2009, 41 (02)
[17]  
Ribeiro A, 2003, LECT NOTES ARTIF INT, V2663, P103
[18]   Web page feature selection and classification using neural networks [J].
Selamat, A ;
Omatu, S .
INFORMATION SCIENCES, 2004, 158 :69-88
[19]  
Sriurai Wongkot, 2010, INT JOPURNAL COMPUTE, V7
[20]  
Wakaki T., 2006, Web Intelligence and Agent Systems, V4, P431