Web Page Classification Using RNN

被引:15
|
作者
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
机构
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
来源
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019] | 2019年 / 154卷
关键词
web page classification; classification; categorization; deep learning; RNN; transfer learning;
D O I
10.1016/j.procs.2019.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification is an information retrieval application that provides useful information that can be a basis for many different application domains. In this study, a deep learning-based system has been developed for the classification of web pages. The meta tag information contained in the web page is used to classify a web page. The meta tags used are title, description and keywords. RNN based deep learning architecture was used during the tests. Transfer learning is the name given to the approach to building a machine learning model with the use of pre-trained parameters to solve a problem. The effect of using transfer learning on the system has also been examined. According to the results obtained, success rate of web page classification system is approximately 85%. It is not observed that transfer learning has significant contribution to the success rates. However, the use of transfer learning has reduced the consumed system resources. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:62 / 72
页数:11
相关论文
共 50 条
  • [21] Data Mining Techniques for Web Page Classification
    Fiol-Roig, Gabriel
    Miro-Julia, Margaret
    Herraiz, Eduardo
    HIGHLIGHTS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2011, 89 : 61 - 68
  • [22] Automatic Web Page Classification Using Visual Content for Subjective and Functional Variables
    Goncalves, Nuno
    Videira, Antonio
    WEB INFORMATION SYSTEMS AND TECHNOLOGIES, WEBIST 2014, 2015, 226 : 279 - 294
  • [23] Web Page Classification using n-gram based URL Features
    Rajalakshmi, R.
    Aravindan, Chandrabose
    2013 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2013, : 15 - 21
  • [24] Web Page Classification based on Unsupervised Learning using MIME type Analysis
    Roberto Jimenez, Luis
    2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2021, : 375 - 377
  • [25] A review of machine learning algorithms for web page classification
    Lassri, Safae
    El Habib, Benlahmar
    Abderrahim, Tragha
    2018 IEEE 5TH INTERNATIONAL CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'18), 2018, : 220 - 226
  • [26] Web page classification based on a simplified swarm optimization
    Lee, Ji-Hyun
    Yeh, Wei-Chang
    Chuang, Mei-Chi
    APPLIED MATHEMATICS AND COMPUTATION, 2015, 270 : 13 - 24
  • [27] A Tool for Link-Based Web Page Classification
    Hernandez, Inma
    Rivero, Carlos R.
    Ruiz, David
    Corchuelo, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7023 : 443 - 452
  • [28] A Web Page Classification Algorithm Based On Link Information
    Xu, Zhaohui
    Yan, Fuliang
    Qin, Jie
    Zhu, Haifeng
    2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 82 - 86
  • [29] Hybrid Dimensionality Reduction Approach for Web Page Classification
    Sarode, Shraddha
    Gadge, Jayant
    2015 International Conference on Communication, Information & Computing Technology (ICCICT), 2015,
  • [30] Efficient Machine Learning Technique for Web Page Classification
    S. Markkandeyan
    M. Indra Devi
    Arabian Journal for Science and Engineering, 2015, 40 : 3555 - 3566