Web Page Classification Using RNN

被引:15
作者
Buber, Ebubekir [1 ]
Diri, Banu [1 ]
机构
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
来源
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019] | 2019年 / 154卷
关键词
web page classification; classification; categorization; deep learning; RNN; transfer learning;
D O I
10.1016/j.procs.2019.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Web page classification is an information retrieval application that provides useful information that can be a basis for many different application domains. In this study, a deep learning-based system has been developed for the classification of web pages. The meta tag information contained in the web page is used to classify a web page. The meta tags used are title, description and keywords. RNN based deep learning architecture was used during the tests. Transfer learning is the name given to the approach to building a machine learning model with the use of pre-trained parameters to solve a problem. The effect of using transfer learning on the system has also been examined. According to the results obtained, success rate of web page classification system is approximately 85%. It is not observed that transfer learning has significant contribution to the success rates. However, the use of transfer learning has reduced the consumed system resources. (C) 2019 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:62 / 72
页数:11
相关论文
共 50 条
  • [41] A new algorithm for uncertain problem of web page classification
    Zhang, X. (zhangshenyang@126.com), 1600, Academy Publisher (07): : 526 - 531
  • [42] Dictionary-based Bilingual Web Page Classification
    Liu, Jicheng
    Liang, Chunyan
    Qi, Jianxun
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11542 - 11545
  • [43] Classification of Extreme Reviews from Online Products Using RNN Model
    Naganjaneyulu, Satuluri
    Tarun, G.
    Sriram, Y.
    Devi, B. Rama
    Manoj, P.
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 297 - 303
  • [44] An optimized approach for massive web page classification using entity similarity based on semantic network
    Li, Huakang
    Xu, Zheng
    Li, Tao
    Sun, Guozi
    Choo, Kim-Kwang Raymond
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 76 : 510 - 518
  • [45] A semantic based Web page classification strategy using multi-layered domain ontology
    Ahmed I. Saleh
    Mohammed F. Al Rahmawy
    Arwa E. Abulwafa
    World Wide Web, 2017, 20 : 939 - 993
  • [46] Web Page Classification based on Context to the Content Extraction of Articles
    Patel, Ankit Dilip
    Pandya, Vimal N.
    2017 2ND INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2017, : 539 - 541
  • [47] A Novel Feature Selection Framework for Automatic Web Page Classification
    J.Alamelu Mangai
    V.Santhosh Kumar
    S.Appavu alias Balamurugan
    International Journal of Automation and Computing, 2012, (04) : 442 - 448
  • [48] Stemming Text-based Web Page Classification using Machine Learning Algorithms: A Comparison
    Razali, Ansari
    Daud, Salwani Mohd
    Zin, Nor Azan Mat
    Shahidi, Faezehsadat
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 570 - 576
  • [49] A Web page classification system based on a genetic algorithm using tagged-terms as features
    Ozel, Selma Ayse
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) : 3407 - 3415
  • [50] Deep Learning Based Classification of Visual Behavior on Web Page
    Zhang, Meng-jie
    Lv, Sheng-fu
    Li, Mi
    INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING (ICEECE 2015), 2015, : 266 - 270