Webpages Classification Based on Deep Belief Network Using Images and Text Information

被引:0
作者
Hu, Ruiguang [1 ]
Gao, Shibo [1 ]
Yang, Libo [1 ]
机构
[1] Beijing Aerosp Automat Control Inst, Natl Key Lab Sci & Technol Aerosp Intelligent Con, Beijing 100854, Peoples R China
来源
2018 IEEE CSAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC) | 2018年
关键词
WEB PAGES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, Deep Belief Network(DBN) is used for drug-related webpages classification. DEVIL parsing is used to extract image-label text and body text, FOCARSS method is used to choose effective images. text representation is generated by BOW model, images representation is generated by BOF model. We concatenate images and text representation to generate final representation. It is shown that DBN's classification accuracy is higher than BPNN's classification accuracy, and better than that of single-modal information.
引用
收藏
页数:4
相关论文
共 13 条
  • [1] [Anonymous], 2017, P IEEE INT EL MACH D, DOI DOI 10.1109/IEMDC.2017.8002046
  • [2] [Anonymous], 2009, NIPS WORKSH DEEP LEA
  • [3] Estivill-Castro V., 2018, P AUSTR COMP SCI WEE, P17
  • [4] Fei-Fei L, 2005, PROC CVPR IEEE, P524
  • [5] Heinrich G, 2017, MEDIA BUS INNOV, P55, DOI 10.1007/978-3-319-27786-8_6
  • [6] Hu R. G., 2015, 9 INT S MULT IM PROC
  • [7] Hu R. G., 2013, INT C MULT TECHN
  • [8] Recognition of pornographic web pages by classifying texts and images
    Hu, Weiming
    Wu, Ou
    Chen, Zhouyao
    Fu, Zhouyu
    Maybank, Steve
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (06) : 1019 - 1034
  • [9] Web Pages Classification with Parliamentary Optimization Algorithm
    Kiziloluk, Soner
    Ozer, Ahmet Bedri
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2017, 27 (03) : 499 - 513
  • [10] Mohamed T., 2017, INT J ENG COMPUTER S, V6, P35