Character-level text classification via convolutional neural network and gated recurrent unit

被引:8
作者
Bing Liu
Yong Zhou
Wei Sun
机构
[1] China University of Mining and Technology,School of Computer Science and Technology
[2] Mine Digitization Engineering Research Center of Minstry of Education of the People’s Republic of China,College of Information and Control Engineering
[3] Insititute of Electrics,undefined
[4] Chinese Academy of Sciences,undefined
[5] China University of Mining and Technology,undefined
来源
International Journal of Machine Learning and Cybernetics | 2020年 / 11卷
关键词
Text categorization; Convolutional neural network; Gated recurrent unit; Highway network;
D O I
暂无
中图分类号
学科分类号
摘要
Text categorization, or text classification, is one of key tasks for representing the semantic information of documents. Traditional deep leaning models for text categorization are generally time-consuming on large scale datasets due to slow convergence rate or heavily rely on the pre-trained word vectors. Motivated by fully convolutional networks in the field of image processing, we introduce fully convolutional layers to substantially reduce the number of parameters in the text classification model. A character-level model for short text classification, integrating convolutional neural network, bidirectional gated recurrent unit, highway network with the fully connected layers, is proposed to capture both the global and the local textual semantics at the fast convergence speed. Furthermore, In addition, error minimization extreme learning machine is incorporated into the proposed model to improve the classification accuracy further. Extensive experiments show that our approach achieves the state-of-the-art performance compared with the existing methods on the large scale text datasets.
引用
收藏
页码:1939 / 1949
页数:10
相关论文
共 70 条
  • [1] Zhang W(2015)TESC: an approach to text classification using semi-supervised clustering Knowl-Based Syst 75 152-160
  • [2] Tang X(2018)DRI-RCNN: an approach to deceptive review identification using recurrent convolutional neural network Inf Process Manag 54 576-592
  • [3] Yoshida T(2017)A review of affective computing: from unimodal analysis to multimodal fusion Inf Fusion 37 98-125
  • [4] Zhang W(2016)Affective computing and sentiment analysis IEEE Intell Syst 31 102-107
  • [5] Du Y(2014)A multi-view embedding space for modeling internet images, tags, and their semantics Int J Comput Vision 106 210-233
  • [6] Yoshida T(2016)Fusing audio, visual and textual clues for sentiment analysis from multimodal content Neurocomputing 174 50-59
  • [7] Wang Q(2014)Jumping NLP curves: a review of natural language processing research IEEE Comput Intell Mag 9 48-57
  • [8] Poria S(2017)Text classification method based on self-training and LDA topic models Expert Syst Appl 80 83-93
  • [9] Cambria E(2015)Open-categorical text classification based on multi-LDA models Soft Comput 19 29-38
  • [10] Bajpai R(2018)What is wrong with topic modeling? And how to fix it using search-based software engineering Inform Software Tech 98 74-88