Character-level text classification via convolutional neural network and gated recurrent unit

被引:8
作者
Bing Liu
Yong Zhou
Wei Sun
机构
[1] China University of Mining and Technology,School of Computer Science and Technology
[2] Mine Digitization Engineering Research Center of Minstry of Education of the People’s Republic of China,College of Information and Control Engineering
[3] Insititute of Electrics,undefined
[4] Chinese Academy of Sciences,undefined
[5] China University of Mining and Technology,undefined
来源
International Journal of Machine Learning and Cybernetics | 2020年 / 11卷
关键词
Text categorization; Convolutional neural network; Gated recurrent unit; Highway network;
D O I
暂无
中图分类号
学科分类号
摘要
Text categorization, or text classification, is one of key tasks for representing the semantic information of documents. Traditional deep leaning models for text categorization are generally time-consuming on large scale datasets due to slow convergence rate or heavily rely on the pre-trained word vectors. Motivated by fully convolutional networks in the field of image processing, we introduce fully convolutional layers to substantially reduce the number of parameters in the text classification model. A character-level model for short text classification, integrating convolutional neural network, bidirectional gated recurrent unit, highway network with the fully connected layers, is proposed to capture both the global and the local textual semantics at the fast convergence speed. Furthermore, In addition, error minimization extreme learning machine is incorporated into the proposed model to improve the classification accuracy further. Extensive experiments show that our approach achieves the state-of-the-art performance compared with the existing methods on the large scale text datasets.
引用
收藏
页码:1939 / 1949
页数:10
相关论文
共 70 条
  • [11] Hussain A(2015)A C-LSTM neural network for text classification Comput Sci 1 39-44
  • [12] Cambria E(2011)Natural language processing (almost) from scratch J Mach Learn Res 12 2493-2537
  • [13] Gong Y(2015)Character-level convolutional networks for text classification Adv Neural Inf Process Syst 28 1-9
  • [14] Ke Q(2007)Words versus character n-grams for anti-spam filtering Int J Artif Intell Tools 16 1047-1067
  • [15] Isard M(2016)Learning word dependencies in text by means of a deep recurrent belief network Knowl-Based Syst 108 144-154
  • [16] Lazebnik S(2016)Aspect extraction for opinion mining with a deep convolutional neural network Knowl-Based Syst 108 42-49
  • [17] Poria S(2017)Deep learning based document modeling for personality detection from text IEEE Intell Syst 32 74-79
  • [18] Cambria E(2009)Error minimized extreme learning machine with growth of hidden nodes and incremental learning IEEE Trans Neural Netw 20 1352-1357
  • [19] Howard N(2006)Multilabel neural networks with applications to functional genomics and text categorization IEEE Trans Knowl Data Eng 18 1338-1351
  • [20] Huang G-B(2015)Finding function in form: compositional character models for open vocabulary word representation Comput Sci 10 1899-1907