Efficient Deep Learning Model for Text Classification Based on Recurrent and Convolutional Layers

被引:32
作者
Hassan, Abdalraouf [1 ]
Mahmood, Ausif [1 ]
机构
[1] Univ Bridgeport, Dep Comp Sci & Engn, Bridgeport, CT 06604 USA
来源
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA) | 2017年
关键词
Convlutional Neural Network; Bidirectional Recurrent Neural Network; Long Short-Term Memroy; Recurrent Neural Network;
D O I
10.1109/ICMLA.2017.00009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Processing (NLP) systems conventionally treat words as distinct atomic symbols. The model can leverage small amounts of information regarding the relationship between the individual symbols. Today when it comes to texts; one common technique to extract fixed-length features is bag-of-words. Despite its popularity the bag-of-words feature has two major weaknesses: it ignores semantics of the words and the order of words. In this paper, we propose a neural language model that relies on Convolutional Neural Network (CNN) and Bidirectional Recurrent Neural Network (BRNN) over pre-trained word vectors. We utilize bidirectional layers as a substitute of pooling layers in CNN in order to reduce the loss of detailed local information, and to capture long-term dependencies across input sequences. We validate the proposed model on two benchmark sentiment analysis datasets, Stanford Large Movie Review (IMDB), and Stanford Sentiment Treebank (SSTb). Our model achieves a competitive advantage compared with neural language models on the sentiment analysis datasets.
引用
收藏
页码:1108 / 1113
页数:6
相关论文
共 27 条
[1]  
[Anonymous], 2014, Generating sequences with recurrent neural networks
[2]  
[Anonymous], 2012, P 50 ANN M ASS COMP
[3]  
[Anonymous], 2013, P C EMPIRICAL METHOD
[4]  
[Anonymous], 2014, P 23 INT C WORLD WID
[5]  
[Anonymous], 2014, 52 ANN M ASS COMP LI
[6]  
[Anonymous], 2011, ICML
[7]  
[Anonymous], 2015, ADV NEURAL INFORM PR
[8]  
[Anonymous], 2015, P 2015 C EMP METH NA
[9]  
[Anonymous], 2011, P 49 ANN M ASS COMPU
[10]  
[Anonymous], 2016, Efficient character-level document classification by combining convolution and recurrent layers