Sentiment Analysis of Review Text Based on BiGRU-Attention and Hybrid CNN

被引:19
作者
Zhu, Qiannan [1 ]
Jiang, Xiaofan [2 ]
Ye, Renzhen [1 ]
机构
[1] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Peoples R China
[2] China Construct Bank, Beijing 100033, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
中国国家自然科学基金;
关键词
Feature extraction; Convolutional neural networks; Sentiment analysis; Computational modeling; Analytical models; Data mining; Standards; Bidirectional gated recurrent unit; depthwise separable convolution; dilated convolution; focal loss; self-attention mechanism;
D O I
10.1109/ACCESS.2021.3118537
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNN), recurrent neural networks (RNN), attention, and their variants are extensively applied in the sentiment analysis, and the effect of fusion model is expected to be better. However, fusion model is confronted with some problems such as complicated structure, excessive trainable parameters, and long training time. The classification effect of traditional model with cross entropy loss as loss function is undesirable since sample category imbalance as well as ease and difficulty of sample classification is not taken into account. In order to solve these problems, the model BiGRU-Att-HCNN is proposed on the basis of bidirectional gated recurrent unit (BiGRU), attention, and hybrid convolutional neural networks. In this model, BiGRU and self-attention are combined to acquire global information, and key information weight is supplemented. Two parallel convolutions (dilated convolution and standard convolution) are used to obtain multi-scale characteristic information with relatively less parameters, and the standard convolution is replaced with depthwise separable convolution with two-step calculations. Traditional max-pooling and average-pooling are discarded, and global average pooling is applied to substitute the pooling layer and the fully-connected layer simultaneously, making it possible to substantially decrease the number of model parameters and reduce over-fitting. In our model, focal loss is used as the loss function to tackle the problems of unbalanced sample categories and hard samples. Experimental results illustrate that in terms of multiple indicators, our model outperforms the 15 benchmark models, even with intermediate number of trainable parameters.
引用
收藏
页码:149077 / 149088
页数:12
相关论文
共 49 条
[1]   Social media sentiment analysis through parallel dilated convolutional neural network for smart city applications [J].
Alam, Muhammad ;
Abid, Fazeel ;
Cong Guangpei ;
Yunrong, L., V .
COMPUTER COMMUNICATIONS, 2020, 154 (154) :129-137
[2]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[3]  
Bai S, ARXIV
[4]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
[5]   Sentiment Classification Based on Part-of-Speech and Self-Attention Mechanism [J].
Cheng, Kefei ;
Yue, Yanan ;
Song, Zhiwen .
IEEE ACCESS, 2020, 8 :16387-16396
[6]  
Cho K., 2014, P C EMP METH NAT LAN, P1724, DOI DOI 10.3115/V1/D14-1179
[7]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]   Jointly Modeling Aspects, Ratings and Sentiments for Movie Recommendation (JMARS) [J].
Diao, Qiming ;
Qiu, Minghui ;
Wu, Chao-Yuan ;
Smola, Alexander J. ;
Jiang, Jing ;
Wang, Chong .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :193-202
[10]  
Duque A. B., ARXIV190109821