Document-Level Sentiment Analysis Using Attention-Based Bi-Directional Long Short-Term Memory Network and Two-Dimensional Convolutional Neural Network

被引:18
作者
Mao, Yanying [1 ,2 ]
Zhang, Yu [3 ,4 ]
Jiao, Liudan [3 ]
Zhang, Heshan [3 ]
机构
[1] Chongqing Coll Elect Engn, Dept Commun Engn, Chongqing 401331, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[3] Chongqing Jiaotong Univ, Sch Econ & Management, Chongqing 400074, Peoples R China
[4] TY Lin Int Engn Consulting China Co Ltd, Chongqing 401121, Peoples R China
基金
中国国家自然科学基金;
关键词
sentiment analysis; bidirectional LSTM; 2DCNN; attention mechanism; MECHANISM; CNN;
D O I
10.3390/electronics11121906
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to outstanding feature extraction ability, neural networks have recently achieved great success in sentiment analysis. However, one of the remaining challenges of sentiment analysis is to model long texts to consider the intrinsic relations between two sentences in the semantic meaning of a document. Moreover, most existing methods are not powerful enough to differentiate the importance of different document features. To address these problems, this paper proposes a new neural network model: AttBiLSTM-2DCNN, which entails two perspectives. First, a two-layer, bidirectional long short-term memory (BiLSTM) network is utilized to obtain the sentiment semantics of a document. The first BiLSTM layer learns the sentiment semantic representation from both directions of a sentence, and the second BiLSTM layer is used to encode the intrinsic relations of sentences into the document matrix representation with a feature dimension and a time-step dimension. Second, a two-dimensional convolutional neural network (2DCNN) is employed to obtain more sentiment dependencies between two sentences. Third, we utilize a two-layer attention mechanism to distinguish the importance of words and sentences in the document. Last, to validate the model, we perform an experiment on two public review datasets that are derived from Yelp2015 and IMDB. Accuracy, F1-Measure, and MSE are used as evaluation metrics. The experimental results show that our model can not only capture sentimental relations but also outperform certain state-of-the-art models.
引用
收藏
页数:15
相关论文
共 44 条
[1]  
Agarwal B, 2020, ALGO INTELL SY, P1, DOI 10.1007/978-981-15-1216-2
[2]  
Agarwal Basant, 2013, Computational Linguistics and Intelligent Text Processing. 14th International Conference, CICLing 2013. Proceedings, P13, DOI 10.1007/978-3-642-37256-8_2
[3]  
Ain QT, 2017, INT J ADV COMPUT SC, V8, P424
[4]   Transformer-Based Graph Convolutional Network for Sentiment Analysis [J].
AlBadani, Barakat ;
Shi, Ronghua ;
Dong, Jian ;
Al-Sabri, Raeed ;
Moctard, Oloulade Babatounde .
APPLIED SCIENCES-BASEL, 2022, 12 (03)
[5]  
[Anonymous], 2018, Speech and Language Processing
[6]  
[Anonymous], 2016, P 2016 C EMPIRICAL M, DOI 10.18653/v1/d16-1024
[7]   Cross-ratio uninorms as an effective aggregation mechanism in sentiment analysis [J].
Appel, Orestes ;
Chiclana, Francisco ;
Carter, Jenny ;
Fujita, Hamido .
KNOWLEDGE-BASED SYSTEMS, 2017, 124 :16-22
[8]   Co-LSTM: Convolutional LSTM model for sentiment analysis in social big data [J].
Behera, Ranjan Kumar ;
Jena, Monalisa ;
Rath, Santanu Kumar ;
Misra, Sanjay .
INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (01)
[9]   Sentiment analysis for user reviews using Bi-LSTM self-attention based CNN model [J].
Bhuvaneshwari, P. ;
Rao, A. Nagaraja ;
Robinson, Y. Harold ;
Thippeswamy, M. N. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (09) :12405-12419
[10]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001