Attention-based aspect sentiment classification using enhanced learning through CNN-BiLSTM networks

被引:51
作者
Ayetiran, Eniafe Festus [1 ]
机构
[1] Achievers Univ, Dept Math Sci, Owo, Nigeria
关键词
Transfer learning; Attention; CNN; Bi LSTM; Joint learning;
D O I
10.1016/j.knosys.2022.109409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks (DNN) techniques for aspect -based sentiment classification have been widely studied. The success of these methods depends largely on training data which are often inadequate because of the rigor involved in manually tagging large collection of opinionated texts. Attempts have been made to transfer knowledge from document -level to aspect -level sentiment task. However, the success of this approach is also dependent on the model because aspect sentiment data like other type of texts contain complex semantic features. In this paper, we present an attention -based deep learning technique which jointly learns on document and aspect -level sentiment data and which also transfers learning from the document -level data to aspect -level sentiment classification. It basically consists of a convolutional layer and a bidirectional long short-term memory (BiLSTM) layer. The first variant of our technique uses convolutional neural network (CNN) to extract high-level semantic features. The output of the feature extraction is then fed into the BiLSTM layer which captures the contextual feature representation of the texts. The second variant applies the BiLSTM layer directly on the input data. In both variants, the output hidden representation is passed to an output layer using softmax activation function for sentiment polarity classification. We evaluate our model on four standard benchmark datasets which shows the effectiveness of our approach with improvements over baselines. We also conduct ablation studies to show the effect of the different document -level weights on the learning techniques. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 44 条
[1]  
Abadi M., 2015, TensorFlow: Large-scale machine learning on heterogeneous systems
[2]  
Ayetiran E.F., 2015, NATURAL LANGUAGE PRO, P15, DOI [10.1515/9781501501289.15, DOI 10.1515/9781501501289.15]
[3]   EDS-MEMBED: Multi-sense embeddings based on enhanced distributional semantic structures via a graph walk over word senses [J].
Ayetiran, Eniafe Festus ;
Sojka, Petr ;
Novotny, Vit .
KNOWLEDGE-BASED SYSTEMS, 2021, 219
[4]   An index-based joint multilingual/cross-lingual text categorization using topic expansion via BabelNet [J].
Ayetiran, Eniafe Festus .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2020, 28 (01) :224-237
[5]   An optimized Lesk-based algorithm for word sense disambiguation [J].
Ayetiran, Eniafe Festus ;
Agbele, Kehinde .
OPEN COMPUTER SCIENCE, 2018, 8 (01) :165-172
[6]  
Bao LX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, P253
[7]  
Chen P., 2017, P 2017 C EMP METH NA, P452, DOI [10.18653/v1/D17-1047, DOI 10.18653/V1/D17-1047]
[8]  
Chen Z, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P547
[9]   The effect of word of mouth on sales: Online book reviews [J].
Chevalier, Judith A. ;
Mayzlin, Dina .
JOURNAL OF MARKETING RESEARCH, 2006, 43 (03) :345-354
[10]  
Chollet F., 2015, Keras