Attention-based word embeddings using Artificial Bee Colony algorithm for aspect-level sentiment classification

被引:24
作者
Zhang, Ming [1 ]
Palade, Vasile [2 ]
Wang, Yan [3 ]
Ji, Zhicheng [3 ]
机构
[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Hubei Prov Key Lab Intelligent Robot, Wuhan 430205, Peoples R China
[2] Coventry Univ, Sch Comp Elect & Math, Coventry CV1 5FB, W Midlands, England
[3] Jiangnan Univ, Minist Educ, Engn Res Ctr Internet Things Technol Applicat, Wuxi 214122, Jiangsu, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Aspect-level sentiment classification; Attention mechanism; Word embeddings; artificial Bee Colony algorithm; Support Vector Machine; OPTIMIZATION;
D O I
10.1016/j.ins.2020.09.038
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Considering that most popular models solving aspect-level sentiment classification problems mainly focus on designing complicated neural networks to scale the importance of each word in the sentence, this paper addresses this problem from the view of semantic space. Motivated by the fact that the senses of a word can be sophisticatedly embedded into the semantic space using a distributed representation, this paper hypothesizes that each sense of a word can be represented by one or more specific dimensions, and thus the target of aspect-level sentiment classification can be simplified into searching the related dimensions for the aspects and sentiments concerned. Particularly, an Attention Vector (ATV) based on attention mechanism is designed for each aspect in terms of a specific task, which involves two sub-vectors, i.e., a Dimension Attention Vector (DATV) and a Sentiment Attention Vector (SATV). The DATV determines the significances of different dimensions based on their correlations with an aspect; and the SATV allocates weights for the attributes of words, which are decided by sentiment polarities and part-of-speech (PoS) tagging. Given a sub-dataset related to an aspect , the ATV will be optimized by an Artificial Bee Colony (ABC) algorithm with a Support Vector Machine (SVM) classifier, the objective of which is to maximize classification accuracy. Intrinsically, the DATV can reduce the ambiguity existing in polysemy, meanwhile, the SATV is an auxiliary means for the optimization of the DATV , which can help eliminate the misunderstandings caused by antonyms. Then, the optimized DATV will be applied on a Convolutional Neural Network (CNN) model via simply scaling the pretrained word embeddings as inputs (named as ATV-CNN model). Experimental results show that the ATV-CNN model can have substantial advantages when compared with state-of-the-art models. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:713 / 738
页数:26
相关论文
共 48 条
[1]   Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums [J].
Abbasi, Ahmed ;
Chen, Hsinchun ;
Salem, Arab .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (03)
[2]   Feature Selection Using Multi-objective Optimization for Aspect Based Sentiment Analysis [J].
Akhtar, Md Shad ;
Kohail, Sarah ;
Kumar, Amit ;
Ekbal, Asif ;
Biemann, Chris .
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 :15-27
[3]  
[Anonymous], 2016, P 7 WORKSH COMP APPR
[4]  
[Anonymous], 2014, P 8 INT WORKSH SEM E, DOI DOI 10.3115/V1/S14-2076
[5]  
Augustyniak L, 2014, 2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), P924, DOI 10.1109/ASONAM.2014.6921696
[6]  
Baccianella S., 2010, P 7 INT C LANG RES O, V10, P2200
[7]   A Statistical and Evolutionary Approach to Sentiment Analysis [J].
Carvalho, Jonnathan ;
Prado, Adriana ;
Plastino, Alexandre .
2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014, :110-117
[8]  
Cruse David Allan., 1986, Lexical semantics
[9]  
Firth J. R., 1957, STUDIES LINGUISTIC A
[10]   Enhancing artificial bee colony algorithm using more information-based search equations [J].
Gao, Wei-feng ;
Liu, San-yang ;
Huang, Ling-ling .
INFORMATION SCIENCES, 2014, 270 :112-133