Sparse attention based separable dilated convolutional neural network for targeted sentiment analysis

Cited by: 70
Authors
Gan, Chenquan [1]
Wang, Lu [1]
Zhang, Zufan [1]
Wang, Zhangyi [1]
Affiliations
[1] Chongqing University of Posts and Telecommunications, School of Communication and Information Engineering, Chongqing 400065, People's Republic of China
Keywords
Targeted sentiment analysis; Sparse attention; Separable dilated CNN; Multichannel embedding
DOI
10.1016/j.knosys.2019.06.035
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Long short-term memory (LSTM) networks and classical convolutional neural networks (CNNs) are two important methods for targeted sentiment analysis, but LSTMs are difficult to parallelize and therefore time-inefficient, while classical CNNs capture only local semantic features. To address this, this paper proposes a sparse attention based separable dilated convolutional neural network (SA-SDCCN), which consists of a multichannel embedding layer, a separable dilated convolution module, a sparse attention layer, and an output layer. The work concentrates on the first three parts. In the multichannel embedding layer, semantic and sentiment embeddings are combined into an embedding tensor, which builds richer representations of the input sequence. In the separable dilated convolution module, long-range contextual semantic information is explored, and multi-scale contextual semantic dependencies are aggregated simultaneously through diverse dilation rates; the separable structure further reduces the number of model parameters. In the sparse attention layer, sentiment-oriented components are attended to according to the features of the specific target entity. Finally, experiments on three benchmark datasets demonstrate that SA-SDCCN achieves performance comparable to or better than state-of-the-art methods, with higher parallelism and lower computational cost. (C) 2019 Elsevier B.V. All rights reserved.
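The two claims the abstract makes about the separable dilated convolution module (fewer parameters, and multi-scale context through several dilation rates) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the "same" zero padding, the concatenation of branches, and all sizes used below are assumptions chosen only for illustration.

```python
import numpy as np

def standard_conv_params(k, c_in, c_out):
    # Weights of an ordinary 1D convolution (bias ignored).
    return k * c_in * c_out

def separable_conv_params(k, c_in, c_out):
    # Depthwise filter (one k-tap filter per input channel)
    # plus a 1x1 pointwise projection (bias ignored).
    return k * c_in + c_in * c_out

def receptive_field(k, dilation):
    # Span of input positions covered by a single dilated kernel.
    return (k - 1) * dilation + 1

def separable_dilated_conv1d(x, depthwise, pointwise, dilation):
    """x: (seq_len, c_in); depthwise: (k, c_in); pointwise: (c_in, c_out)."""
    seq_len, _ = x.shape
    k = depthwise.shape[0]
    span = (k - 1) * dilation
    pad_left = span // 2
    # Assumed zero padding so the output keeps the input length.
    padded = np.pad(x, ((pad_left, span - pad_left), (0, 0)))
    out = np.zeros_like(x, dtype=float)
    for j in range(k):
        # Tap j reads the input shifted by j * dilation positions.
        start = j * dilation
        out += padded[start:start + seq_len] * depthwise[j]
    return out @ pointwise  # pointwise (1x1) channel mixing

def multi_scale(x, depthwise, pointwise, dilations):
    # Assumed aggregation: concatenate the branches along the channel axis.
    branches = [separable_dilated_conv1d(x, depthwise, pointwise, d)
                for d in dilations]
    return np.concatenate(branches, axis=1)
```

With a kernel of width 3, 300 input channels, and 100 output channels (illustrative sizes, not taken from the paper), the standard convolution has 90,000 weights and the separable one 30,900, while a dilation rate of 4 widens the span of a single kernel from 3 to 9 positions.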
Pages: 10