A data-driven neural network architecture for sentiment analysis

被引:4
作者
Cano, Erion [1 ]
Morisio, Maurizio [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, Turin, Italy
关键词
Sentiment analysis; Opinion mining; Convolution neural networks; Deep learning architectures; Text data set properties; Word embeddings;
D O I
10.1108/DTA-03-2018-0017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose The fabulous results of convolution neural networks in image-related tasks attracted attention of text mining, sentiment analysis and other text analysis researchers. It is, however, difficult to find enough data for feeding such networks, optimize their parameters, and make the right design choices when constructing network architectures. The purpose of this paper is to present the creation steps of two big data sets of song emotions. The authors also explore usage of convolution and max-pooling neural layers on song lyrics, product and movie review text data sets. Three variants of a simple and flexible neural network architecture are also compared. Design/methodology/approach The intention was to spot any important patterns that can serve as guidelines for parameter optimization of similar models. The authors also wanted to identify architecture design choices which lead to high performing sentiment analysis models. To this end, the authors conducted a series of experiments with neural architectures of various configurations. Findings The results indicate that parallel convolutions of filter lengths up to 3 are usually enough for capturing relevant text features. Also, max-pooling region size should be adapted to the length of text documents for producing the best feature maps. Originality/value Top results the authors got are obtained with feature maps of lengths 6-18. An improvement on future neural network models for sentiment analysis could be generating sentiment polarity prediction of documents using aggregation of predictions on smaller excerpt of the entire text.
引用
收藏
页码:2 / 19
页数:18
相关论文
共 35 条
  • [1] [Anonymous], THESIS
  • [2] [Anonymous], 2015, Arxiv.Org, DOI DOI 10.3389/FPSYG.2013.00124
  • [3] [Anonymous], INT C INT SYST MET S
  • [4] [Anonymous], 2017, Computer Science & Information Technology (CS & IT), DOI DOI 10.5121/CSIT.2017.70603
  • [5] Bertin-Mahieux Thierry, 2011, P 12 INT SOC MUSIC I
  • [6] Bischoff Kerstin., 2009, 10th International Society for Music Information Retrieval Conference, P657
  • [7] Cano Erion, 2018, Trends and Advances in Information Systems and Technologies. Advances in Intelligent Systems and Computing (AISC 745), P330, DOI 10.1007/978-3-319-77703-0_34
  • [8] Cano E., 2018, P INT C GEOINFORMATI, P122
  • [9] Quality of Word Embeddings on Sentiment Analysis Tasks
    Cano, Erion
    Morisio, Maurizio
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 332 - 338
  • [10] A Review of Data Fusion Techniques
    Castanedo, Federico
    [J]. SCIENTIFIC WORLD JOURNAL, 2013,