A visual framework for dynamic emotional web analysis

被引:15
作者
Martin de Diego, Isaac [1 ]
Fernandez-Isabel, Alberto [1 ]
Ortega, Felipe [1 ]
Moguerza, Javier M. [1 ]
机构
[1] Rey Juan Carlos Univ, Data Sci Lab, C Tulipan S-N, Mostoles 28933, Spain
关键词
Sentiment analysis; Combination of information; Multidimensional scaling; Knowledge representation; Unsupervised learning system;
D O I
10.1016/j.knosys.2018.01.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is focused on detecting opinions and emotions directly linked to relevant topics in textual data. Its application for the automated analysis of large datasets with text from websites has become a major challenge today. Common approaches proposed for this task are based on predefined dictionaries of words, each one tagged with a positive or negative polarity beforehand. A known limitation of these systems is that they may return inaccurate estimations of the polarity of opinions, according to the actual number of words considered in the analysis. In addition, these systems do not usually include an intuitive graphical interface to facilitate the understanding of similarities between terms or gauge how their sentiment polarization evolves over time. In this paper we present EmoWeb, a prototype of a new tool for dynamic sentiment analysis of textual content from websites. This prototype includes a visual and dynamic framework to analyze texts, based on a well-established lexicon. An unsupervised learning algorithm can append new words and calculate or update their sentiment polarization and strength over time. Moreover, it can increase the number of words considered for sentiment analysis to improve the accuracy of results. A novel dynamic visualization module makes it easier for end users to interpret sentiments associated to terms and their changes. The prototype has been empirically evaluated in two experiments with real data gathered from news websites. Results are promising and illustrate the applicability of this approach for sentiment analysis of textual web content. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:264 / 273
页数:10
相关论文
共 50 条
[1]  
Agirre E, 2006, TEXT SPEECH LANG TEC, V33, P1, DOI 10.1007/978-1-4020-4809-8
[2]   A multi-stage method for content classification and opinion mining on weblog comments [J].
Alfaro, Cesar ;
Cano-Montero, Javier ;
Gomez, Javier ;
Moguerza, Javier M. ;
Ortega, Felipe .
ANNALS OF OPERATIONS RESEARCH, 2016, 236 (01) :197-213
[3]  
[Anonymous], 1999, MODERN INFORM RETRIE
[4]  
[Anonymous], 1978, MULTIDIMENSIONAL SCA
[5]  
[Anonymous], 2017, DEEP LEARNING KERAS
[6]  
[Anonymous], 2017, THE GUARDIAN
[7]  
[Anonymous], 2015, Sentic computing: a common-sense-based framework for concept-level sentiment analysis
[8]  
[Anonymous], 1984, Approaches to emotion
[9]  
[Anonymous], 2012, Mining text data
[10]  
[Anonymous], 2011, J COMPUT SCI-NETH, DOI DOI 10.1016/j.jocs.2010.12.007