More than Bags of Words: Sentiment Analysis with Word Embeddings

被引:113
作者
Rudkowsky, Elena [1 ]
Haselmayer, Martin [2 ]
Wastian, Matthias [3 ]
Jenny, Marcelo [4 ]
Emrich, Stefan [5 ]
Sedlmair, Michael [6 ]
机构
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] Univ Vienna, Dept Govt, Vienna, Austria
[3] Vienna Univ Technol, Ctr Computat Complex Syst, Vienna, Austria
[4] Univ Innsbruck, Dept Polit Sci, Innsbruck, Austria
[5] Drahtwarenhandlung Dwh GmbH, Vienna, Austria
[6] Jacobs Univ Bremen, Comp Sci, Bremen, Germany
关键词
ELECTORAL CAMPAIGNS; TEXT ANALYSIS; BAD-NEWS; NEGATIVITY; FREQUENCY; MODELS;
D O I
10.1080/19312458.2018.1455817
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
Moving beyond the dominant bag-of-words approach to sentiment analysis we introduce an alternative procedure based on distributed word embeddings. The strength of word embeddings is the ability to capture similarities in word meaning. We use word embeddings as part of a supervised machine learning procedure which estimates levels of negativity in parliamentary speeches. The procedure's accuracy is evaluated with crowdcoded training sentences; its external validity through a study of patterns of negativity in Austrian parliamentary speeches. The results show the potential of the word embeddings approach for sentiment analysis in the social sciences.
引用
收藏
页码:140 / 157
页数:18
相关论文
共 66 条
[1]   Political leaders and the media. Can we measure political leadership images in newspapers using computer-assisted content analysis? [J].
Aaldering, Loes ;
Vliegenthart, Rens .
QUALITY & QUANTITY, 2016, 50 (05) :1871-1905
[2]  
[Anonymous], 2001, STERREICHISCHEN ABGE
[3]  
[Anonymous], 2012, P ACL 2012 SYST DEM, DOI 10.1145/1935826.1935854
[4]  
[Anonymous], 2013, P 17 C COMP NAT LANG
[5]  
[Anonymous], 2014, arXiv preprint arXiv:1410.5329
[6]  
Baumeister R. F., 2001, REV GEN PSYCHOL, V5, P323, DOI [10.1037//1089-2680.5.4.323, DOI 10.1037//1089-2680.5.4.323, https://doi.org/10.1037/1089-2680.5.4.323]
[7]   Crowd-sourced Text Analysis: Reproducible and Agile Production of Political Data [J].
Benoit, Kenneth ;
Conway, Drew ;
Lauderdale, Benjamin E. ;
Laver, Michael ;
Mikhaylov, Slava .
AMERICAN POLITICAL SCIENCE REVIEW, 2016, 110 (02) :278-295
[8]   TAKING STOCK OF THE TOOLKIT An overview of relevant automated content analysis approaches and techniques for digital journalism scholars [J].
Boumans, Jelle W. ;
Trilling, Damian .
DIGITAL JOURNALISM, 2016, 4 (01) :8-23
[9]   Teaching the Computer to Code Frames in News: Comparing Two Supervised Machine Learning Approaches to Frame Analysis [J].
Burscher, Bjoern ;
Odijk, Daan ;
Vliegenthart, Rens ;
de Rijke, Maarten ;
de Vreese, Claes H. .
COMMUNICATION METHODS AND MEASURES, 2014, 8 (03) :190-206
[10]  
Ceron A., 2017, POLITICS BIG DATA NO