Sentiment Strength Detection for the Social Web

Cited by: 664
Authors
Thelwall, Mike [1]
Buckley, Kevan [1]
Paltoglou, Georgios [1]
Affiliations
[1] Wolverhampton Univ, Sch Technol, Stat Cybermetr Res Grp, Wolverhampton WV1 1SB, England
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2012 / Vol. 63 / No. 01
Keywords
POLARITY; OPINIONS;
DOI
10.1002/asi.21662
Chinese Library Classification
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Sentiment analysis is concerned with the automatic extraction of sentiment-related information from text. Although most sentiment analysis addresses commercial tasks, such as extracting opinions from product reviews, there is increasing interest in the affective dimension of the social web, and Twitter in particular. Most sentiment analysis algorithms are not ideally suited to this task because they exploit indirect indicators of sentiment that can reflect genre or topic instead. Hence, such algorithms used to process social web texts can identify spurious sentiment patterns caused by topics rather than affective phenomena. This article assesses an improved version of the algorithm SentiStrength for sentiment strength detection across the social web that primarily uses direct indications of sentiment. The results from six diverse social web data sets (MySpace, Twitter, YouTube, Digg, Runners World, BBC Forums) indicate that SentiStrength 2 is successful in the sense of performing better than a baseline approach for all data sets in both supervised and unsupervised cases. SentiStrength is not always better than machine-learning approaches that exploit indirect indicators of sentiment, however, and is particularly weaker for positive sentiment in news-related discussions. Overall, the results suggest that, even unsupervised, SentiStrength is robust enough to be applied to a wide variety of different social web contexts.
Pages: 163-173
Page count: 11