Visual Twitter Analytics (Vista) Temporally changing sentiment and the discovery of emergent themes within sport event tweets

被引:32
作者
Hoeber, Orland [1 ]
Hoeber, Larena [2 ]
El Meseery, Maha [1 ]
Odoh, Kenneth [1 ]
Gopi, Radhika [1 ]
机构
[1] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
[2] Univ Regina, Fac Kinesiol & Hlth Studies, Regina, SK S4S 0A2, Canada
关键词
Twitter; Visual analytics; Exploratory data analysis; Sport analytics; BIG DATA; COMMUNICATION; CHALLENGES; AGENDA;
D O I
10.1108/OIR-02-2015-0067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - Due to the size and velocity at which user generated content is created on social media services such as Twitter, analysts are often limited by the need to pre-determine the specific topics and themes they wish to follow. Visual analytics software may be used to support the interactive discovery of emergent themes. The paper aims to discuss these issues. Design/methodology/approach - Tweets collected from the live Twitter stream matching a user's query are stored in a database, and classified based on their sentiment. The temporally changing sentiment is visualized, along with sparklines showing the distribution of the top terms, hashtags, user mentions, and authors in each of the positive, neutral, and negative classes. Interactive tools are provided to support sub-querying and the examination of emergent themes. Findings - A case study of using Vista to analyze sport fan engagement within a mega-sport event (2013 Le Tour de France) is provided. The authors illustrate how emergent themes can be identified and isolated from the large collection of data, without the need to identify these a priori. Originality/value - Vista provides mechanisms that support the interactive exploration among Twitter data. By combining automatic data processing and machine learning methods with interactive visualization software, researchers are relieved of tedious data processing tasks, and can focus on the analysis of high-level features of the data. In particular, patterns of Twitter use can be identified, emergent themes can be isolated, and purposeful samples of the data can be selected by the researcher for further analysis.
引用
收藏
页码:25 / 41
页数:17
相关论文
共 33 条
[1]   Seeing beyond reading: a survey on visual text analytics [J].
Alencar, Aretha B. ;
de Oliveira, Maria Cristina F. ;
Paulovich, Fernando V. .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (06) :476-492
[2]  
[Anonymous], 2011, BIG DATA ANAL
[3]  
[Anonymous], P 18 ACM SIGKDD INT
[4]  
[Anonymous], 2009, TECHNICAL REPORT
[5]  
[Anonymous], 2010, DESIGNING USER INTER
[6]  
Archambault D., 2011, Proceedings of the 3rd International Workshop on Search and Mining User Generated Contents, ACM CIKM, P77
[7]  
Blaszka M., 2012, International Journal of Sport Communication, V5, P435
[8]   D3: Data-Driven Documents [J].
Bostock, Michael ;
Ogievetsky, Vadim ;
Heer, Jeffrey .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) :2301-2309
[9]   "May No Act of Ours Bring Shame": Fan-Enacted Crisis Communication Surrounding the Penn State Sex Abuse Scandal [J].
Brown, Natalie A. ;
Brown, Kenon A. ;
Billings, Andrew C. .
COMMUNICATION & SPORT, 2015, 3 (03) :288-311
[10]  
Cheong Marc, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3125, DOI 10.1109/ICPR.2010.765