Predicting TV programme audience by using twitter based metrics

被引:16
作者
Crisci, Alfonso [1 ,2 ]
Grasso, Valentina [1 ]
Nesi, Paolo [3 ]
Pantaleo, Gianni [3 ]
Paoli, Irene [3 ]
Zaza, Imad [3 ]
机构
[1] CNR, IBIMET, Natl Res Council, Florence, Italy
[2] Tuscany Reg CNR, LAMMA Consortium, Sesto Fiorentino, Italy
[3] Univ Florence, Dept Informat Engn DINFO, Distributed Syst & Internet Data Intelligence & T, DISIT Lab, Florence, Italy
基金
欧盟地平线“2020”;
关键词
Twitter monitoring; Social media monitoring; Predicting audience; Twitter data analysis; REGRESSION; SELECTION;
D O I
10.1007/s11042-017-4880-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The predictive capabilities of metrics based on Twitter data have been stressed in different fields: business, health, market, politics, etc. In specific cases, a deeper analysis is required to create useful metrics and models with predicting capabilities. In this paper, a set of metrics based on Twitter data have been identified and presented in order to predict the audience of scheduled television programmes, where the audience is highly involved such as it occurs with reality shows (i.e., X Factor and Pechino Express, in Italy). Identified suitable metrics are based on the volume of tweets, the distribution of linguistic elements, the volume of distinct users involved in tweeting, and the sentiment analysis of tweets. On this ground a number of predictive models have been identified and compared. The resulting method has been selected in the context of a validation and assessment by using real data, with the aim of building a flexible framework able to exploit the predicting capabilities of social media data. Further details are reported about the method adopted to build models which focus on the identification of predictors by their statistical significance. Experiments have been based on the collected Twitter data by using Twitter Vigilance platform, which is presented in this paper, as well.
引用
收藏
页码:12203 / 12232
页数:30
相关论文
共 54 条
[1]  
Achrekar Harshavardhan, 2012, Proceedings of the International Conference on Health Informatics. HEALTHINF 2012, P61
[2]   FACTOR-ANALYSIS AND AIC [J].
AKAIKE, H .
PSYCHOMETRIKA, 1987, 52 (03) :317-332
[3]  
[Anonymous], ED PSYCHOL MEAS
[4]  
[Anonymous], SOC SCI COMPUT REV
[5]  
[Anonymous], INT C LANG RES EV IS
[6]  
[Anonymous], 2002, P 1 INT WORDNET C MY
[7]  
[Anonymous], 2006, AAAI 2006 SPRING S C
[8]  
[Anonymous], PEER J PREPRINTS
[9]  
[Anonymous], P 5 INT C UB INF MAN
[10]  
[Anonymous], 2012, INT J COMPUT APPL, DOI DOI 10.5120/8852-2794