Taxonomy and Evaluation for Microblog Popularity Prediction

被引:64
作者
Gao, Xiaofeng [1 ]
Cao, Zhenhao [1 ]
Li, Sha [2 ]
Yao, Bin [3 ]
Chen, Guihai [4 ]
Tang, Shaojie [5 ]
机构
[1] Shanghai Jiao Tong Univ, F1503023, Shanghai 200240, Peoples R China
[2] Univ Illinois, 201 N Goodwin Ave, Urbana, IL 61801 USA
[3] Shanghai Jiao Tong Univ, Room 3-505,SEIEE Bldg,800 Dong Chuan Rd, Shanghai 200240, Peoples R China
[4] Shanghai Jiao Tong Univ, SEIEE 3-422,800 Dongchuan Rd, Shanghai 200240, Peoples R China
[5] Univ Texas Dallas, 800 West Campbell Rd,SM 33, Richardson, TX 75080 USA
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Social network; popularity prediction; evaluation; taxonomy; MODEL; HASHTAGS;
D O I
10.1145/3301303
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As social networks become amajor source of information, predicting the outcome of information diffusion has appeared intriguing to both researchers and practitioners. By organizing and categorizing the joint efforts of numerous studies on popularity prediction, this article presents a hierarchical taxonomy and helps to establish a systematic overview of popularity prediction methods for microblog. Specifically, we uncover three lines of thoughts: the feature-based approach, time-series modelling, and the collaborative filtering approach and analyse them, respectively. Furthermore, we also categorize prediction methods based on their underlying rationale: whether they attempt to model the motivation of users or monitor the early responses. Finally, we put these prediction methods to test by performing experiments on real-life data collected from popular social networks Twitter and Weibo. We compare the methods in terms of accuracy, efficiency, timeliness, robustness, and bias. As far as we are concerned, there is no precedented survey aimed at microblog popularity prediction at the time of submission. By establishing a taxonomy and evaluation for the first time, we hope to provide an in-depth review of state-of-the-art prediction methods and point out directions for further research. Our evaluations show that time-series modelling has the advantage of high accuracy and the ability to improve over time. The feature-based methods using only temporal features performs nearly as well as using all possible features, producing average results. This suggests that temporal features do have strong predictive power and that power is better exploited with time-series models. On the other hand, this implies that we know little about the future popularity of an item before it is posted, which may be the focus of further research.
引用
收藏
页数:40
相关论文
共 104 条
[1]  
Ahmed Mohamed, 2013, P 6 ACM INT C WEB SE, P607, DOI [DOI 10.1145/2433396.2433473, 10.1145/2433396.2433473]
[2]  
Alzahrani Sultan, 2015, Social Computing, Behavioral-Cultural Modeling and Prediction. 8th International Conference, SBP 2015. Proceedings: LNCS 9021, P3, DOI 10.1007/978-3-319-16268-3_1
[3]  
[Anonymous], 2010, WORKSH COMP SOC SCI
[4]  
[Anonymous], 2010, 2010 IEEE 2 INT C SO, DOI DOI 10.1109/SOCIALCOM.2010.33
[5]  
[Anonymous], 2011, Proceedings of the 28th International Con- ference on Machine Learning (ICML 2011)
[6]  
[Anonymous], 2010, P 19 ACM INT C INF K, DOI DOI 10.1145/1871437.1871691
[7]  
[Anonymous], 2015, P 24 ACM INT C INFOR, DOI DOI 10.1145/2806416.2806650
[8]  
Asur S., 2010, Proceedings 2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT), P492, DOI 10.1109/WI-IAT.2010.63
[9]  
Bakshy E, 2011, P 4 ACM INT C WEB SE, DOI DOI 10.1145/1935826.1935845
[10]   Modeling and Predicting Popularity Dynamics via an Influence-based Self-Excited Hawkes Process [J].
Bao, Peng .
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :1897-1900