Do Many Models Make Light Work? Evaluating Ensemble Solutions for Improved Rumor Detection

被引:10
作者
Kim, Younghwan [1 ]
Kim, Huy Kang [1 ]
Kim, Hyoungshick [2 ,3 ]
Hong, Jin B. [4 ]
机构
[1] Korea Univ, Sch Cybersecur, Seoul 02841, South Korea
[2] Sungkyunkwan Univ, Dept Comp Sci & Engn, Coll Software, Seoul 16419, South Korea
[3] CSIRO Data61, Eveleigh, NSW 2015, Australia
[4] Univ Western Australia, Dept Comp Sci & Software Engn, Crawley, WA 6009, Australia
基金
新加坡国家研究基金会;
关键词
Feature extraction; Analytical models; Twitter; Machine learning; Australia; Forestry; Detectors; Feature analysis; machine learning; rumor detection; social media; ensemble solution;
D O I
10.1109/ACCESS.2020.3016664
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There have been many efforts to detect rumors using various machine learning (ML) models, but there is still a lack of understanding of their performance against different rumor topics and available features, resulting in a significant performance degrade against completely new and unseen (unknown) rumors. To address this issue, we investigate the relationship between ML models, features, and rumor topics to select the best rumor detection model under specific conditions using 13 different ML models. Our experiment results demonstrate that there is no clear winner among the ML models in all different rumor topics with respect to the detection performance. To overcome this problem, a possible way is to use an ensemble of ML models. Although previous work presents an improved detection of rumors using ensemble solutions (ES), their evaluation did not consider detecting unknown rumors. Further, they did not present nor evaluate the configuration of the ES to ensure that it indeed performs better than using a single ML model. Based on these observations, we propose to evaluate the use of an ES by examining their unknown rumor detection performance compared with single ML models but as well as different configurations of the ESes. Our experimental results using real-world datasets found that an ES of Random Forest, XGBoost and Multilayer perceptron overall produced the best F1 score of 0.79 for detecting unknown rumors, a significant improvement compared with a single best ML model which only achieved a 0.58 F1 score. We also showed that not all ESes are the same, with significantly degraded detection and large variations in performance when different ML models are used to construct the ES. Hence, it is infeasible to rely on any single ML model-based rumor detector. Finally, our solution also performed better than other recent detectors, such as eventAI and NileTMRG that performed similar to using a single ML model - making it a much more attractive solution to detect unknown rumors in practice.
引用
收藏
页码:150709 / 150724
页数:16
相关论文
共 43 条
[1]  
[Anonymous], 2019, 2019 IEEE INT C INT
[2]  
[Anonymous], 2018, AER ADV ENG RES
[3]  
[Anonymous], 2015, 2015 18 INT C EL
[4]  
[Anonymous], 2017, SOCIAL INFORM
[5]  
[Anonymous], 2015, 2015 12 WEB INF, DOI DOI 10.1109/WISA.2015.19
[6]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[7]   Hyperopt: A Python library for model selection and hyperparameter optimization [J].
Bergstra, James ;
Komer, Brent ;
Eliasmith, Chris ;
Yamins, Dan ;
Cox, David D .
Computational Science and Discovery, 2015, 8 (01)
[8]   Automatically Identifying Fake News in Popular Twitter Threads [J].
Buntain, Cody ;
Golbeck, Jennifer .
2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2017, :208-215
[9]  
Castillo C., 2011, P 20 INT C WORLD WID, P675, DOI DOI 10.1145/1963405.1963500
[10]  
Chen T., 2018, P PAC AS C KNOWL DIS, P40