Continual Learning for Fake News Detection from Social Media

被引:6
作者
Han, Yi [1 ]
Karunasekera, Shanika [1 ]
Leckie, Christopher [1 ]
机构
[1] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II | 2021年 / 12892卷
关键词
Fake news detection; Continual learning; Social media; FALSE NEWS;
D O I
10.1007/978-3-030-86340-1_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prevalence of fake news over social media has a profound impact on justice, public trust and society as a whole. Although significant effort has been applied to mitigate its negative impact, our study shows that existing fake news detection algorithms may perform poorly on new data. In other words, the performance of a model trained on one dataset degrades on another and potentially vastly different dataset. Considering that in practice a deployed fake news detection system is likely to observe unseen data, it is crucial to solve this problem without re-training the model on the entire data from scratch, which would become prohibitively expensive as the data volumes grow. An intuitive solution is to further train the model on the new dataset, but our results show that this direct incremental training approach does not work, as the model only performs well on the latest dataset it is trained on, which is similar to the problem of catastrophic forgetting in the field of continual learning. Instead, in this work, (1) we first demonstrate that with only minor computational overhead, balanced performance can be restored on both existing and new datasets, by utilising Gradient Episodic Memory (GEM) and Elastic Weight Consolidation (EWC)-two techniques from continual learning. (2) We improve the algorithm of GEM so that the drop in model performance on the previous task can be further minimised. Specifically, we investigate different techniques to optimise the sampling process for GEM, as an improvement over random selection as originally designed. (3) We conduct extensive experiments on two datasets with thousands of labelled news items to verify our results.
引用
收藏
页码:372 / 384
页数:13
相关论文
共 38 条
[11]  
Lopez-Paz D, 2017, ADV NEUR IN, V30
[12]  
Lu YJ, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P505
[13]   Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning [J].
Ma, Jing ;
Gao, Wei ;
Wong, Kam-Fai .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :708-717
[14]  
McCloskey M., 1989, Psychology of Learning and Motivation, VVolume 24, P109
[15]  
Mirzasoleiman B, 2015, AAAI CONF ARTIF INTE, P1812
[16]  
Monti F., ARXIV190206673
[17]   A Review of Relational Machine Learning for Knowledge Graphs [J].
Nickel, Maximilian ;
Murphy, Kevin ;
Tresp, Volker ;
Gabrilovich, Evgeniy .
PROCEEDINGS OF THE IEEE, 2016, 104 (01) :11-33
[18]   Continual lifelong learning with neural networks: A review [J].
Parisi, German I. ;
Kemker, Ronald ;
Part, Jose L. ;
Kanan, Christopher ;
Wermter, Stefan .
NEURAL NETWORKS, 2019, 113 :54-71
[19]  
Perez-Rosas Veronica, 2018, P 27 INT C COMP LING, P3391
[20]   False News On Social Media: A Data-Driven Survey [J].
Pierri, Francesco ;
Ceri, Stefano .
SIGMOD RECORD, 2019, 48 (02) :18-32