Continual Learning for Fake News Detection from Social Media

被引:5
作者
Han, Yi [1 ]
Karunasekera, Shanika [1 ]
Leckie, Christopher [1 ]
机构
[1] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II | 2021年 / 12892卷
关键词
Fake news detection; Continual learning; Social media; FALSE NEWS;
D O I
10.1007/978-3-030-86340-1_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prevalence of fake news over social media has a profound impact on justice, public trust and society as a whole. Although significant effort has been applied to mitigate its negative impact, our study shows that existing fake news detection algorithms may perform poorly on new data. In other words, the performance of a model trained on one dataset degrades on another and potentially vastly different dataset. Considering that in practice a deployed fake news detection system is likely to observe unseen data, it is crucial to solve this problem without re-training the model on the entire data from scratch, which would become prohibitively expensive as the data volumes grow. An intuitive solution is to further train the model on the new dataset, but our results show that this direct incremental training approach does not work, as the model only performs well on the latest dataset it is trained on, which is similar to the problem of catastrophic forgetting in the field of continual learning. Instead, in this work, (1) we first demonstrate that with only minor computational overhead, balanced performance can be restored on both existing and new datasets, by utilising Gradient Episodic Memory (GEM) and Elastic Weight Consolidation (EWC)-two techniques from continual learning. (2) We improve the algorithm of GEM so that the drop in model performance on the previous task can be further minimised. Specifically, we investigate different techniques to optimise the sampling process for GEM, as an improvement over random selection as originally designed. (3) We conduct extensive experiments on two datasets with thousands of labelled news items to verify our results.
引用
收藏
页码:372 / 384
页数:13
相关论文
共 38 条
[1]  
ang Y., 2018, ARXIV180600749
[2]  
Bian T, 2020, AAAI CONF ARTIF INTE, V34, P549
[3]  
Chen Z., 2019, REVISITING GRADIENT
[4]   DETERRENT: Knowledge Guided Graph Attention Network for Detecting Healthcare Misinformation [J].
Cui, Limeng ;
Seo, Haeseung ;
Tabar, Maryam ;
Ma, Fenglong ;
Wang, Suhang ;
Lee, Dongwon .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :492-502
[5]  
Goodfellow I. J., 2014, ial examples. CoRR abs/1412.6572
[6]  
Jin ZW, 2016, AAAI CONF ARTIF INTE, P2972
[7]   Novel Visual and Statistical Image Features for Microblogs News Verification [J].
Jin, Zhiwei ;
Cao, Juan ;
Zhang, Yongdong ;
Zhou, Jianshe ;
Tian, Qi .
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) :598-608
[8]  
Kai Shu, 2017, ACM SIGKDD Explorations Newsletter, V19, P22, DOI 10.1145/3137597.3137600
[9]   Overcoming catastrophic forgetting in neural networks [J].
Kirkpatricka, James ;
Pascanu, Razvan ;
Rabinowitz, Neil ;
Veness, Joel ;
Desjardins, Guillaume ;
Rusu, Andrei A. ;
Milan, Kieran ;
Quan, John ;
Ramalho, Tiago ;
Grabska-Barwinska, Agnieszka ;
Hassabis, Demis ;
Clopath, Claudia ;
Kumaran, Dharshan ;
Hadsell, Raia .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) :3521-3526
[10]  
Liu Y, 2018, AAAI CONF ARTIF INTE, P354